Selection Vs Transmission Decomposition¶
Core Idea¶
Selection vs transmission decomposition is the structural pattern in which a change in the population mean — or weighted average — of any trait, productivity, score, or return is split into two structurally distinct contributions: the covariance of the trait with each unit's growth or replication weight, which is the change produced by differential weighting of existing units; and the within-unit transformation weighted by share, which is the change produced by the units themselves changing. The decomposition is exact whenever the population is partitioned into measurable units with measurable weights and measurable trait values; it is recursive, since each term can be re-decomposed by sub-grouping; and the two terms typically carry different intervention implications.
Formally, given units with trait values, weights such as fitness or share, and within-unit changes, the change in the weighted mean equals the covariance of weight with trait, divided by mean weight, plus the weighted expectation of within-unit change. The first term — selection — aggregates differential weighting; the second — transmission — aggregates within-unit change weighted by share. The Price equation is the canonical formal statement of this decomposition. The defining commitment is that an apparent average change in a heterogeneous population can be produced by two structurally distinct mechanisms whose intervention implications diverge, and that the decomposition is the diagnostic that tells them apart. A bare aggregate hides which mechanism is operating; the prime is the exact, identity-backed cut that recovers it.
How would you explain it like I'm…
Grew vs. Swapped
Who Counts More Vs Who Changed
Reweighting Versus Changing
Structural Signature¶
the partitioned population — the unit weights — the unit trait values — the within-unit changes — the selection term (covariance of weight with trait) — the transmission term (share-weighted within-unit change) — the exact-identity and recursion invariants
A configuration admits the decomposition when each of the following holds:
- A partitioned population. A population is divided into measurable units — individuals, firms, asset classes, regions, cohorts, models — over which a weighted mean is taken.
- Unit weights. Each unit carries a measurable weight — fitness, capital employed, portfolio share, population count — governing its contribution to the mean.
- Unit trait values. Each unit has a measurable trait, productivity, score, or return whose population mean is the quantity of interest.
- Within-unit changes. Each unit's trait can change within itself between the two observations, measurably.
- A selection term. One contribution is the covariance of weight with trait (over mean weight): the change produced by differential weighting of existing units — reallocation, composition shift, allocation effect.
- A transmission term. The other contribution is the share-weighted expectation of within-unit change: the change produced by units changing themselves — within-firm growth, within-cohort learning, fine-tuning.
- Exact-identity and recursion invariants. The two terms sum exactly to the observed mean change — an identity requiring no functional-form assumption, fitting, or residual — and each term re-decomposes recursively by sub-grouping. The two terms typically imply divergent interventions (work entry/exit/reallocation margins versus within-unit improvement margins).
These components compose into an exact attribution: a weighted-mean change splits identically into a selection term (differential weighting) and a transmission term (within-unit change), recursively and assumption-free — the diagnostic that tells which mechanism, and which lever, is operating.
What It Is Not¶
- Not selection bias (see
selection_bias).selection_biasis a distortion — a non-representative sample misleading inference; this prime's selection term is an exact, intended accounting contribution (covariance of weight with trait). One is an error to correct; the other is a real mechanism to measure. - Not generic decomposition (see
decomposition).decompositionbreaks any whole into parts by some scheme; this prime is the specific, exact, residual-free split of a weighted-mean change into differential-weighting and within-unit terms. It is one canonical decomposition, identity-backed, not the general method. - Not competition (see
competition).competitionis a substantive process by which units contend; the selection term accounts for the net reweighting competition (among other forces) produces, without modeling the contest. The prime is arithmetic attribution, not a mechanism of contention. - Not a causal model. The decomposition is an exact accounting identity, not an explanation of why selection or transmission occurred. It partitions the arithmetic of the change; it does not certify that pushing the selection lever will reproduce the gain.
- Not a statement about the distribution. The identity decomposes the weighted mean only; it is silent about variance, skew, and tails. A population can hold its mean while its dispersion explodes, and the prime will not surface it.
- Common misclassification. Reading an aggregate rise as evidence that the units improved (transmission) when it was pure reweighting (selection), or vice versa, and prescribing the wrong lever. Catch it by computing both terms before choosing an intervention; the headline number alone cannot tell which mechanism moved it.
Broad Use¶
The same decomposition recurs across substrates that share nothing except the structural shape of partitioned units, weights, and trait values. In evolutionary biology it is the Price equation itself, splitting change in mean phenotype into selection (covariance of phenotype with fitness) and transmission (within-individual change including mutation and developmental noise), the foundation of multilevel selection theory. In firm productivity dynamics it is the Foster-Haltiwanger-Krizan decomposition: aggregate productivity change splits into within-firm growth plus between-firm reallocation plus entry-and-exit terms — structurally the Price equation applied to firms. In portfolio attribution it is Brinson attribution, decomposing total return into an allocation effect (overweighting better-performing classes) and a selection effect (within-class security selection) — the same cut under a different name. In organizational learning, the change in average employee skill splits into selection (high-skill hires and promotions, low-skill exit) plus within-person learning. The same shape recurs in regional and demographic dynamics (composition shift across regions plus within-region change, as in life-expectancy and mortality decompositions), in educational testing (cohort composition shift plus within-student learning), in cultural and memetic dynamics, and in ML fleet effects (high-accuracy models retained plus within-model fine-tuning). In every case an apparent average change can be produced by two distinct mechanisms whose intervention implications diverge, and the decomposition is the diagnostic.
Clarity¶
The prime makes visible a question the bare aggregate hides: is this average change happening because the units changed, or because the weighting of units changed? The same five-percent rise in aggregate productivity can be entirely within-unit improvement (every firm got better), entirely reallocation (resources moved toward the already-best firms with no within-firm change), or any mix — and the intervention implications are completely different. A selection-dominated change calls for working the entry-exit and reallocation margins: easier capital flow, lower switching costs, letting bad units fail, recruiting and promoting on the trait. A transmission-dominated change calls for working the within-unit improvement margins: training, R&D, process improvement, learning capacity inside existing units. A diagnosis that conflates the two can prescribe the wrong intervention with full confidence — investing in within-firm training when reallocation dominates is wasted, and investing in mobility and selection pressure when within-firm transformation dominates is equally wasted. The clarity the prime delivers is therefore not interpretive but mechanical: an exact identity converts an ambiguous headline number into two separately attributable contributions, each with its own lever, so the choice of intervention follows from the decomposition rather than from a guess about what the aggregate "means."
Manages Complexity¶
The decomposition compresses a wide class of "did the population mean change because of X or Y" questions into a single structural cut. It gives the analyst three measurable inputs — unit weights, unit trait values, within-unit change — and one structural output, the two-term decomposition. And it supports recursive application: each term can be re-decomposed by sub-grouping, producing a Russian-doll of attribution — between-industry versus within-industry, between-region versus within-region, between-cohort versus within-cohort — whenever the substrate has a natural hierarchy. This recursion is what lets the prime scale to genuinely complex populations: rather than confronting a tangle of overlapping causes, the analyst applies the same exact cut at each level of grouping and reads off where the change is concentrated. Because the decomposition is an identity rather than a model, it requires no assumptions about functional form, no fitting, and no residual term to explain away; the two contributions sum exactly to the observed change. That exactness is the source of its complexity-management power: it guarantees that the attribution is complete and the leverage points are exhaustive, so the analyst can be confident that selection and transmission together account for the whole of the observed mean change and that working those two margins is working all of it.
Abstract Reasoning¶
The prime supports reasoning about what is changing as a separate question from what the mean is doing. It exposes a fixed diagnostic chain: identify the population, identify the units, identify the trait, identify the weight, measure the within-unit change, compute the covariance, and report both terms. It supports counterfactual reasoning of a precise kind — if selection had been zero, what would the within-unit-only mean change have been; if within-unit change had been zero, what would pure selection have produced — and because the decomposition is exact, those counterfactuals are not estimates but the two halves of the identity. This is causal-attribution reasoning at the population level that bare aggregate-change reasoning simply cannot perform, because the aggregate destroys the information needed to separate the mechanisms. The abstract move the prime licenses is to treat any weighted-mean change as decomposable in principle, and to refuse to interpret a headline movement until the cut has been made. That refusal is itself a reasoning discipline: it blocks the common error of reading an aggregate rise as evidence that the units improved, when it may be entirely a reweighting, and it does so not by appeal to judgment but by appeal to an identity that holds whenever the ingredients are measurable.
Knowledge Transfer¶
The prime is genuinely prime-shaped: it is a generator that produces its named instances — the Price equation, the FHK decomposition, Brinson attribution, the Arriaga decomposition — as substrate-specific applications of one mathematical identity. The vocabulary travels unmodified because it is fully relational: selection term, transmission term, reallocation effect, within-unit effect, covariance with weight, weighted within-unit average, attribution decomposition. An evolutionary biologist analyzing multilevel selection, an economist analyzing aggregate productivity, a portfolio manager doing return attribution, a labor economist analyzing workforce-skill dynamics, a demographer analyzing mortality change, an education researcher analyzing test-score change, and an ML-ops engineer analyzing fleet accuracy are doing structurally the same work, and the intervention vocabulary transfers with the decomposition: address selection or transmission according to which dominates the observed change. The role-mapping is fixed across all of these: unit maps to individual / firm / asset class / region / cohort / model; weight maps to fitness / capital employed / portfolio share / population count; trait maps to phenotype / productivity / return / score / accuracy; selection maps to reallocation / allocation effect / composition shift; transmission maps to within-firm growth / security selection / within-cohort learning / fine-tuning. Because the same identity holds in every substrate where the ingredients exist, a practitioner who has internalized the within-versus-between cut in one field can apply it immediately in another, reading FHK as the Price equation for firms or Brinson as the Price equation for portfolios, and reaching for the same two-margin intervention logic without re-deriving the mathematics.
Examples¶
Formal/abstract¶
The Price equation applied to a phenotype under natural selection is the prime's canonical formal statement. The partitioned population is a set of organisms (or types) indexed \(i\); each carries a unit weight \(w_i\), its fitness (number of offspring), and a unit trait value \(z_i\), the phenotype whose population mean is of interest. Between generations each lineage can also change within itself by amount \(\Delta z_i\) (mutation, developmental noise, transmission error). The Price equation states the change in mean phenotype exactly: $\(\bar{w}\,\Delta\bar{z} = \operatorname{Cov}(w_i, z_i) + \mathbb{E}(w_i\,\Delta z_i).\)$ The first term, \(\operatorname{Cov}(w_i, z_i)\), is the selection term: it is positive precisely when high-trait units have above-average fitness, capturing change produced by differential weighting of existing types — no individual changed, the population's composition shifted toward the fitter trait. The second term, \(\mathbb{E}(w_i\,\Delta z_i)\), is the transmission term: the fitness-weighted average of within-lineage change, capturing change produced by units changing themselves. The exact-identity invariant is the whole point — these two terms sum to the total mean change with no residual, requiring no model fit or functional-form assumption, because the equation is an algebraic identity. The recursion invariant is what founds multilevel selection theory: the transmission term \(\Delta z_i\) can itself be expanded by the Price equation at the sub-population level (selection within groups), so a between-group selection term and a within-group selection term separate cleanly — group selection and individual selection become two terms of one nested identity rather than rival verbal theories.
Mapped back: Organisms are the partitioned units, fitness is the weight, phenotype is the trait, \(\operatorname{Cov}(w,z)\) is selection (differential weighting), \(\mathbb{E}(w\,\Delta z)\) is transmission (within-lineage change), and the residual-free sum plus the recursive group/individual split are the exact-identity and recursion invariants.
Applied/industry¶
Aggregate productivity accounting (the FHK decomposition) and portfolio return attribution (Brinson) are the same identity in two industries that never cite the Price equation. In firm productivity dynamics, the partitioned population is the firms in an industry; the weight is each firm's share of employment or capital; the trait is firm-level productivity; and the quantity of interest is the change in industry-aggregate productivity. The Foster-Haltiwanger-Krizan decomposition splits that change into a within-firm term (the share-weighted average of firms improving their own productivity — the transmission term, units changing themselves) plus a between-firm/reallocation term (resources shifting toward already-more-productive firms — the selection term, differential weighting) plus entry-and-exit terms (productive entrants replacing unproductive exiters — selection at the population margin). The divergent-intervention payoff the prime promises is concrete and policy-relevant: if a country's productivity growth is transmission-dominated, the lever is within-firm improvement — R&D subsidies, training, process upgrading; if it is selection-dominated (reallocation), the lever is mobility — easing capital flow, lowering barriers to firm entry and exit, letting unproductive firms fail. A government that funds within-firm training when the data say reallocation dominates wastes the money, which is exactly the conflation the decomposition prevents. Brinson portfolio attribution runs the identical cut on investment returns: the units are asset classes (or sectors); the weight is portfolio allocation; the trait is each class's return; and total active return splits into an allocation effect (over- or under-weighting classes relative to a benchmark — selection/differential weighting) plus a selection effect (picking better securities within each class — transmission/within-unit change). A manager reads which term drove performance to know whether their edge is asset allocation or security picking — the same two-margin diagnosis. The transfer the prime makes rigorous is that FHK is the Price equation for firms and Brinson is the Price equation for portfolios: identical algebra, identical within-versus-between logic, identical "which margin do I work?" conclusion.
Mapped back: Firms and asset classes are partitioned units; employment share and portfolio allocation are weights; firm productivity and class return are traits; the reallocation/allocation effect is the selection term and the within-firm/security-selection effect is the transmission term — FHK and Brinson are one exact decomposition across an economics and a finance substrate.
Structural Tensions¶
T1 — Selection Lever versus Transmission Lever (Sign/Direction). The decomposition's whole payoff is that the two terms imply different interventions — selection works the entry/exit/reallocation margins (mobility, letting bad units fail), transmission works the within-unit improvement margins (training, R&D). The failure mode is prescribing one lever for a change driven by the other: funding within-firm training when reallocation dominates, or easing mobility when within-unit transformation dominates — confident, expensive, and wasted. Diagnostic: compute both terms before choosing a lever; an aggregate rise read as "the units improved" when it was pure reweighting (or vice versa) sends resources to the margin that is not moving, which is exactly the conflation the cut exists to prevent.
T2 — Exact Identity versus Measurement Validity (Measurement). The decomposition is an algebraic identity — it sums to the observed change with no residual, no functional-form assumption — but that exactness is conditional on the weights, traits, and within-unit changes being measured correctly and consistently across the two observations. The failure mode is letting the identity's mathematical certainty launder dirty inputs: mis-measured weights or non-comparable trait definitions produce two terms that sum perfectly to a wrong total. Diagnostic: ask whether the ingredients are measured the same way at both time points; the identity guarantees the parts sum to the whole, not that the whole is right, so exactness is no defense against a measurement error that the clean decomposition will faithfully partition.
T3 — Population Partition Choice as Substantive (Scopal). The cut presupposes a partition into units, but which partition is chosen determines what counts as selection versus transmission — change that is "within-unit" at the firm level becomes "between-unit selection" when firms are split into plants. The failure mode is treating the partition as a neutral preliminary, so a result ("transmission dominates") is really an artifact of the grouping granularity. Diagnostic: ask whether the same change would decompose differently under a finer or coarser partition; if it would, the selection/transmission split is partition-relative, and a claim about which mechanism dominates must specify the level of grouping or it is reporting a choice, not a fact.
T4 — Recursive Re-Decomposition versus Stopping Level (Scalar). Each term re-decomposes by sub-grouping — between-industry within-industry, between-region within-region — a Russian-doll that can continue to any depth. The failure mode is stopping at the wrong level: halting at the top cut and missing that the within-term hides a large between-subgroup selection (group selection masquerading as transmission), or recursing past the level where the data support stable estimates. Diagnostic: ask whether the dominant term still looks homogeneous when split one level finer; if re-decomposing the transmission term reveals a big within-level selection component, the headline attribution was an aggregation artifact, and the right stopping level is set by where further splitting stops changing the story or stops being measurable.
T5 — Decomposition versus Causal Mechanism (Scopal). The cut is an exact accounting attribution, not a causal explanation — it says how much of the mean change is reweighting versus within-unit change, not why either occurred. The failure mode is over-reading the identity as causal: concluding that "selection caused the gain" and that pushing the selection lever will reproduce it, when the covariance of weight with trait may itself be driven by a confounder the decomposition cannot see. Diagnostic: ask whether the selection term reflects a manipulable mechanism or a passive correlation; the decomposition cleanly attributes the arithmetic of the change, but treating its terms as causal levers without an independent causal story risks intervening on a correlation that the accounting identity was never designed to certify.
T6 — Mean-Change Focus versus Distributional Change (Scopal). The identity decomposes the change in the weighted mean, and is silent about variance, skew, or tail behavior — a population can hold its mean while its dispersion explodes, or shift its mean while a vulnerable subgroup moves the opposite way. The failure mode is reading a decomposed mean change as a complete account of what happened to the population, missing distributional movements the mean averages away. Diagnostic: ask whether the question is really about the mean or about the spread/tails; if a subgroup's adverse movement is hidden inside a favorable aggregate, the selection/transmission cut on the mean will not surface it, and a distributional analysis is needed alongside the mean decomposition, which answers only "what moved the average."
Structural–Framed Character¶
Selection Vs Transmission Decomposition sits at the structural end of the structural–framed spectrum, consistent with its frontmatter label and an aggregate of 0.0: it is a mathematical identity — a weighted-mean change splitting exactly into a covariance-of-weight-with-trait term and a share-weighted within-unit term — whose vocabulary is fully relational and travels unmodified.
Every diagnostic reads structural, and the prime is among the catalog's purest cases. The vocabulary travels with no residue: selection term, transmission term, reallocation effect, within-unit effect, covariance with weight, and weighted within-unit average describe the Price equation in evolutionary biology, the Foster-Haltiwanger-Krizan decomposition in firm productivity, and Brinson attribution in finance as one identity — FHK is the Price equation for firms and Brinson is the Price equation for portfolios, identical algebra across substrate. The prime carries no evaluative weight: the two terms are an accounting attribution, neither good nor bad, agnostic about whether reallocation or within-unit change is desirable. Its origin is a formal identity requiring no functional-form assumption and no residual, not any human institution; the exact-sum and recursion invariants are properties of the arithmetic. It applies wherever partitioned units, weights, and trait values exist — phenotype under natural selection as readily as portfolio returns or mortality decompositions — so it is not human-practice-bound. And invoking it merely recognizes the two-mechanism split already latent in any weighted-mean change rather than importing a frame. On every axis the prime reads structural, exactly as the 0.0 aggregate records.
Substrate Independence¶
Selection vs Transmission Decomposition is a maximally substrate-independent prime — composite 5 / 5 on the substrate-independence scale. Its core is an exact algebraic identity — the change in a weighted population mean split into a covariance-of-trait-with-weight (selection) term plus a within-unit-change-weighted-by-share (transmission) term — and that identity holds whenever a population is partitioned into measurable units with measurable weights and trait values, making the structural abstraction maximal. The domain breadth is wide and the structural force identical: evolutionary biology (the Price equation), firm productivity dynamics (the Foster-Haltiwanger-Krizan decomposition), portfolio attribution (Brinson allocation-versus-selection effects), organizational learning (hiring/promotion selection plus within-person learning), regional and demographic dynamics (life-expectancy and mortality decompositions), educational testing (cohort composition plus within-student learning), cultural and memetic dynamics, and machine-learning fleet effects. The transfer evidence is exceptionally strong because what recurs is literally the same formal identity under different names — the Price equation, the FHK decomposition, and Brinson attribution are the identical cut applied to genes, firms, and securities — so the decomposition is recognized rather than translated wherever an aggregate change in a heterogeneous population must be separated into reweighting versus within-unit change.
- Composite substrate independence — 5 / 5
- Domain breadth — 5 / 5
- Structural abstraction — 5 / 5
- Transfer evidence — 5 / 5
Relationships to Other Primes¶
Parents (1) — more general patterns this builds on
-
Selection Vs Transmission Decomposition is a kind of Decomposition
An EXACT, residual-free specialization of decomposition: a weighted-mean change split identically into covariance-of-weight-with-trait (selection) + share-weighted within-unit change (transmission). The Price equation is its generator; FHK and Brinson are the same identity for firms/portfolios. Dossier-confirmed specialization edge.
Path to root: Selection Vs Transmission Decomposition → Decomposition
Neighborhood in Abstraction Space¶
Selection Vs Transmission Decomposition sits in a moderately populated region (47th percentile for distinctiveness): it has near-neighbors but no dense thicket of synonyms.
Family — Aggregation & Scale Artifacts (16 primes)
Nearest neighbors
- Variance Bounds Selection Response — 0.75
- Modifiable Areal Unit Problem — 0.73
- Selection Bias — 0.71
- Diversification — 0.70
- Natural Selection — 0.70
Computed from structural-signature embeddings · 2026-06-14
Not to Be Confused With¶
The most important confusion to dispel — and the one the embedding flags at an unusually high similarity of 0.967 — is with selection_bias, which shares the word "selection" and the concern with how a population's composition shapes an observed quantity, but means something structurally opposite. selection_bias is a distortion of inference: a sample is drawn non-representatively (survivorship, self-selection, attrition), so a statistic computed on it misleads about the underlying population, and the corrective is to detect and remove the bias. The selection term in this prime is not a distortion at all — it is an exact, intended, real contribution to the change in the population mean, the covariance of weight with trait, which faithfully captures the genuine effect of differential weighting (reallocation, composition shift) on the aggregate. One is an error to be corrected; the other is a mechanism to be measured. The confusion is consequential in both directions. Reading the prime's selection term as a "bias" tempts an analyst to correct it away — to treat reallocation toward better units as a nuisance to be purged from the productivity number, when reallocation is a real and often desirable driver of aggregate change. Conversely, importing the prime's clean exactness into a genuine selection-bias situation tempts the analyst to trust a sample-distorted estimate because the arithmetic "decomposes cleanly," when the inputs were drawn non-representatively. The discriminating question is whether the weighting reflects a real change in the population's composition (the prime's selection, to be measured) or a non-representative sampling of a fixed population (selection bias, to be corrected). The prime's T2 tension — that the identity faithfully partitions even dirty inputs — is precisely where a hidden selection bias in measurement would be laundered by the clean decomposition.
The prime is also confused with decomposition in the generic sense, since it is a decomposition and is routinely described as one. The distinction is that this prime is a specific, canonical, exact decomposition, not the general method of breaking a whole into parts. decomposition as a general prime covers any principled partition of a system or quantity into components by some chosen scheme — functional, temporal, hierarchical, modal — and most such partitions are approximate, scheme-dependent, or leave a residual. This prime is the particular split of a weighted-mean change into exactly two terms (covariance-of-weight-with-trait and share-weighted-within-unit-change) that sum to the observed change with no residual and no functional-form assumption, because it is an algebraic identity rather than a modeling choice. That exactness and that specific two-way cut are the content; a practitioner who treats it as "just a decomposition" loses what makes it powerful — the guarantee that the two terms are exhaustive (selection and transmission together account for the whole change) and assumption-free. Conversely, treating every useful decomposition as this prime over-claims exactness for partitions (a variance decomposition with interaction terms, a causal decomposition with assumptions) that do not share the Price-equation identity structure. The mark of this prime specifically is the residual-free covariance-plus-within-unit form, recursively re-applicable by sub-grouping.
A subtler confusion is with competition, because the selection term so often arises from competition among units, and the two get fused in causal storytelling. competition is a substantive process: units contend for a limited resource (market share, fitness, capital), and the contest's dynamics determine who grows and who shrinks. The prime's selection term is an accounting summary of the net reweighting that results — it measures how much the change in the mean is attributable to differential weighting, without modeling why the weights shifted or what process produced them. Competition is one process that generates a non-zero selection term, but so are non-competitive reallocations (a planner shifting capital, a demographic shift in regional populations, a portfolio manager rebalancing) that involve no contention at all. The prime is deliberately agnostic about mechanism: it tells you reweighting occurred and how much it mattered, not that competition caused it. This is exactly the prime's T5 tension — the decomposition is an accounting attribution, not a causal explanation — and confusing the selection term with competition leads to the error of assuming that a large selection term implies a competitive process and that intensifying competition will reproduce the gain, when the reweighting may have come from an entirely non-competitive source the accounting cannot see.
These distinctions matter because each protects a different aspect of the prime's correct use. Holding it apart from selection_bias prevents an analyst from "correcting away" a real mechanism or trusting a sample-distorted estimate; holding it apart from generic decomposition preserves the exactness and exhaustiveness that are its whole value; holding it apart from competition blocks the slide from accounting attribution to unwarranted causal claim. The prime's content is narrow and precise — an exact, residual-free, recursively-applicable split of a weighted-mean change into differential-weighting and within-unit terms — and its power comes entirely from respecting that narrowness: it tells you which margin moved the average, and therefore which lever to work, without pretending to tell you why the margin moved or what happened to the rest of the distribution.
Solution Archetypes¶
No catalogued solution archetypes reference this prime yet.