Simpson's Paradox¶
Core Idea¶
A relationship runs one direction inside every subgroup yet the opposite direction in the aggregate, because the subgroups differ along a confounder that is silently mixed away on pooling. The aggregate is correct about its own counts but causally misleading — no subpopulation shows the aggregate direction — and the fix is not more data but the right partition.
How would you explain it like I'm…
The Backwards Total
When the Aggregate Lies
Broad Use¶
- Medicine: a treatment beats control among mild and among severe patients yet loses overall, having been given preferentially to severe cases.
- Public policy: a programme raises scores in every subgroup while aggregate scores fall, because it attracts harder-to-serve participants.
- Sports analytics: a player out-hits a rival every season but trails on career average, their seasons clustering in low-scoring eras.
- Pay-equity analysis: men out-earn women in every department yet the reverse holds in the pool, because departments differ in scale and composition.
- ML fairness audits: a classifier is calibrated within every demographic subgroup but miscalibrated in aggregate, or the reverse.
- Business KPIs: a per-customer conversion rate rises every quarter while the annual aggregate falls due to mix shift.
Clarity¶
It makes visible the distinction between marginal and conditional associations, and between pooling and stratifying — establishing that no level of aggregation is privileged, so the choice between them is causal, not statistical, and the problem is wrong question-to-number matching, not wrong numbers.
Manages Complexity¶
A family of cross-substrate surprises and failure modes collapses to one diagnostic — is there a third variable along which the groups differ, and is the comparison made across it rather than within it? — converting an open-ended worry into a bounded stratify-and-compare procedure.
Abstract Reasoning¶
It licenses decomposing the aggregate into within-group effects plus a between-group composition term: when composition dominates, the aggregate flips — a story about who is in which group, not what happens within them — and warns of the symmetric over-conditioning mistake (collider bias).
Knowledge Transfer¶
- Statistics to program evaluation: the stratify-before-pooling habit is Simpson's-paradox prophylaxis ("what does the within-group story look like?").
- To ML fairness: group-conditional versus marginal calibration is structurally stratified versus pooled associations.
- To business: reporting "same-store sales" alongside totals is a deliberate defence against composition-shift, structurally identical to reporting stratified alongside pooled effects.
Example¶
The Berkeley graduate-admissions case: pooled, men were admitted at a higher rate than women, suggesting bias; stratified by department, women were admitted at equal-or-higher rates — because women applied disproportionately to competitive low-admit departments, so the between-department composition term dominated.
Relationships to Other Primes¶
Parents (3) — more general patterns this builds on
- Simpson's Paradox is a kind of Confounding — Simpson's paradox is the most dramatic SYMPTOM of confounding — the case severe enough to flip the SIGN between aggregate and every subgroup. Every Simpson reversal is a confounding case; most confounding is not a Simpson reversal. A specialization/extreme-case of confounding.
- Simpson's Paradox is a kind of, typical Modifiable Areal Unit Problem — The file: Simpson's paradox is the SIGN-REVERSAL special case of MAUP's broader partition-dependence (the extreme corner where the partition shift crosses zero); MAUP generates quantitative drift even without reversal. Tentative reparent — MAUP as the broader parent. simpsons_paradox is a candidate (R2-016-07).
- Simpson's Paradox presupposes, typical Aggregation — It is the confounded FAILURE MODE of the aggregation operation — pooling across a confounder is a modelling choice that can flip a direction; presupposes aggregation as the collapsing step.
Path to root: Simpson's Paradox → Confounding → Bias
Not to Be Confused With¶
- Simpson's Paradox is not Confounding in general because confounding spans every severity whereas the paradox is the most dramatic symptom — a full sign reversal between aggregate and every subgroup.
- Simpson's Paradox is not Selection Bias because selection bias arises from how units enter the sample whereas the paradox concerns how subgroups are pooled among fully-observed units.
- Simpson's Paradox is not Aggregation as such because aggregation is the neutral combining operation whereas the paradox is its confounded failure mode — the warning that pooling across a confounder carries causal commitments.