Goodhart's Law¶
Core Idea¶
Binding optimization pressure on a proxy degrades the proxy's correlation with the construct it was meant to indicate. Once consequences attach to a measure, agents take the cheapest path to move the proxy, which almost never coincides with the path that would move the underlying thing — and the proxy comes to index optimization effort rather than the construct.
How would you explain it like I'm…
Chasing the Sticker
When the Stand-In Breaks
The Proxy Trap
Broad Use¶
- Education: test scores drive funding and pay, so the easiest path becomes test-prep and narrowed curriculum while scores climb and learning does not.
- Healthcare: four-hour-wait and door-to-balloon targets distort triage and coding.
- Science: citation counts and p-values, tied to careers, breed citation cartels, p-hacking, and a replication crisis.
- AI alignment: trained policies maximize a reward signal in unintended ways — reward hacking is the canonical failure mode.
- Finance and monetary policy: targeting a monetary aggregate collapses the very regularity that motivated targeting it (the original case).
- Management: sales quotas and first-call-resolution metrics invite channel-stuffing and ticket-gaming.
Clarity¶
Distinguishes measurement (passive, low-stakes diagnosis) from incentivization (high-stakes, weighted into consequence): a metric can be excellent for one and catastrophic for the other.
Manages Complexity¶
Compresses Campbell's law, the McNamara fallacy, surrogation, reward hacking, and p-hacking into one mechanism, and bundles a portfolio of remedies — scorecards, held-out evaluation, rotation, decoupling — rather than a single domain fix.
Abstract Reasoning¶
Predicts which proxies collapse first (those with a wide, easily exploited wedge) and licenses robustness via multi-objective design — make the target a joint motion of several loosely correlated proxies so the wedge becomes a multidimensional intersection.
Knowledge Transfer¶
- Public administration: audit-and-investigate and multiple-measures accountability.
- AI: held-out reward evaluation and reward ensembling read as a re-derivation of those same remedies.
- Management accounting: the surrogation effect is the same proxy-collapse under a different name.
Example¶
An emergency department under a four-hour-wait target rushes disposition near the boundary, defers registration to delay the clock, and codes for compliance — the headline metric climbs while twelve-hour trolley waits and delayed-sepsis mortality fail to improve, exposing the proxy's collapse.
Relationships to Other Primes¶
Parents (1) — more general patterns this builds on
- Goodhart's Law is a kind of Proxy-Target Divergence — proxy_target_divergence's file states it directly: "Not goodharts_law. Goodhart is ONE decoupling mechanism — the agent-driven, strategic- adaptation one. This prime is the umbrella ... of which Goodhart is one child." goodharts_law's own file agrees it is the optimization-pressure-on- a-proxy mechanism. So goodharts_law is unambiguously a CHILD of proxy_target_divergence (valid candidate CAND-R25-006-06). High conviction; both files independently license it. Phase-C kept it OFF regulatory_capture (0.824 nearest, distinct mechanism), agency_problem, and moral_hazard — correctly. campbells_law is its high-stakes twin (both children of the same umbrella). NOTE: if the family is consolidated under candidate proxy_target_fidelity (the genus, see proxy_target_divergence EMERGENT), goodharts_law re-parents there; until then the umbrella is the built/ candidate target.
Path to root: Goodhart's Law → Proxy-Target Divergence → Proxy–Target Fidelity
Not to Be Confused With¶
- Goodhart's Law is not Regulatory Capture because Goodhart is a measurement-target failure needing no capturing party, whereas capture is a regulator co-opted by the industry it oversees; the fixes diverge (control-loop redesign versus structural independence).
- Goodhart's Law is not the Agency Problem because Goodhart bites even when objectives are perfectly aligned — the honest agent optimizing the proxy still degrades the construct — whereas the agency problem rests on a divergence of goals.
- Goodhart's Law is not Moral Hazard because Goodhart concerns a proxy's informativeness collapsing under optimization, independent of who bears risk, whereas moral hazard is changed risk-taking under insulation from consequences.