Instrument Interpretive Drift¶
Core Idea¶
A measurement instrument's interpretive practice — how its output is produced from its input — silently shifts over time while its stated specification stays fixed. Longitudinal data then mingles real-world change with artefactual instrument change, and the drift is invisible to cross-sectional quality control because the whole cohort of measurers drifts together.
How would you explain it like I'm…
Same Test, Stricter Grading
The Ruler That Quietly Changed
Spec Frozen, Practice Drifting
Broad Use¶
- Manufacturing metrology: a gauge needs periodic recalibration because its transduction drifts even while its stamped specification is unchanged.
- Machine-learning annotation: human labellers applying a constant guide produce label distributions that drift as the cohort turns over.
- Medical coding: ICD conventions for "sepsis" or "heart failure" tighten or loosen over years without the diagnostic criteria on paper changing.
- Judicial sentencing: the same statutory range, but practice on what counts as a "serious" instance drifts across decades.
- Performance reviews: the rubric document unchanged while the practice of "exceeds expectations" drifts under grade-inflation feedback.
- Academic peer review and test scoring: acceptance standards and applied rubrics drift across editor or scorer cohorts even at constant stated specification.
Clarity¶
It separates three failure modes that conflate under "the data is drifting": the world drifting, the specification changing, and the practice silently moving under a fixed spec — each with a different detection method and fix.
Manages Complexity¶
It decomposes a longitudinal trend into three separately auditable components — real-world change, specification change, and practice drift — with frozen-reference re-measurement isolating the third.
Abstract Reasoning¶
It reframes detection: high inter-rater agreement is consistent with severe drift, because a cohort that drifts together agrees with itself at every instant — so the standard quality-control instrument is blind to the failure that matters.
Knowledge Transfer¶
- Metrology → many fields: gauge-R&R discipline (bank a master, re-measure on a schedule, subtract the drift) transfers to ML pipelines, coding panels, and peer-review meta-analyses.
- Shared toolkit: frozen references, blind re-measurement, drift subtraction, and cohort recalibration apply to master gauges, anchor papers, gold-standard annotation sets, and reference stars alike.
- Versioning move: requiring any practice change to be logged as a version bump, even when the rubric text is unchanged, ports across every interpretive substrate.
Example¶
An economics department changes nothing in its grading rubric from 2010 to 2025, yet the proportion of As doubles; banking 2010 anchor papers and having current graders blindly re-grade them reveals the drift on papers that cannot have changed — which can then be subtracted from the historical comparison.
Relationships to Other Primes¶
Parents (1) — more general patterns this builds on
- Instrument Interpretive Drift is a kind of, typical Temporal Decay and Degradation — A time-dependent, correlated (non-zero-mean) drift of a measurement instrument's interpretive practice while its stated spec stays fixed — a specialization of temporal degradation applied to interpretive calibration (vs a static bias removable by one calibration).
Path to root: Instrument Interpretive Drift → Temporal Decay and Degradation → Entropy (Thermodynamic Sense)
Not to Be Confused With¶
- Instrument Interpretive Drift is not Measurement Uncertainty because noise is zero-mean and averages out whereas this drift is correlated across time and trends in one direction, so more measurements do not cancel it and high agreement is no defense.
- Instrument Interpretive Drift is not a static Bias because a bias is a fixed offset removable by one calibration whereas this is a moving offset that defeats a single recalibration and needs repeated frozen-reference subtraction.
- Instrument Interpretive Drift is not Goodhart's Law because metric-targeting is only one drift mechanism whereas this prime also covers cohort turnover, precedent accumulation, and context shift — drift with no one optimising the measure.