Extrapolation Beyond Sampled Regime¶
Core Idea¶
A calibrated apparatus — model, expert, doctrine, formula, policy — is deployed on inputs outside the regime its calibration was established in, while still reporting the same confidence it would inside. The failure is self-blind: the confidence machinery is a function of the sampled regime, so it cannot register that the question is outside its competence, and the wrongness arrives wearing the apparatus's in-regime face.
How would you explain it like I'm…
The Overconfident Thermometer
Sure But Out Of Range
Confidently Off The Map
Broad Use¶
- Machine learning: a classifier emits high softmax confidence on out-of-distribution inputs; OOD detection exists because the base model is self-blind.
- Statistical regression: prediction outside the data's convex hull carries smooth confidence intervals that do not widen as the input moves out.
- Pharmacological dosing: adult-trial cohorts extrapolated to paediatric, geriatric, or pregnant patients, with efficacy reported at the same precision.
- Military doctrine and policy transfer: tactics or interventions calibrated in one theatre or jurisdiction applied confidently elsewhere.
- Expert intuition: deep experts show high confidence on out-of-base problems while accuracy drops sharply.
- Parametric insurance and forecasting: catastrophe triggers and prediction intervals carry in-sample machinery that cannot encode that the regime has moved.
Clarity¶
Forces the recognition that output and confidence are both functions of the calibration regime, and neither encodes whether the deployment regime matches it — converting "the model might not generalise" into a precise claim about where the missing capability lives.
Manages Complexity¶
Makes the calibration regime a first-class artefact and supplies a closed, shared intervention catalogue: characterise the regime, detect deployment-time exit on the input space, and refuse or hedge in the gap.
Abstract Reasoning¶
Licenses confidence-apparatus self-blindness (improve the indicator and it still cannot diagnose exit), the detection-not-prediction architecture, and refusal as a first-class output — coverage by design versus coverage by hope.
Knowledge Transfer¶
- ML → expert decision support: building OOD detectors ports as requiring the expert, by checklist, to confirm the case is within their experiential base.
- Clinical trials → deployed AI: writing down inclusion criteria and refusing to extrapolate ports as model cards plus deployment-time input checks.
- Engineering → climate adaptation: qualification envelopes port as explicit re-qualification when conditions exit the regime infrastructure was built for.
Example¶
A neural classifier shown pure noise confidently asserts "panda, 99.7%," because the softmax tracks the training regime, not the input's distance from it — so no amount of recalibration helps, and the fix is an out-of-band detector operating on the input space plus a refusal option the always-emits-a-label model must be retrofitted with.
Not to Be Confused With¶
- Extrapolation Beyond Sampled Regime is not Validation because validation establishes in-regime reliability once, whereas this prime is the deployment-time failure of carrying the validated apparatus outside its regime with unchanged confidence.
- Extrapolation Beyond Sampled Regime is not Calibration because calibration makes confidence match reliability within the regime, and perfect in-regime calibration remains self-blind to regime exit.
- Extrapolation Beyond Sampled Regime is not Overfitting because overfitting is fitting noise within the regime, whereas this prime is correct in-regime behaviour failing outside it; an un-overfit model is still self-blind.