Underspecification¶
Core Idea¶
A criterion is treated as picking out a single answer when in fact an equivalence class of answers satisfies it equally well; a hidden closure (a seed, an optimisation trajectory, a default, a prior) silently picks one representative, so the load-bearing behavior may be governed entirely by the closure rather than the constraint.
How would you explain it like I'm…
Many Answers, One Clue
The Clue That Doesn't Decide
Constraint Versus Hidden Closure
Broad Use¶
- Machine learning: many models with identical validation accuracy encode different decision surfaces that diverge sharply on a stress test.
- Inverse problems and physics: recorded data is consistent with infinitely many internal states (Hadamard ill-posedness), and a regulariser picks one.
- Causal inference: multiple causal graphs imply the same conditional independences, and the data alone cannot adjudicate.
- Compiler and language specs: a standard leaves a behavior unspecified, and two conforming implementations produce different programs.
- Legal interpretation: statutory text is consistent with several readings, and precedent or canon picks one.
- Intelligence analysis: the same indicators are consistent with several adversary-intent hypotheses, and the chosen one is the analyst's default.
Clarity¶
Forces a question success metrics never ask — "under my criterion, what other answers are also acceptable, and how do they differ?" — and exposes the closure as a load-bearing input that ordinarily hides.
Manages Complexity¶
Consolidates model brittleness, compiler-dependent bugs, irreproducible conclusions, and doctrine drift into one move: probe the equivalence class, not just the chosen representative.
Abstract Reasoning¶
Licenses a structural prediction available before any field test: properties controlled by the constraint are robust, those controlled only by the closure are free coordinates liable to flip — identifiable by inspecting where admissible builds disagree.
Knowledge Transfer¶
- ML → physics → law: generating the admissible set is one move — training many networks from different seeds, deriving the family of regularised inversions, or laying out admissible statutory readings.
- Across domains: disclosing the closure (naming the tiebreaker) is identical reasoning everywhere.
Example¶
A team trains a classifier selected on validation accuracy; many models differing only in seed achieve identical accuracy yet diverge under a subgroup stress test the criterion never constrained — so the shipped behavior was a free coordinate, not a determined one.
Relationships to Other Primes¶
Parents (1) — more general patterns this builds on
- Underspecification is a kind of, typical Inductive Reasoning — Underspecification is the specific case where an inductive inference is UNDERDETERMINED — many generalizations fit the particulars equally and a hidden closure picks one. The file frames it as generalizing theory-underdetermined-by-data into selection pipelines. is-a inductive_reasoning specialized to a criterion that admits an equivalence class.
Path to root: Underspecification → Inductive Reasoning
Not to Be Confused With¶
- Underspecification is not Overfitting because overfitting is a fit too tight to noise with one determined solution, whereas underspecification is a fit too loose to signal with many equally-good solutions and a hidden tiebreaker — the remedies are opposite.
- Underspecification is not Confirmation Bias because confirmation bias is a cognitive preference for belief-supporting evidence, whereas underspecification is a structural property of the criterion itself, true regardless of anyone's preferences.
- Underspecification is not Selection Bias because selection bias is a defect in how the sample was drawn, whereas underspecification can occur on a flawless sample whose criterion simply fails to pin a unique solution.