Garbage In, Garbage Out¶

Prime #: 879
Origin domain: Information Theory
Subdomain: data quality and pipeline integrity → Information Theory
Aliases: Gigo

Core Idea¶

Garbage in, garbage out (GIGO) is the structural observation that the quality of a transformation's output is bounded above by the quality of its inputs: no amount of internal sophistication can repair defects, errors, biases, or distortions already present in the input. The output is, at best, input-quality-conserving; in practice it is often input-quality-degrading, since the transformation may amplify input noise, propagate input errors through correlated downstream variables, or add its own processing artefacts on top. It is not input-quality-improving in any structural sense that survives adversarial inputs. A system that appears to clean up bad inputs is either using additional trusted information not contained in the bad input, or producing apparently-clean outputs that are not actually faithful to ground truth.

The pattern asserts a quality floor set by inputs. Investing arbitrary effort in downstream sophistication — better algorithms, more compute, fancier models, more careful processing rules — yields diminishing or zero returns once the input-quality floor is binding, and it can manufacture false confidence: polished-looking results whose underlying input defects are no longer visible to the consumer. The structural fix for an output-quality problem caused by GIGO is therefore always an input intervention — better collection, better sensors, better source vetting, source-quality measurement — never a downstream-sophistication intervention.

The load-bearing claim is not the trivial "input quality matters" but the non-substitutability of downstream sophistication for input quality. That non-substitutability is what generates the recurring, expensive failure mode the principle warns against: organizations that invest heavily in analytics on top of unreliable data, ML teams that scale models on biased corpora, governments that build sophisticated decision systems on corrupt records, scientific reviews that meta-analyze unreliable primary studies. The formal backbone is the data-processing inequality — for any chain X → Y → Z, no processing of Y can increase its mutual information with X — and GIGO is the practitioner-level claim that holds even in substrates where the formal information measure is not cleanly defined.

How would you explain it like I'm…

Rotten Eggs, Bad Cake

If you bake a cake with rotten eggs, no matter how fancy your oven is, the cake will still taste bad. A great oven can't fix bad ingredients. So if you want a good cake, you have to start with good eggs, not just a fancier oven.

Bad In, Bad Out

Garbage in, garbage out means that if you put bad information into something, you'll get bad results out, no matter how clever the machine in the middle is. If your starting data is full of mistakes or is biased, then better computers and fancier programs can't truly fix it. In fact, they sometimes make it worse by spreading the errors around or hiding them under polished-looking results. So the real way to fix a bad-output problem is to fix the inputs: collect better data, use better sensors, check your sources. Spending more effort on the fancy processing part won't help once the inputs are the thing holding you back.

Inputs Set the Ceiling

Garbage in, garbage out (GIGO) is the observation that the quality of a transformation's output is capped by the quality of its inputs: no amount of internal sophistication can repair defects, errors, or biases already in the input. At best the output is input-quality-conserving; in practice it is often degrading, because the process can amplify input noise, propagate errors through correlated variables, or add its own artefacts. It is never input-quality-improving in a way that survives adversarial inputs. A system that looks like it cleans up bad inputs is either using extra trusted information not in the bad input, or producing polished outputs that aren't actually faithful to the truth. The load-bearing point isn't the trivial 'input quality matters,' but the non-substitutability of downstream sophistication for input quality. That is why the structural fix for a GIGO-caused output problem is always an input intervention (better collection, sensors, source vetting), never more downstream cleverness, and why piling sophistication on bad data can manufacture false confidence.

Garbage in, garbage out (GIGO) is the structural observation that the quality of a transformation's output is bounded above by the quality of its inputs: no amount of internal sophistication can repair defects, errors, biases, or distortions already present in the input. The output is, at best, input-quality-conserving; in practice it is often input-quality-degrading, since the transformation may amplify input noise, propagate input errors through correlated downstream variables, or add its own processing artefacts on top. It is not input-quality-improving in any structural sense that survives adversarial inputs. A system that appears to clean up bad inputs is either using additional trusted information not contained in the bad input, or producing apparently-clean outputs that are not actually faithful to ground truth. The pattern asserts a quality floor set by inputs: investing arbitrary effort in downstream sophistication yields diminishing or zero returns once that floor is binding, and it can manufacture false confidence, polished-looking results whose underlying input defects are no longer visible to the consumer. The structural fix for a GIGO output problem is therefore always an input intervention (better collection, better sensors, better source vetting, source-quality measurement), never a downstream-sophistication intervention. The load-bearing claim is not the trivial 'input quality matters' but the non-substitutability of downstream sophistication for input quality, which is what generates the recurring, expensive failure mode the principle warns against. Its formal backbone is the data-processing inequality: for any chain X to Y to Z, no processing of Y can increase its mutual information with X.

Structural Signature¶

the quality-bearing input — the transformation that processes it — the output and its quality — the quality-monotonicity relation (no processing raises fidelity to ground truth) — the input-set quality floor on the output — the non-substitutability invariant (downstream sophistication cannot replace input quality)

The pattern is present when the following components co-occur:

The quality-bearing input. Some source material enters the system carrying a definite, measurable fidelity to ground truth — and possibly defects, noise, or bias the system does not introduce itself.
The transformation. A processing stage maps inputs to outputs. Its internal sophistication — algorithm, model, compute, procedure — is the lever that practitioners are tempted to pull when output quality disappoints.
The output and its quality. A result emerges whose fidelity to ground truth is the quantity actually cared about, and which the consumer judges, often by surface polish rather than by traced fidelity.
The monotonicity relation. Across the chain input → transform → output, processing cannot increase the output's information about ground truth beyond the input's — the data-processing inequality in its qualitative form. At best quality is conserved; typically it is degraded.
The input-set floor. The output's quality is bounded above by the input's; investing in the transformation yields diminishing returns once that floor is binding, and may manufacture false confidence by hiding the floor behind polish.
The non-substitutability invariant. Downstream sophistication does not substitute for input quality — the load-bearing claim. Any apparent cleanup is either drawing on additional trusted side information (which is then the real input to audit) or fabricating fidelity (making the output worse, not better).

The components compose into one diagnosis: because a transformation cannot raise fidelity above its inputs', an output-quality problem is structurally an input problem, and the only effective intervention is upstream — better collection, vetting, and provenance — never more downstream processing.

What It Is Not¶

Not transformation in general. See transformation: a transformation maps inputs to outputs along any dimension. GIGO is the specific quality-monotonicity constraint on that map — fidelity cannot rise across it — not the mapping itself.
Not refinement. See refinement: refinement improves an artifact toward a goal. GIGO denies that downstream processing can improve fidelity to ground truth beyond the input's; any apparent improvement is borrowed from a trusted side channel.
Not purity/pollution as a normative category. See purity_and_pollution: that carries symbolic and evaluative weight about contamination. GIGO is an information-theoretic ceiling — the data-processing inequality — with only incidental evaluative flavor.
Not robustness. See robustness: a robust system tolerates input perturbation and still functions. GIGO is the orthogonal claim that no processing recovers fidelity the input never carried; robustness is about graceful degradation, not fidelity ceilings.
Not diminishing returns. See diminishing_incremental_gains: that describes a smooth falloff in marginal benefit. GIGO describes a binding floor set by inputs — downstream returns drop to zero once the floor binds, regardless of effort.
Common misclassification. Reading any "bad result" as GIGO. The signature is non-substitutability: the output problem must be traceable to an input-fidelity ceiling that no processing can breach. A failure caused by a buggy transformation on good inputs is a processing defect, not GIGO.

Broad Use¶

The principle's origin is computing and data engineering, where GIGO labels any pipeline in which bad inputs produce bad outputs regardless of program correctness. In machine learning and AI, training-data quality, label noise, and biased corpora set the ceiling on accuracy and fairness — "you can't model your way out of bad data," the spirit of the data-centric-AI movement. In statistics and meta-analysis, a synthesis inherits the bias of its primary studies, which is why evidence-rating frameworks grade the underlying trials. In accounting and audit, financial reports are bounded by the integrity of underlying transaction records, and major audit failures are often GIGO at the data-source layer. In intelligence analysis, assessment quality is bounded by source quality, with notorious failures traceable to bad source intelligence that no analytic tradecraft could repair. In scientific research, derived results inherit primary-data quality, and the replication crisis is in part a GIGO story. In decision-making and policy, cost-benefit analyses, climate models, and epidemiological forecasts are bounded by input-parameter quality, so sophisticated models on bad inputs yield high-confidence wrong answers. The same floor governs education (learning bounded by the quality of what is taught), manufacturing (product quality bounded by component quality, hence the supplier-quality function), journalism (reporting bounded by source reliability), and legal adjudication (verdicts bounded by evidence quality, hence chain-of-custody). The principle surfaces under many names — data-centric AI, the evidence pyramid, supplier quality, single source of truth, chain of custody — that all encode the same non-substitutability.

Clarity¶

GIGO clarifies a specific non-substitutability: downstream sophistication does not substitute for input quality. This makes visible an investment-allocation question that organizations chronically get wrong — they invest in better analytics, models, and reporting instead of better data collection, because the downstream investments are more legible to stakeholders. Naming the principle re-orders the diagnosis: when output quality is unsatisfactory, the first question becomes "what is the quality of the inputs?" rather than "what is wrong with the processing?"

The principle also names a related danger — false confidence. A sophisticated pipeline can launder bad inputs into outputs that look polished and credible, defeating the natural quality cue (rough output prompts a check of inputs) that cruder systems provide. The more sophisticated the downstream, the more important explicit input-quality measurement becomes, because the output no longer betrays the input defects. The clarifying force is to insist that input quality be made observable to downstream consumers — through provenance, lineage, and confidence bounds on inputs — rather than inferred from the surface polish of the result.

Manages Complexity¶

GIGO compresses a wide family of "we invested in the wrong place" failures — across data engineering, ML, statistics, accounting, intelligence, science, policy, education, manufacturing, journalism, and law — into one diagnostic family with a uniform intervention vocabulary: measure input quality explicitly rather than assuming it, vet sources before processing, audit the input pipeline when output quality disappoints, resist adding downstream sophistication on top of unreliable inputs, and make input quality observable to consumers through provenance and lineage.

The principle also organizes a recurring pattern in post-mortems: most "model failure," "report failure," and "assessment failure" stories, on inspection, are data failure stories with the processing layer taking the blame. By relocating the diagnosis to the input layer, GIGO turns a confusing class of expensive surprises into a single, checkable question about where the quality floor sits and what the marginal return to further processing is given that floor — a large reduction in the search space a diagnostician must traverse.

Abstract Reasoning¶

Reasoning about GIGO involves several structural moves. Quality-floor identification characterizes the upper bound on output quality set by inputs, often through an information-theoretic argument: the output's mutual information with ground truth cannot exceed the input's, by the data-processing inequality. Investment marginal-return analysis notes that once the input-quality floor is binding, marginal returns to downstream sophistication approach zero, so investment should shift to input quality. Provenance and lineage tracking ensures input-quality metadata travels with the data so downstream consumers can discount appropriately. Adversarial-input analysis asks what happens if inputs are deliberately corrupted; many systems pass nominal-input tests but catastrophically amplify adversarial inputs. The data-processing inequality itself is the formal backbone, with GIGO as its qualitative restatement when "quality" stands in for mutual information. And the brittle-cleanup heuristic asks, whenever a system appears to clean up bad inputs, whether it is drawing on trusted side information — in which case the side information is the real input and should be audited — or fabricating apparent quality, in which case the output is worse than the raw input, not better.

Knowledge Transfer¶

The principle transferred from early computing folklore into statistics, where it became a recurring methodological refrain; into AI and ML, where data-centric AI is its programmatic formulation; and into business intelligence, audit, intelligence tradecraft, and increasingly public-policy debate about algorithmic decision-making, where biased training data producing biased decisions is the fairness community's GIGO story. The information-theoretic formalization — the data-processing inequality — predates the practitioner label, and the label is its informal restatement with cross-substrate currency wherever a pipeline exists. The transfer between substrates is substantive: auditors cite ML failures, ML researchers cite intelligence failures, and intelligence analysts cite scientific replication failures, all sharing the same diagnosis.

Consider a clinical-decision-support tool, sophisticated and validated on held-out data, that systematically under-predicts care needs for one patient population. Investigation traces the failure not to the model but to the training target: the proxy for "future medical need" was "future medical-care spending," which diverged from need along access lines, so the input's validity floor-bound the output's fairness, and no model sophistication could fix it; the fix was to redefine the target — an input intervention. The same structural moves appear in an intelligence assessment resting on a single unreliable source, a meta-analysis floor-bound by publication bias in its primary literature, an audit floor-bound by corrupt transaction records, and a manufacturing-yield problem traced to component-supplier defects rather than assembly-line process improvements. In every case the transferable diagnosis is identical: locate the input-quality floor, intervene at the input, and recognize that downstream sophistication is not a substitute. Because the diagnosis is structural, a practitioner who has internalized it in one field applies it intact in any other where a transformation maps quality-bearing inputs to quality-bounded outputs.

Examples¶

Formal/abstract¶

The data-processing inequality is GIGO's exact formal backbone. For any Markov chain \(X \to Y \to Z\) — where \(X\) is the ground-truth quantity, \(Y\) the observed input, and \(Z\) any deterministic or stochastic processing of \(Y\) — the mutual information obeys \(I(X; Z) \leq I(X; Y)\). No transformation applied to \(Y\) alone can raise its information about \(X\); processing can only preserve or destroy it. The quality floor is literally an information ceiling: \(I(X; Y)\) caps how much an output can ever tell you about the truth, and the most elaborate model in the world is a function of \(Y\), hence subject to the bound. This makes the non-substitutability invariant a theorem rather than folklore: downstream sophistication is a post-processing of \(Y\), and post-processing cannot increase \(I(X; \cdot)\). The intervention is read straight off the inequality — to raise the ceiling you must change \(Y\) itself (better sensor, better collection, a new measurement channel), increasing \(I(X; Y)\) at the source. The one apparent exception — a system that seems to "clean up" \(Y\) — is always importing a second input \(W\) with its own \(I(X; W) > 0\); the chain is then \(X \to (Y, W) \to Z\), and \(W\) is the real input to audit.

Mapped back: The quality-bearing input is \(Y\); the transformation is any processing producing \(Z\); the output quality is \(I(X; Z)\); the monotonicity relation is \(I(X; Z) \leq I(X; Y)\); the input-set floor is the ceiling \(I(X; Y)\); and the non-substitutability invariant is that no function of \(Y\) alone can breach that ceiling.

Applied/industry¶

A health system deploys a clinical-decision-support model, carefully validated on held-out data, to flag patients who need extra care. In production it systematically under-flags one patient population. The instinct is to retrain with a bigger model — a downstream-sophistication move. GIGO redirects the diagnosis upstream: the training target was "future medical-care spending" used as a proxy for "future medical need," and the two diverge along access-to-care lines, so the input's validity floor-bound the output's fairness no matter how sophisticated the model. The quality-bearing input here is the label (the proxy target); the transformation is the model; the output is the risk score whose fidelity to true need is what is cared about. The monotonicity relation bites: no model can recover need-information the spending-proxy label never contained. The only effective fix is an input intervention — redefine the target to a direct measure of health need, or add trusted side information (clinical indicators) as a genuinely new input channel. The same structure governs a meta-analysis floor-bound by publication bias in its primary studies (fix: register and include unpublished trials, an input intervention) and a manufacturing yield problem traced to component-supplier defects (fix: supplier-quality vetting, not more assembly-line tuning).

Mapped back: The quality-bearing input is the spending-proxy label; the transformation is the decision-support model; the output quality is the score's fidelity to true medical need; the monotonicity relation forbids the model from manufacturing need-information absent in the label; the input-set floor is the proxy's validity ceiling; and the non-substitutability invariant is why retraining cannot fix it and only redefining the target can.

Structural Tensions¶

T1 — Single Input versus Side-Information Channel (coupling). The non-substitutability invariant holds for a function of one input — but the prime itself concedes the escape: a system that appears to clean up garbage is importing a second trusted channel. The boundary is where "no processing helps" gives way to "a new input channel does." The failure mode runs both ways: dismissing a genuine multi-channel fusion as impossible ("you can't fix bad data"), or crediting a downstream stage with quality it actually borrowed from an un-audited side input. Diagnostic: when output beats its nominal input, find the second channel \(W\) and audit it as the real input — do not attribute the gain to the transformation.

T2 — Input Floor versus Aggregation Gain (scalar). GIGO bounds a single transformation, but pooling many independently-noisy inputs can drive error below any one input's quality — wisdom-of-crowds, ensemble averaging, sensor fusion. The competing prime is statistical aggregation, where redundancy buys fidelity the data-processing inequality seems to forbid. The subtlety: aggregation does not violate the bound (each input contributes independent information), but a naive GIGO reading mis-predicts that bad inputs cap the pooled result. The failure mode is refusing to aggregate noisy-but-independent sources because "garbage in." Diagnostic: distinguish correlated defects (which the floor genuinely caps) from independent noise (which averaging reduces).

T3 — Quality Floor versus Useful Lossy Output (scopal). The prime treats fidelity-to-ground-truth as the quantity cared about — but many transformations legitimately discard input quality to serve a different purpose (lossy compression, summarization, abstraction), and the output is fit-for-use despite lower fidelity. The failure mode is misapplying GIGO to a deliberate, beneficial information reduction, flagging a good summary as "degraded" when the lost detail was the point. Diagnostic: ask whether the output's purpose is faithful reconstruction (where the floor binds) or useful reduction (where loss is intended) before calling lost quality a defect.

T4 — Static Input Quality versus Feedback Improvement (temporal). GIGO is stated as a one-pass bound, but real pipelines loop: outputs trigger corrections to upstream collection, so a system's input quality rises over time even though any single pass obeys the floor. The competing prime is feedback / active learning, where the transformation's errors are routed back to improve the input channel. The failure mode is reading the static one-pass bound as a permanent ceiling, missing that the binding constraint moves as the loop closes. Diagnostic: ask whether the input is fixed or whether the system can act on the world to raise input quality across iterations.

T5 — Input Intervention versus Diminishing Floor Returns (measurement). The prime says fix the input, not the processing — but input quality is itself bounded and expensive, and at some point the marginal trusted bit costs more than the decision is worth. The competing consideration is that "improve the input" is no more unlimited than "improve the model." The failure mode is an infinite-regress data-quality crusade: chasing provenance and source vetting past the point where the residual input defect no longer affects the decision. Diagnostic: measure how much the decision would change if the input floor rose, and stop input investment when that sensitivity goes to zero.

T6 — Detectable Garbage versus Laundered Polish (sign/direction). GIGO's most dangerous form is false confidence — sophisticated processing that makes bad inputs look clean, defeating the natural cue that rough output prompts an input check. The tension is between two opposite failure surfaces: crude systems that visibly fail (safe, because the garbage shows) and polished systems that fail invisibly (dangerous, because the garbage is hidden). The failure mode is trusting output polish as a quality signal, with trust increasing exactly as the laundering gets better. Diagnostic: require explicit input-quality metadata (provenance, lineage, confidence bounds) to travel with the output, so fidelity is traced rather than inferred from surface smoothness.

Structural–Framed Character¶

Garbage in, garbage out sits just on the structural side of the structural–framed spectrum — a mixed-structural prime with an aggregate of 0.4. The grade records a real tension: the underlying claim is a substrate-neutral information-theoretic inequality, but the prime as named is computing folklore with a faint evaluative tint, so it does not reach the pure-structural floor that its formal backbone alone might suggest.

The diagnostics tell that story. Import-versus-recognize reads 0.0, the most structural reading: invoking GIGO imports no interpretive frame — it recognizes a ceiling already wired into any chain \(X \to Y \to Z\), the data-processing inequality \(I(X;Z) \le I(X;Y)\), which holds whether or not anyone is reasoning about it. The remaining four sit at 0.5, each for a concrete reason. Vocabulary travels (0.5): the load-bearing content is medium-neutral — fidelity, quality floor, non-substitutability — and the same diagnosis governs ML training labels, meta-analysis, audit records, and intelligence sources, but the prime's name is computing-pipeline slang ("garbage") that must be translated into mutual-information terms to read as structure. Evaluative weight (0.5): "garbage" is a mildly pejorative framing, and "quality floor" carries a whiff of approval/disapproval that a purely structural statement of the inequality would lack. Institutional origin (0.5): the practitioner label is early-computing folklore, even though the formal inequality predates and outruns it. Human-practice-bound (0.5): the diagnosis is most often invoked in human pipelines (data engineering, audit, science), but the inequality itself runs in any physical channel that transmits and processes a signal, with no practitioner required.

The honest reading is that the structural core is strong — the inequality is a theorem, not a heuristic, which is why the substrate-independence grade reaches a 5 and import-versus-recognize bottoms out at zero — but the folklore phrasing and the faint evaluative load keep the prime from the pure-structural pole. The 0.4 aggregate correctly places it just inside the structural half, and the prose should neither sand off the computing-folklore frame nor inflate it past the substrate-neutral inequality beneath.

Substrate Independence¶

Garbage In, Garbage Out is a maximally substrate-independent prime — composite 5 / 5 on the substrate-independence scale. Beneath the computing folklore phrasing sits a data-processing-inequality structure — fidelity to ground truth cannot increase across a single-input transformation — that is recognized rather than translated wherever it appears, which is what lifts the composite to the ceiling. On domain breadth (5) the quality-floor-set-by-inputs pattern governs genuinely unlike substrates with identical force: computing and data engineering, machine learning ("you can't model your way out of bad data"), statistics and meta-analysis (a synthesis inherits its primary studies' bias), audit (reports bounded by transaction-record integrity), intelligence analysis (assessment bounded by source quality), scientific research (the replication crisis as a GIGO story), policy modeling (sophisticated models on bad inputs yielding confident wrong answers), education, manufacturing, journalism, and legal adjudication — a span across institutions, machines, and inference with no medium privileged. On structural abstraction (4) the signature is a near-pure inequality on an information-fidelity quantity, medium-neutral and carrying no domain commitments; it sits just shy of the maximum only because what counts as "quality" must be pinned to faithful reconstruction of ground truth for the floor to bind exactly. On transfer evidence (5) the carry is concrete and documented under many independently-coined names that all encode the same non-substitutability — data-centric AI, the evidence pyramid, supplier quality, single source of truth, chain of custody — which is strong evidence the structure was rediscovered in each field rather than borrowed. The folklore phrasing is the only thing resembling a frame, but the underlying inequality is substrate-neutral and recognized in place, so the maximal grade holds.

Composite substrate independence — 5 / 5
Domain breadth — 5 / 5
Structural abstraction — 4 / 5
Transfer evidence — 5 / 5

Relationships to Other Primes¶

Parents (1) — more general patterns this builds on

Garbage In, Garbage Out presupposes Transformation

GIGO is a quality-MONOTONICITY constraint on a transformation (fidelity to ground truth cannot rise across a single-input map — the data-processing inequality); it presupposes the transformation whose output quality it bounds. The file: transformation is its genus, GIGO 'a constraint on one dimension of it'.

Path to root: Garbage In, Garbage Out → Transformation

Neighborhood in Abstraction Space¶

Garbage In, Garbage Out sits among the more crowded primes in the catalog (33^rd percentile for distinctiveness): several abstractions describe nearly the same structure, so a description that fits it will tend to fit its neighbors too — transporting it usually means disambiguating within this family rather than landing on it exactly.

Family — Unclustered & Miscellaneous (91 primes)

Nearest neighbors

Distortion — 0.74
Unreliable Narrator — 0.74
Quality Control — 0.72
Brandolini's Law — 0.72
Substitutability — 0.72

Computed from structural-signature embeddings · 2026-06-14

Not to Be Confused With¶

The nearest confusion is with plain transformation, the prime's embedding-nearest neighbor and its genus. A transformation is any mapping from inputs to outputs — it may translate, compress, enrich, reformat, or combine, and it carries no inherent claim about quality direction. GIGO is not the mapping but a constraint on one dimension of it: fidelity to ground truth cannot increase across a single-input transformation. The distinction is load-bearing because not every transformation is governed by GIGO's pessimism. A transformation that fuses a second trusted channel genuinely raises fidelity (and is the law's acknowledged escape); a transformation whose purpose is deliberate lossy reduction (summarization, abstraction) legitimately discards fidelity for fitness-to-use. GIGO applies only when the quantity cared about is faithful reconstruction of ground truth and the transformation is a function of the defective input alone. A practitioner who treats every transformation as GIGO-bounded will refuse beneficial fusions and mislabel good summaries as degraded; the prime names the specific regime — single defective input, fidelity the target — where the ceiling truly binds.

A second genuine confusion is with refinement, because both concern whether an artifact gets "better" as it passes through a process. They make opposite claims about the same arrow. Refinement is the assertion that successive processing improves the artifact toward a goal — polishing, error-correcting, increasing fitness. GIGO is the assertion that processing cannot improve fidelity-to-ground-truth beyond what the input carried. The two are reconciled by noticing what each measures: refinement improves the artifact against a goal (readability, performance, conformance to spec), which a transformation genuinely can do; GIGO bounds the artifact's information about the truth, which a function of the defective input cannot increase. A polished, refined report can be more usable and less faithful at once. The practitioner consequence is to ask which axis is in play: if the worry is fidelity to a ground truth the input never captured, GIGO binds and refinement is a mirage; if the worry is fitness-for-purpose given faithful inputs, refinement is real work and GIGO is irrelevant.

A third confusion worth pre-empting is with robustness, which also concerns a system's relationship to imperfect inputs. But robustness and GIGO point in different directions. Robustness is the property that a system continues to function and degrades gracefully under perturbed, noisy, or adversarial inputs — it is about preserving operation, not recovering truth. GIGO is the claim that no amount of processing recovers fidelity the input never carried — a robust system can be perfectly GIGO-bounded, faithfully and stably propagating the limited information its garbage inputs contain. One can have either without the other: a brittle system on perfect inputs (no robustness, no GIGO problem) and a robust system on bad inputs (full robustness, full GIGO ceiling). Confusing them leads a practitioner to invest in fault-tolerance and graceful degradation when the actual problem is that the input simply lacks the information the output needs — a ceiling no robustness engineering can lift.

For a practitioner these distinctions decide where to spend. Mistaking GIGO for generic transformation makes the law seem to forbid beneficial fusions and reductions it does not touch. Mistaking it for the negation of refinement confuses fitness-for-goal (improvable) with fidelity-to-truth (ceiling-bound). And mistaking it for a robustness deficit aims fault-tolerance engineering at an information ceiling it cannot raise. The prime earns its keep as the precise non-substitutability claim — downstream sophistication cannot replace input fidelity — that none of these neighbors makes.

Solution Archetypes¶

No catalogued solution archetypes reference this prime yet.