Category Retrieval Lock In¶

Prime #: 691
Origin domain: Psychology Cognition
Subdomain: categorization and retrieval → Psychology Cognition

Core Idea¶

When a property-rich substrate has been compressed under a category label to make retrieval cheap, the label takes the place of the property list during routine use, and subsequent operations that need the property list are systematically obstructed by the very shortcut that made routine use fast. The structural commitments are four. A property-rich substrate — an object, person, codebase, firm, or document — has many independent properties relevant to different downstream uses. A category label is assigned that stands in for the substrate during retrieval, compressing the property list to a tag that can be matched cheaply. Routine use runs through the label successfully, so the property list is rarely inspected. And novel re-use — any operation that needs a property the label does not name, or one the label actively contradicts — pays a disproportionate cost to recover the underlying property list, because the label is now an active obstacle rather than a passive summary.

The structural force is the asymmetry: the same compression that makes routine retrieval fast makes novel retrieval slow, and the slowdown is structural, not a matter of working harder, but of having to undo a category lock that all the retrieval machinery is built on. The lock-in lives in the retrieval infrastructure, not in the substrate itself.

What changes in a reader's view of a system is that "we got stuck in our categories" splits into two distinct diagnoses. Retrieval lock-in locates the obstruction in the indexing and habit machinery, pointing at the labeling infrastructure as the thing to change; substrate lock-in locates it in the substrate, which cannot support the new use at all and suggests entirely different interventions. Separating these is the prime's central contribution, because it predicts that the cheapest fix is usually to re-index against the needed property rather than to transform the substrate or simply try harder.

How would you explain it like I'm…

The Hiding Label

Imagine you put a toy in a box labeled "red toys" so you can find it fast. Later you want "the toy that squeaks," but your boxes only say colors, so you have to dig through everything. The very label that made one kind of search easy makes a different search hard. The problem is in how you sorted things, not in the toy itself.

The Helpful Tag Trap

Category Retrieval Lock-In happens when you label something with one tag to find it quickly, and that tag quietly replaces the real list of its details. Most of the time the tag works great, so you almost never look at the full details. But the moment you need a detail the tag doesn't mention — or one it actually contradicts — you pay a big cost to dig the details back out, because the tag is now in your way. The key point: the stuckness lives in your filing system, not in the thing you filed. The cheapest fix is usually to re-file it by the detail you now need.

Shortcut Becomes Obstacle

Category Retrieval Lock-In is what happens when a property-rich thing — a person, a codebase, a document — gets compressed under a category label so it's cheap to retrieve, and that label starts standing in for the full property list during everyday use. Because routine use runs through the label successfully, the underlying properties are almost never inspected. Then a novel re-use needs a property the label doesn't name (or one it contradicts), and recovering that property is disproportionately expensive, because the label is now an active obstacle, not a passive summary. The force is an asymmetry: the same compression that makes routine retrieval fast makes novel retrieval slow, and the slowdown is structural — you have to undo a category lock the whole retrieval system is built on. Critically, the lock-in lives in the retrieval infrastructure, not in the thing itself.

When a property-rich substrate has been compressed under a category label to make retrieval cheap, the label takes the place of the property list during routine use, and later operations that need the property list are systematically obstructed by the very shortcut that made routine use fast. Four commitments: a property-rich substrate (object, person, codebase, firm, document) with many independent, differently-useful properties; a category label assigned to stand in for it during retrieval, compressing the property list to a cheaply-matched tag; routine use that runs through the label successfully, so the property list is rarely inspected; and novel re-use — any operation needing a property the label does not name or actively contradicts — which pays a disproportionate cost to recover the property list, because the label is now an obstacle, not a summary. The structural force is the asymmetry: the same compression that makes routine retrieval fast makes novel retrieval slow, and the slowdown is structural — a matter of undoing a category lock the retrieval machinery is built on, not of working harder. The lock-in lives in the retrieval infrastructure, not the substrate. This splits "we got stuck in our categories" into two diagnoses: retrieval lock-in locates the obstruction in indexing and habit, pointing at the labeling infrastructure; substrate lock-in locates it in the substrate, which cannot support the new use at all. Separating them predicts the cheapest fix is usually to re-index against the needed property rather than transform the substrate or try harder.

Structural Signature¶

a property-rich substrate — a category label that compresses its property list — a retrieval infrastructure that has compiled the label in place of the properties — routine use that runs through the label successfully — a novel re-use needing a property the label omits or contradicts — a disproportionate decompression cost as the load-bearing invariant — the lock located in the indexing machinery, not the substrate

The pattern is present when each of the following holds:

A property-rich substrate. An object, person, codebase, firm, or document with many independent properties relevant to different downstream uses.
A compressing label. A category tag is assigned that stands in for the substrate during retrieval, collapsing the property list to something matchable cheaply.
A retrieval infrastructure. Indexing and habit machinery that finds the substrate by the label — the locus where the lock-in actually lives.
Successful routine use. Operations that the label names run through it cheaply, so the property list is rarely inspected.
An obstructed novel re-use. Any operation needing a property the label does not name, or one it actively contradicts, must recover the underlying property list.
A disproportionate decompression cost. Recovering the property list is structurally expensive — the slowdown is the asymmetry between cheap compression and costly decompression, not a matter of working harder. Its size scales with the retrieval graph that compiled the label, not with the substrate.
Locus in the indexing geometry. The obstruction is retrieval lock-in (in the labeling machinery, re-indexable) as distinct from substrate lock-in (in the substrate itself, requiring transformation).

These compose into a diagnosis-plus-remedy: when an obstacle to novel use looks disproportionate to the underlying change, suspect a compiled label, and re-index against the needed property — often via a secondary index — rather than transforming the substrate or trying harder.

What It Is Not¶

Not search and retrieval. search_and_retrieval (the embedding nearest neighbor) is the general mechanism of finding stored items by a query. This prime is the specific pathology in which a compiled category label, having made routine retrieval cheap, obstructs novel retrieval of an obscured property — a failure mode of retrieval infrastructure, not the retrieval function itself.
Not lock-in generally. lock_in is any costly-to-reverse commitment. This prime locates the lock specifically in the indexing/labeling machinery and contrasts it with substrate lock-in; its central claim is that the obstruction is re-indexable, not in the substrate — a distinction generic lock_in does not draw.
Not cognitive entrenchment. cognitive_entrenchment is the broad rigidity of expert mental models. This prime is narrower and substrate-neutral: it is about a retrieval label compiled in place of a property list, which applies equally to a codebase, a market index, or a library catalog, not only to expert cognition.
Not locality of reference. locality_of_reference is the tendency to access nearby items together, exploited by caches. This prime concerns retrieval along versus across an indexing dimension; the cost is the geometry of a category index, not spatial or temporal proximity of accesses.
Not associative memory. associative_memory is retrieval by content-based association. This prime is about the compression of a property list into a single category tag that blocks association on the omitted properties — the obstruction of associative reach by a compiled label, not the associative mechanism itself.
Common misclassification. Diagnosing a disproportionate obstacle as substrate-level difficulty when the needed property is present but unreachable under the prevailing label. The tell: ask whether the property is present-but-obscured (retrieval lock, dissolved by re-indexing) or genuinely absent (substrate lock, requiring transformation). Treating the first as the second wastes effort transforming a substrate that was fine.

Broad Use¶

In cognition, the origin substrate, functional fixedness retrieves a screwdriver as "screwdriver" rather than as a thin metal shaft with an insulating handle, and the Einstellung effect fixes a problem-solving method as a category so a novel problem needing a different decomposition is obstructed. In software, a function's type signature compresses its actual property list — memory behavior, exception profile, concurrency guarantees — so callers see the label, not the properties, and refactoring across the type boundary costs far more than the underlying code change. In organizations, a person retrieved by title and remit rather than by latent skills cannot be re-deployed across functions, because the staffing system finds people by role tag. In language, lexicalization collapses a phrase into a single token whose compositional parts become inaccessible under retrieval pressure, so a context demanding a specific property the lexicalized category obscures is obstructed. In markets, a firm filed under one classification is invisible to analysts searching another, and reclassification events produce documented price-revision shocks precisely because the retrieval infrastructure had locked the firm under the old label. In knowledge organization, a document filed under one keyword set resists retrieval under another, which is why full-text and embedding-based search were developed to break hand-categorization locks. In medical diagnosis, premature closure locks a patient under a label so inconsistent symptoms are under-weighted. Across these the structural force is constant: a label-shaped shortcut, having served retrieval, prevents the cheaper-than-baseline inspection of the property list when novel use demands it.

Clarity¶

The prime forces a distinction often elided as "we got stuck in our categories": the difference between retrieval lock-in, where the obstruction is in the retrieval infrastructure rather than the substrate, and substrate lock-in, where the substrate itself cannot support the new use. A diagnosis of retrieval lock-in points the analyst at the labeling and indexing machinery as the load-bearing element to change; a diagnosis of substrate lock-in points at the substrate and suggests very different interventions. Confusing the two leads to transforming a substrate that was fine, or accepting a limitation that was only an indexing artifact.

It also separates the cost of recovering the property list from the cost of using the property list once recovered. The lock-in lives in the first; the second is often modest. This explains why "why didn't anyone notice X about this firm, object, or person?" has a structural answer: nobody who needed X had cheap access to the property list under the prevailing label. The clarifying force is to relocate the puzzle from the substrate or the people to the indexing geometry, making visible that the missed property was present all along but unreachable through the retrieval path that the label had compiled.

Manages Complexity¶

The retrieval-lock-in diagnosis collapses a wide class of "we weren't able to see, re-use, or re-deploy" failures into a single three-part accounting: identify the label that has been doing the retrieval work; identify the property the new operation needs; and decide whether the cheaper move is to drop the label and rebuild retrieval against the property list, build a secondary index against the property, or accept the lock-in and route the new operation through a different mechanism. The same three-step accounting transfers from cognitive re-purposing of objects to software refactoring to organizational re-staffing to library re-cataloguing, replacing a set of unrelated frustrations with one diagnostic.

The compression is sharpened by recognizing the secondary-index intervention as the portable remediation across substrates. Rather than relabeling everything, which disrupts the routine use the label was serving, one builds an alternative retrieval path against the property the new use needs while leaving the primary category-label index in place. This recurs as the cheapest available move whether the substrate is a codebase, an organization, or a library, which is why the prime's intervention space is so compact: the same indexing-geometry move answers a failure family that would otherwise be diagnosed substrate-by-substrate as different problems.

Abstract Reasoning¶

The prime supports several inferences about retrieval geometry. The compression-decompression asymmetry: compression is cheap per use — one label fits in working memory or one tag in an index entry — while decompression is expensive at decision time, because the property list must be reconstructed and re-evaluated against the new use; this predicts that lock-in is heaviest where the property list is long and the label short. Indexing geometry: a category system is a one-of-N tag, so retrieval along the indexing dimension is cheap and retrieval across it expensive, and the cost is the geometry of the index, not of the substrate.

Two further inferences concern dynamics and cost. Update lag: when the substrate's properties change but the label does not, the retrieval system keeps returning the old profile, which is why a person, firm, or codebase can be misclassified at scale for years after the underlying properties have evolved. Reversibility: retrieval lock-in is in principle reversible by re-indexing, but the cost scales with the size of the retrieval graph that has compiled the label, not with the size of the substrate, which is why type-system migrations, org redesigns, and library reclassifications are slow. These inferences follow from the retrieval-infrastructure structure alone, so they apply equally to a mind, a codebase, or a market index, and they tell an analyst where lock-in will bite hardest and what re-indexing will cost.

Knowledge Transfer¶

The transferable content is the diagnostic — when an obstacle to novel use looks disproportionate to the underlying change, suspect retrieval lock-in — together with the three-part accounting and the secondary-index remediation. Because the structure lives in retrieval infrastructure rather than in any substrate, the move carries across domains that named it separately. The functional-fixedness diagnosis from cognition ports to type-system refactoring with the intervention vocabulary intact: relax the type, expose the property, build a secondary interface. The same diagnosis ports to staff redeployment: look past the title, index the property list, introduce a skills inventory as a secondary index. The market-classification example generalizes to academic classification, where the under-retrieval of cross-disciplinary work has the same structural cause as the under-retrieval of a misfiled firm. The historical sequence in information retrieval — from manual cataloguing to full-text search to embedding-based retrieval — is itself a series of retrieval-lock-in-breaking interventions, and the same move applies wherever labels obstruct property-level retrieval.

These transfers work because the structural roles are stable: a property-rich substrate, a compressing label, routine use through the label, novel use needing an obscured property, a retrieval infrastructure that has compiled the label, and a disproportionate decompression cost. A problem-solver re-purposing a tool, an engineer refactoring across a type boundary, a manager re-staffing across roles, and a librarian re-cataloguing a collection are all running the same move: identify the label doing the retrieval work, identify the needed property, and build a secondary index rather than transforming the substrate or trying harder. The portable lesson is that a shortcut which made routine retrieval cheap becomes an active obstacle to novel retrieval, and the cure is almost always to re-index against the property rather than to blame the categories or the categorizers — a lesson that travels intact from a screwdriver to a payments API to a staffing system, and that, once held, makes the indexing geometry the first place to look when a small change mysteriously costs a great deal.

Examples¶

Formal/abstract¶

Functional fixedness in cognition is the prime's origin instance, and the candle problem makes its mechanism precise. The property-rich substrate is a box of thumbtacks: it has many independent properties — it holds tacks, but it is also a rigid container, a flat platform, a mountable shelf. The category label compiled by routine use is "tack-container," which compresses the property list to the single use that retrieval has needed. The retrieval infrastructure is the solver's habit machinery, which finds the object by its label. Routine use runs through the label successfully — when you need tacks, "tack-container" is exactly right. The novel re-use — mounting a candle on the wall — requires a property the label omits and effectively contradicts: the box-as-shelf. The disproportionate decompression cost is the load-bearing invariant: the box is sitting in plain view with the needed property fully present, yet recovering "this is also a shelf" is structurally expensive because all the retrieval machinery has compiled the box under "container." The slowdown is not a matter of trying harder — solvers stare at the box — but of having to un-compile a category lock. The prime's central diagnostic split applies cleanly: this is retrieval lock-in (the obstruction is in the labeling, and re-indexing against "rigid flat surface" dissolves it instantly, which is why presenting the box empty — pre-decompressed — makes the problem easy), not substrate lock-in (the box was always capable of being a shelf).

Mapped back: the candle problem instantiates every role — a property-rich box, a compiled "container" label, successful routine use, an obstructed novel use needing the omitted shelf property, and a decompression cost located in the indexing rather than the substrate — making "re-index against the needed property" the literal solution.

Applied/industry¶

Software type signatures show the identical structure on an engineering substrate. The property-rich substrate is a function whose true property list includes its memory behavior, exception profile, concurrency guarantees, and performance characteristics. The category label is its type signature, which compresses that list to a matchable tag — callers see parse(s: String) -> Config, not the function's real properties. The retrieval infrastructure is the type system and the codebase's call graph, which finds and binds the function by its signature. Routine use runs through the signature cheaply: any caller wanting the declared behavior matches it and compiles. The novel re-use — say, refactoring to make the function async, or to surface a property the signature hides like "this allocates on the heap" — requires changing the property the label compiled away, and the disproportionate decompression cost is the asymmetry every engineer knows: a one-line behavioral change costs a sweeping refactor across the type boundary, because the cost scales with the retrieval graph (every call site the signature compiled), not with the size of the underlying code change. The prime's secondary-index remediation is the portable fix: rather than relabeling everything (disrupting all the routine use), expose the needed property through an additional interface or adapter while leaving the primary signature in place — the same move a manager makes who, unable to redeploy staff retrieved only by job title, builds a skills inventory as a secondary index against latent capabilities, and the same move a librarian makes adding full-text search atop a hand-catalogued collection.

Mapped back: the type signature is a category-retrieval lock-in — a property-rich function compressed to a matchable tag, routine use through the tag, novel use obstructed by an omitted property, decompression cost scaling with the call graph — so the remedy is a secondary index (adapter interface, skills inventory) rather than relabeling the substrate.

Structural Tensions¶

T1 — Retrieval Lock versus Substrate Lock (scopal). The prime's central split is between obstruction in the indexing machinery (re-indexable) and obstruction in the substrate itself (requiring transformation) — and the two are easily confused, with opposite remedies. The failure mode is misdiagnosis in either direction: transforming a substrate that was fine (the box could always be a shelf) because the real lock was in the label, or trying to re-index against a property the substrate genuinely lacks. Diagnostic: ask whether the needed property is present but unreachable or absent. If present, the lock is retrieval and re-indexing dissolves it cheaply; if absent, no amount of re-indexing helps. Confusing the two wastes effort on the wrong layer entirely.

T2 — Routine Speed versus Novel Reach (sign/direction). The same compression that makes routine retrieval fast makes novel retrieval slow — the label's benefit and its cost are the same property pointing in opposite directions for different uses. The failure mode is optimizing the label purely for routine throughput (a tighter tag, a more compiled habit) and thereby deepening the lock against future novel use, or loosening it for flexibility and taxing the common case. Diagnostic: ask what fraction of operations run through the label versus across it, and weight accordingly. There is no free compression; the prime's asymmetry means every gain in routine retrieval speed is a debt against re-use reach, and the right operating point depends on the expected mix.

T3 — Secondary Index versus Index Proliferation (scalar). The prime's portable remedy is to add a secondary index against the needed property rather than relabel everything — but every secondary index has its own maintenance cost and can itself compile into a lock, and a substrate with dozens of overlapping indexes becomes its own retrieval problem. The failure mode is unbounded index accretion: each novel use spawns another adapter, skills inventory, or alternate catalog, until the indexing layer is more complex than the substrate. Diagnostic: ask whether the new property warrants a durable index or a one-time decompression. The secondary-index move is cheap once; applied reflexively to every novel use, it trades a single category lock for a sprawling, unmaintained index mesh.

T4 — Update Lag (temporal). The prime notes that when a substrate's properties change but the label does not, retrieval keeps returning the stale profile — a person, firm, or codebase misclassified for years after its properties evolved. The failure mode is trusting the label as a current summary when it is a historical snapshot: staffing someone by an outdated role, pricing a firm by a classification it has outgrown. Diagnostic: ask when the label was assigned relative to when the substrate last changed, and whether anything forces re-labeling on substrate change. The lock is not only against novel properties the label never named but against evolved properties it once named correctly; currency of the label is a separate failure axis from completeness.

T5 — Decompression Cost versus Use Cost (measurement). The prime separates the cost of recovering the property list from the cost of using it once recovered — the lock lives entirely in the first, while the second is often modest. The failure mode is mis-attributing the expense: concluding the novel use is intrinsically hard (substrate-level difficulty) when the whole cost was the one-time decompression, and abandoning a re-use that would be cheap once the index is built. Diagnostic: estimate the two costs separately — is the obstacle reconstructing access to the property, or doing something hard with it? The prime predicts the obstacle is usually the former; reading a one-time indexing cost as ongoing use difficulty kills viable re-uses at the door.

T6 — Re-Index Cost Scales with Retrieval Graph, Not Substrate (scalar). Retrieval lock-in is in principle reversible, but the cost of re-indexing scales with the size of the retrieval graph that compiled the label — every call site, every habituated user, every downstream filing — not with the size of the substrate. The failure mode is estimating a re-categorization by the substrate's size (the code change is one line, the firm is one entity) and being blindsided by the graph-sized actual cost (every caller, every analyst, every index). Diagnostic: measure the fan-out of the label — how many retrieval points have compiled it — before promising a cheap re-index. The prime's reversibility is real but its price is set by how deeply the label was wired in, which is why type migrations and org redesigns are slow despite small underlying changes.

Structural–Framed Character¶

Category retrieval lock-in sits just structural-of-center on the structural–framed spectrum — a mixed-structural prime, aggregate 0.4, whose claim about retrieval infrastructure is substrate-neutral but whose vocabulary and home cases are dominantly cognitive.

One diagnostic reads cleanly structural and pulls the score below center: evaluative weight is zero. The lock-in is a value-neutral fact about compiled retrieval — the label took the place of the property list, obstructing novel re-use — with no inherent approval or disapproval; the same compression that "locks in" is what made routine retrieval cheap, so neither pole is good or bad in itself. The other four diagnostics each read half-framed, lifting the aggregate to 0.4. The vocabulary half-travels: "category," "retrieval," "functional fixedness," "Einstellung" come from cognitive psychology and carry that lineage, though the underlying claim — that retrieval infrastructure compiled a label in place of the property list — restates in software type systems, organizational role titles, lexicalized language, and market classifications. The origin is half-institutional, born in categorization-and-retrieval psychology rather than in a formalism. It is half human-practice-bound: its showcase instances are cognitive and social — minds, organizations, languages — yet the structural claim applies equally to a software index or any retrieval system that compiled a label, which is not bound to human practice. And invoking it half-imports a frame: you bring the cognitive picture of fixedness, but you also genuinely recognize a property-present-but-unreachable obstruction that is a real structural fact about the index. The substrate-faithful reading is a prime whose core claim about compiled retrieval is portable and value-neutral, scored mixed-structural because its cognitive-psychology vocabulary, origin, and largely mental-and-social home cases leave consistent half-marks — with the software and market instances confirming the structure is not confined to human minds.

Substrate Independence¶

Category-retrieval lock-in is a strongly substrate-independent prime — composite 4 / 5 on the substrate-independence scale. Its domain breadth is wide: a routine-retrieval label that compresses an item's property list and thereby blocks novel retrieval recurs across cognition (its origin — functional fixedness retrieving a screwdriver as "screwdriver" not as a conductive shaft; the Einstellung effect fixing a method), software (a type signature compressing a function's actual memory, exception, and concurrency behavior, so refactoring across the type boundary costs more than the code change), organizations (a person retrieved by title and remit, not latent skills, resisting cross-functional redeployment), language (lexicalization collapsing a phrase into a token whose parts become inaccessible under retrieval pressure), markets (a firm filed under one classification invisible to analysts searching another, producing documented reclassification price shocks), and knowledge organization (a document under one keyword set resisting retrieval under another). The structural abstraction is high because the load-bearing element — a retrieval index keyed on a compressed label that occludes the underlying property list — is relational and medium-neutral, describing a human memory, a type system, and a market taxonomy with the same machinery. The transfer evidence is concrete: in each substrate the same lock-in mechanism is documented, and the reclassification-shock and full-text/embedding-search responses are named instances of recognizing and breaking it. It stays at 4 rather than 5 because every instance presupposes a retrieval or categorization system — there is no purely physical substrate — so the pattern is medium-neutral across cognitive, computational, and institutional retrieval but not beyond them.

Composite substrate independence — 4 / 5
Domain breadth — 4 / 5
Structural abstraction — 4 / 5
Transfer evidence — 4 / 5

Relationships to Other Primes¶

Parents (2) — more general patterns this builds on

Category Retrieval Lock In is a kind of Lock-In

The file: 'the generic prime it specializes' — genus-to-species. category_retrieval_lock_in is lock_in where the costly-to-reverse commitment is a category label compiled into retrieval infrastructure, adding the retrieval-vs-substrate-lock split.
Category Retrieval Lock In presupposes, typical Search and Retrieval

The file: a specific PATHOLOGY of search_and_retrieval — the compression that makes routine retrieval cheap is what obstructs novel retrieval. It presupposes a retrieval/indexing infrastructure.

Path to root: Category Retrieval Lock In → Lock-In → Increasing Returns

Neighborhood in Abstraction Space¶

Category Retrieval Lock In sits in a sparse region of abstraction space (89^th percentile for distinctiveness): few abstractions share its structure, so a faithful description tends to retrieve it precisely rather than landing on a neighbor.

Family — Context-Keyed Mapping & State Switching (10 primes)

Nearest neighbors

Encoding Specificity — 0.69
Classification — 0.68
Native-Category Flattening — 0.68
Remapping — 0.68
Metonymy — 0.67

Computed from structural-signature embeddings · 2026-06-14

Not to Be Confused With¶

Category retrieval lock-in is most naturally placed against lock_in, the generic prime it specializes, and the relationship is genus-to-species with a crucial added discrimination. lock_in names any situation where a prior commitment is costly to reverse — switching costs, sunk investment, network effects, path-dependent standards — and its remedy is generally to reduce switching cost or accept the commitment. Category retrieval lock-in is the specific case where the costly-to-reverse commitment is a category label compiled into retrieval infrastructure, and its signature contribution is a discrimination that generic lock_in does not make: the split between retrieval lock-in (the obstruction lives in the indexing/labeling machinery and is dissolved by re-indexing, often via a cheap secondary index) and substrate lock-in (the obstruction lives in the substrate itself and requires transformation). This split inverts the usual lock-in intuition. Generic lock-in treats the cost as real and the substrate as the thing one is stuck with; this prime says the most common case is that the substrate is fine and only the index is stuck, so the cheapest move is almost always to re-index against the needed property rather than to transform anything. A practitioner reaching for lock_in will reason about switching costs and reversibility in the abstract; one reaching for this prime will first ask "is the needed property present-but-unreachable, or absent?" — and on the answer hangs whether the fix is a trivial secondary index or a substrate rebuild. Collapsing this prime into lock_in loses exactly the diagnostic that makes it useful.

A second genuine confusion is with search_and_retrieval, the embedding nearest neighbor and the general mechanism this prime is a pathology of. search_and_retrieval is the broad function of locating stored items by a query against an index — it describes how retrieval works and how to make it efficient. This prime describes a specific way retrieval fails: a label that was compiled to make routine queries cheap becomes an active obstacle to a novel query needing a property the label omits or contradicts, with a disproportionate decompression cost to recover the obscured property list. The two are related as a function to its characteristic failure mode, and the difference governs what one studies. A search_and_retrieval analysis optimizes the index for the queries it expects; this prime warns that optimizing for expected queries is exactly what builds the lock — the compression that makes routine retrieval fast is the same compression that makes novel retrieval slow (the prime's T2), so the very efficiency search_and_retrieval pursues manufactures the obstruction this prime diagnoses. The practical upshot is a tension the bare retrieval frame does not surface: every gain in routine-retrieval speed is a debt against re-use reach. Treating this prime as merely "a retrieval problem" misses that its content is the asymmetry between cheap compression and costly decompression, and that the remedy (a secondary index against the needed property, leaving the primary index in place) is precisely a way to escape the optimize-for-expected-queries trap without paying to relabel everything.

For a practitioner the distinctions order the response to a "we got stuck in our categories" complaint. First localize the lock with this prime's signature split: is the needed property present but unreachable (retrieval lock — re-index cheaply) or absent (substrate lock — fall back to generic lock_in reasoning about transformation cost)? Then, if retrieval, recognize that the obstruction is the predictable downside of search_and_retrieval efficiency and reach for the secondary-index remedy. The prime's unique value is relocating the puzzle from the substrate or the people to the indexing geometry, and predicting that the cheapest fix is almost always to re-index against the property rather than to blame the categories or rebuild the thing they categorized.

Solution Archetypes¶

No catalogued solution archetypes reference this prime yet.