Skip to content

Native-Category Flattening

Prime #
1011
Origin domain
Social Sciences
Subdomain
research methods and classification → Social Sciences

Core Idea

A source system carries its own meaning-bearing partition of the world. When an external observer recodes it into a foreign taxonomy without first preserving the source partition, the source's distinctions are silently collapsed — cells kept apart are merged, cells kept together split — and the residue is reported as if it were original. The failure is not classification but premature commitment to a foreign partition, and the loss is irrecoverable from the recoded labels.

How would you explain it like I'm…

Squishing My Groups

Imagine you sorted your toys into 'fast cars' and 'slow cars,' which matters a lot to you. Then a grown-up dumps them all into one bin labeled just 'cars,' and now nobody can tell which was fast and which was slow. Native-Category Flattening is squishing apart the groups someone carefully kept separate, so their real differences disappear.

Foreign Boxes Crush The Real Ones

Every group of people splits the world into categories that feel real to them and that they act on. Native-Category Flattening happens when an outsider re-labels things using their OWN categories without first keeping the original ones. Things the locals carefully kept apart get merged, and things they kept together get split, and then it's reported as if those new labels were the real story. The big problem isn't sorting — everyone sorts — it's committing to someone else's sorting too early and erasing the first one. Once it's erased, you usually can't get it back from the new labels.

Whose Partition Wins

A source group carries its own partition of the world — distinctions its members treat as real and act upon. Native-Category Flattening is when an outside observer recodes that source into their own taxonomy WITHOUT first preserving the source's partition, so the original distinctions silently collapse: native cells that were kept apart get merged, native cells kept together get split, and the residue is reported in the foreign scheme as if it were the original. The failure isn't classification itself — every analysis classifies — it's the premature commitment to a foreign partition, which destroys structure the source had and the downstream analysis would have needed. A key feature is asymmetry of recoverability: you can't reconstruct the source partition from the external labels alone. There can even be a feedback loop where the imposed categories reshape how the source describes itself over time.

 

Native-category flattening names a specific failure in coding, mapping, and translation pipelines. A source system carries its own partition of a domain into categories its participants treat as real and act upon. An external observer or system holds a different partition — a codebook, taxonomy, schema, label set, or lexicon. A recoding step maps source instances into the external categories. When that step does not first preserve the source partition, source distinctions are lost: native cells the external scheme does not separate are merged, and native cells it partitions differently are split, with the residue then reported in the external scheme as if it were original. The structural defect is not classification per se — every analysis classifies — but the premature commitment to a foreign partition, which destroys structure the source preserved and the downstream analysis would have needed. A signature property is an asymmetry of recoverability: external labels cannot reconstruct the source partition from themselves. An optional feedback loop can make the imposed partition reshape the source's own self-description over time. The frame surfaces a usually hidden choice in any such pipeline: whose partition gets to be ground truth, and whether it has been preserved long enough to remain useful.

Broad Use

  • Ethnography: coding participants' phenomena into an a-priori codebook erases the kin-term, illness, or moral distinctions they maintain.
  • Clinical coding: a patient's narrative of distress is recoded into diagnostic codes, losing their own "good days / the wave" partition.
  • Translation and NLP: source-language distinctions of aspect, evidentiality, or kinship collapse into target-language defaults.
  • Database interoperability: a source's category values without target analogues are bucketed into "other," losing distinctions downstream uses depended on.
  • Colonial administration: state taxonomies for caste, tribe, or occupation overwrite the lived partitions populations use.
  • Machine learning: annotators apply a fixed label set to data carrying finer native cuts, constraining every downstream model.

Clarity

Separates classification (necessary) from premature classification into a foreign partition (avoidable), turns the question from "what bucket does this go in?" to "did the source already have a bucket, and have I preserved it?", and itemizes the cost as specific merges and splits.

Manages Complexity

Decomposes a single "coding" act into two engineerable stages — preserve the source partition first, then map deliberately and reversibly — converting an opaque irreversible collapse into an inspectable transformation.

Abstract Reasoning

Exposes an asymmetry between cheap labels and expensive partitions (once lost, the cut is gone from the data), and a feedback loop in which an imposed taxonomy reshapes the population's self-description so later measurement "finds" it as a self-fulfilling artifact.

Knowledge Transfer

  • Across substrates: the roles correspond (source partition, external partition, recoding step), so the same moves transfer — carry source codes alongside analyst codes, build a documented reversible mapping, audit "other"-bucket heterogeneity, defer commitment.
  • Ethnography → ML/clinical: the preserve-first-then-map fix transfers without modification from kinship coding to dataset annotation to chart coding.

Example

Coding interviews about kinship into an English-based codebook merges the native mother's-brother / father's-brother contrast into "uncle" and splits a single native cousin-category across several English types — and once coded "uncle," no downstream analysis can recover which native category was meant.

Relationships to Other Primes

One-hop neighborhood: parents above, mutual partners to the right, children below.Native-CategoryFlatteningcomposition: ClassificationClassificationsubsumption: Translation and Conceptual BridgingTranslation and…

Parents (2) — more general patterns this builds on

  • Native-Category Flattening is a kind of, typical Translation and Conceptual Bridging — It is a pathology of translating one category scheme into another — lossy, asymmetric, one-way. Owner picks classification vs translation lineage.
  • Native-Category Flattening presupposes, typical Classification — The failure is a lossy recoding of a source's meaning-bearing partition into a foreign taxonomy; it presupposes a classification/recoding act and names its destructive (merge/split, irrecoverable) special case. Built on the recoding step.

Path to root: Native-Category FlatteningClassification

Not to Be Confused With

  • Native-Category Flattening is not Segmentation because flattening destroys an already-drawn partition by recoding, whereas segmentation creates boundaries in an un-partitioned domain.
  • Native-Category Flattening is not Decomposition because flattening is lossy and asymmetric (its labels cannot reconstruct the source cut), whereas decomposition's parts reassemble into the original.
  • Native-Category Flattening is not Interleaving because flattening is the one-way overwriting of one partition by another, whereas interleaving is the alternation of coexisting streams that can be unwoven.