Record Reconciliation
Core Idea
Record reconciliation matches records from one naming system to entities in another and declares, for each pair, a typed sameness verdict (equivalent, near-match with stated loss, ambiguous, or no-match), together with an explicit statement of what the match preserves and what it loses. The payload is a persistent, citable, typed claim of cross-system sameness. The mapping is a joint, not a merge, so each system keeps its own granularity.
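A minimal sketch of how such a typed claim might be represented. The names `MatchType` and `MatchClaim`, their fields, and the validation rules are illustrative assumptions, not part of any particular reconciliation tool.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Tuple


class MatchType(Enum):
    EQUIVALENT = "equivalent"
    NEAR_MATCH = "near-match"
    AMBIGUOUS = "ambiguous"
    NO_MATCH = "no-match"


@dataclass(frozen=True)
class MatchClaim:
    """One citable claim about a source record (field names are hypothetical)."""
    source_id: str                      # record in the source naming system
    target_ids: Tuple[str, ...]         # zero, one, or several target entities
    match_type: MatchType
    preserves: Tuple[str, ...] = ()     # what a consumer may carry across
    loses: Tuple[str, ...] = ()         # what the match does not carry across

    def __post_init__(self) -> None:
        # Assumed consistency rules between verdict, cardinality, and loss.
        n = len(self.target_ids)
        if self.match_type is MatchType.NO_MATCH and n != 0:
            raise ValueError("no-match must not name a target")
        if self.match_type is MatchType.AMBIGUOUS and n < 1:
            raise ValueError("ambiguous needs at least one candidate target")
        if self.match_type in (MatchType.EQUIVALENT, MatchType.NEAR_MATCH) and n != 1:
            raise ValueError("equivalent and near-match name exactly one target")
        if self.match_type is MatchType.NEAR_MATCH and not self.loses:
            raise ValueError("a near-match must state what it loses")


# Example with made-up identifiers: the claim links two records, it does not merge them.
claim = MatchClaim(
    source_id="catalog:42",
    target_ids=("registry:A17",),
    match_type=MatchType.NEAR_MATCH,
    preserves=("person identity",),
    loses=("pseudonym distinction",),
)
```

Keeping the claim immutable and separate from both records is one way to reflect the joint-not-merge idea: neither system's record is rewritten.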
Broad Use
- Cultural heritage: catalog entries reconciled to authority files so differently-spelled names resolve to one entity.
- Scholarly identity: publication records reconciled to person identifiers across disambiguation pipelines.
- Bioinformatics: gene names reconciled across registries, where the mapping table, including its one-to-many cases, is the load-bearing artifact.
- Customer and patient data: master-data management reconciling heterogeneous source records into one canonical identity.
- Trade and clinical coding: shipment descriptions reconciled to harmonized codes, narrative diagnoses to code sets.
- Cross-jurisdictional law: a foreign judgment or marriage reconciled to a domestic legal category with explicit carry-across.
Clarity
Forces an explicit distinction between three otherwise-conflatable claims — that two records are the same, can be used as the same for a purpose, or share enough to link but not identify — so systems stop propagating identity claims they cannot defend.
Manages Complexity
Turns distributed, repeated, error-prone matching into one inspectable, updatable artifact whose errors are typed and visible rather than scattered and silent, while letting each system keep its own internal logic.
Abstract Reasoning
Makes the typing of the claim the central reasoning act, because the match type bounds the inference a consumer may draw — and predicts the pathologies: untyped matches collapse distinctions, hidden cardinality merges referents, unstated loss lets consumers over-claim.
Knowledge Transfer
- Authority control → genomics: the role-set (source record, target entity, match type, preservation conditions) lets a practitioner read a gene-mapping table with the same questions used for a bibliographic crosswalk.
- Finance: a customer record reconciled against a sanctions list mirrors a museum specimen reconciled against taxonomic registries.
- Across the family: the same rules (type every match, name the loss, make ambiguous cases visible, define an update procedure) transfer across reference-system practice.
Example
A local catalog's author records are reconciled against a global author-identifier registry: equivalent where dates and affiliations agree, near-match-with-loss where the registry conflates two pseudonyms the catalog keeps separate, ambiguous where three entries share a name (escalated rather than guessed), and no-match for a regional author absent from the registry.
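The four outcomes in this example can be sketched in a few lines. Everything here is hypothetical: the record layout, the identifiers, the author names, and the decision rules (exact name lookup, then agreement on birth year and affiliation) are assumptions chosen only to make the verdicts visible.

```python
def reconcile(record, catalog, registry):
    """Return (verdict, target_ids, note) for one catalog record."""
    candidates = [e for e in registry if record["name"] in e["names"]]
    if not candidates:
        return ("no-match", [], "no registry entry carries this name")
    if len(candidates) > 1:
        ids = [e["id"] for e in candidates]
        return ("ambiguous", ids, "several entries share the name; escalate")
    entry = candidates[0]
    agrees = (entry["born"] == record["born"]
              and entry["affiliation"] == record["affiliation"])
    if not agrees:
        # Assumption: conflicting evidence is escalated, not guessed.
        return ("ambiguous", [entry["id"]], "evidence conflicts; escalate")
    # Names the registry folds into this entry but the catalog keeps apart.
    other_names = {r["name"] for r in catalog if r["id"] != record["id"]}
    conflated = sorted(set(entry["names"]) & other_names)
    if conflated:
        return ("near-match", [entry["id"]],
                "loses the catalog's separation from: " + ", ".join(conflated))
    return ("equivalent", [entry["id"]], "dates and affiliation agree")


catalog = [
    {"id": "cat:1", "name": "Ada Moreno", "born": 1950, "affiliation": "Univ. X"},
    {"id": "cat:2", "name": "R. Vale", "born": 1961, "affiliation": "Press Y"},
    {"id": "cat:3", "name": "Quill North", "born": 1961, "affiliation": "Press Y"},
    {"id": "cat:4", "name": "J. Smith", "born": 1970, "affiliation": "Lab Z"},
    {"id": "cat:5", "name": "Ilse Taan", "born": 1948, "affiliation": "Regional Press"},
]
registry = [
    {"id": "reg:10", "names": ["Ada Moreno"], "born": 1950, "affiliation": "Univ. X"},
    # One registry entry that conflates two pseudonyms.
    {"id": "reg:20", "names": ["R. Vale", "Quill North"], "born": 1961, "affiliation": "Press Y"},
    {"id": "reg:31", "names": ["J. Smith"], "born": 1970, "affiliation": "Lab Z"},
    {"id": "reg:32", "names": ["J. Smith"], "born": 1970, "affiliation": "Lab Q"},
    {"id": "reg:33", "names": ["J. Smith"], "born": 1982, "affiliation": "Lab Z"},
]

results = {r["id"]: reconcile(r, catalog, registry) for r in catalog}
for source_id, (verdict, targets, note) in results.items():
    print(source_id, verdict, targets, note)
```

Note that the ambiguous case returns all three candidate identifiers instead of picking the closest one, which keeps the unresolved decision visible to whoever reviews it.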
Relationships to Other Primes
Parents (1) — more general patterns this builds on
- Record Reconciliation presupposes (typical) Equivalence Relation: a typed cross-system sameness verdict (equivalent, near-match with loss, ambiguous, or no-match) deliberately blocks the transitive closure that an equivalence relation enjoys. It presupposes the equivalence machinery precisely in order to control where that machinery must not apply across two sets. (Owner may prefer parentless.)
Path to root: Record Reconciliation → Equivalence Relation
Not to Be Confused With
- Record Reconciliation is not Provenance because provenance records intra-system lineage (where a record came from), whereas reconciliation records a cross-system sameness claim between two records in different regimes.
- Record Reconciliation is not an Equivalence Relation because an equivalence relation is transitive identity within a set, whereas reconciliation is a partial, typed claim across two sets where transitivity is precisely the failure mode to block.
- Record Reconciliation is not Native-Category Flattening because flattening is the failure mode of imposing one system's partition on another's data, whereas reconciliation is the positive discipline whose absence produces it.
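The contrast with an equivalence relation can be made concrete with a small sketch. It compares a naive transitive closure over all links with a closure restricted to links typed `equivalent`. The link data and the helper `merged_groups` are hypothetical, and treating `equivalent` links as safe to chain is itself an assumption that a real system might not accept.

```python
def merged_groups(links, allowed_types):
    """Group identifiers by chaining only links whose type is in allowed_types."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b, match_type in links:
        root_a, root_b = find(a), find(b)  # registers both identifiers
        if match_type in allowed_types:
            parent[root_a] = root_b

    groups = {}
    for node in list(parent):
        groups.setdefault(find(node), set()).add(node)
    return sorted(sorted(g) for g in groups.values())


# Two catalog pseudonyms each near-match the same conflated registry entry.
links = [
    ("cat:1", "reg:10", "equivalent"),
    ("cat:2", "reg:20", "near-match"),
    ("cat:3", "reg:20", "near-match"),
]

# Untyped closure: cat:2 and cat:3 collapse into one referent.
naive = merged_groups(links, {"equivalent", "near-match"})

# Typed closure: near-match links stay links and are never chained.
typed = merged_groups(links, {"equivalent"})

print(naive)
print(typed)
```

In the untyped run the two pseudonyms end up in one group purely because they share a target, which is the collapse the typed verdict is meant to prevent.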