Skip to content

Unreliable Narrator

Core Idea

A source whose report is systematically distorted relative to the reality it describes, so the receiver holds two layers at once — the account as given, and a transformation model of how the source distorts it — and inverts the known distortion to recover ground truth rather than reading the account as a transparent window.

How would you explain it like I'm…

The Story-Stretcher

Imagine a friend tells you a story, but he always makes himself the hero and leaves out the parts where he messed up. You don't throw the story away. You just remember 'he stretches things' and guess what really happened.

The Predictable Twist

An Unreliable Narrator is someone telling you about something, but their telling is bent in a regular, predictable way. Maybe they're showing off, maybe they only saw part of it, maybe they're trying to trick you. The smart move isn't to ignore them. You keep two things in your head: what they said, AND a little model of how this particular person twists things, so you can work backward to the truth.

Model-Then-Invert the Source

An Unreliable Narrator is a source whose report is bent away from reality in a *systematic* way, not just by random mistakes. The bend lines up with who they are: their interests, their position, what they can see, how their mind works. Unlike plain skepticism, which just says 'maybe that's wrong' and stops, this asks you to hold two layers at once: the account itself, and a model of *how* this source distorts. You then invert the known distortion to recover what's probably true. The same move works for a lying witness, a biased reporter, or a self-deceived friend.

 

An Unreliable Narrator is a source-of-account whose report is systematically distorted relative to the reality it claims to describe, in a way the receiver must explicitly model rather than ignore. The protocol is two-layered: you maintain (a) the source's account as given and (b) a *transformation model* of how that source distorts — its limits, biases, motives, and blind spots. The distortion is not measurement noise; it is systematic and often strategic, correlating with the source's identity, position, incentives, or cognitive structure, and it may be deceptive, innocently limited, confused, or self-deceived. The structural commitment is sharp: do not treat the account as a transparent window onto reality, but as the output of a known-fallible reporting function, and use what you know of that function to infer backward toward ground truth. This invariant survives across literature, courtroom testimony, journalism, organizational reporting, science, and machine-generated text; only the generating function changes. What separates it from generic skepticism is the second step: skepticism stops at doubt, while this specifies a *correction* — invert the known distortion to recover truth, to the extent possible.

Broad Use

  • Literature: the origin (Booth) — the reader reads through a narrator's distortion toward the implied author's signal.
  • Forensics: testimony processed against a model of witness fallibility — distance, lighting, motive, memory decay, leading questions.
  • Journalism: explicit models of source motive and proximity, with triangulation across independent sources.
  • Organisational reporting: status reports are systematically optimistic; "on track with a few yellow items" means red.
  • Scientific publication: the file-drawer effect and p-hacking make the literature a distorted reporting function meta-analysts correct.
  • Machine-generated text: fluent output shaped by training distribution, sycophancy, and confabulation, demanding an explicit distortion model.

Clarity

Separates what the source says from what is the case, showing that a known-biased report adds information once inverted — a consistently sandbagging "on track" is more informative than a noisy one.

Manages Complexity

Compresses a zoo of source-skepticism practices into one three-slot protocol — account, source model, inversion — with different distortion mechanisms as interchangeable contents of the source-model slot.

Abstract Reasoning

Lets one reason about what makes a source informative: a wide-but-characterised distortion beats a narrow-but-uncharacterised one, and adversarial distortion (change incentives) demands a different correction than innocent distortion (understand structural limits).

Knowledge Transfer

  • Witness evaluation → LLM evaluation: corroboration and leading-question probes map onto cross-prompt consistency checks and adversarial sycophancy detection.
  • Publication-bias correction → executive reporting: estimating the file-drawer maps onto estimating unreported red items.
  • Survey response-bias → clinical self-report: social-desirability corrections transfer to engagement surveys.

Example

In meta-analysis the published literature is the unreliable narrator: the file-drawer mechanism biases the published mean upward, and a selection model inverts it — funnel plots and trim-and-fill reconstruct the true effect rather than averaging the distorted published effects.

Not to Be Confused With

  • Unreliable Narrator is not Verification because verification checks the account directly against accessible ground truth, whereas this move is used when ground truth is inaccessible and the only leverage is modelling the source's distortion.
  • Unreliable Narrator is not Belief Formation because belief formation is an agent's general updating engine, whereas this prime names a specific feature of the evidence — that it arrives through a known-fallible reporting function.
  • Unreliable Narrator is not Signaling because signaling is a strategic sender conveying a costly-to-fake quality, whereas this prime covers innocent and self-deceived distortion too and aims to recover ground truth, not read a quality.