An agent maintains a separately-indexed, updateable model of another agent's hidden states — beliefs, knowledge, intentions — and routes prediction through that second model rather than through ground truth, so it can hold "I know X, but they do not."
Imagine you hide a cookie in the blue box while your friend is watching, but then they leave the room and you sneak it into the red box. When your friend comes back, where will they look? They'll look in the blue box — because *they* don't know you moved it, even though *you* do. Knowing that other people can think something different from what you know is the whole trick.
What They Believe
Theory Of Mind is keeping a little model in your head of what *someone else* is thinking — what they know, want, or believe — and using it to guess what they'll do. The key is that their picture of the world can be *different* from yours. Suppose you watch a toy get moved while your friend is away; you know where it really is, but you can also figure out that your friend will look in the old spot because *they* still believe it's there. You have to keep two pictures separate: the true one in your head, and the one inside your friend's head. Without this, you'd be bad at things like surprises, jokes, teaching, or knowing when someone's been tricked — because all of those depend on what the *other person* knows, not just what's true.
False-Belief Tracking
Theory of Mind is the pattern in which an agent *maintains and updates a model of another agent's hidden internal states* — beliefs, desires, intentions, knowledge, attention, ignorance — and uses it to predict and respond to their behavior. The key is that your own world-state and the other agent's are kept *formally separate*: you can hold 'I know X, but they don't know X yet' and reason about behavior that depends on their belief rather than on the truth. Three moves follow: *perspective-taking* (what can they perceive from where they are?), *false-belief tracking* (what do they believe even when it's wrong?), and *recursive embedding* (what do they think I think they think?, to some depth). Every operation specifies four things: the target agent, the mental-state type, the content, and the depth of recursion — and many real failures get three right and one wrong. It is the dual of *omniscient* reasoning that acts only on the world as it truly is; the giveaway that you have the machinery is being able to model a state *known to be false* — picturing someone looking where they *believe* the object is, not where it actually is.
Theory Of Mind is the structural pattern in which an agent maintains and updates a model of another agent's hidden internal states — beliefs, desires, intentions, knowledge, attention, ignorance — and uses that model to predict, anticipate, and respond to the other's behaviour. The modeller's own world-state and the modelled agent's world-state are kept formally separate: the modeller can hold the belief 'I know X, but they do not yet know X' and reason about behaviour that depends on the other's belief rather than on the truth. The commitment is that an agent in a social, adversarial, cooperative, or instructional context cannot rely on its own world-model alone; it must run a second model, indexed to the other agent, with potentially divergent contents, and route decisions through that second model rather than through ground truth. Three structural moves follow: perspective-taking — representing what the other can perceive given their position, attention, and sensors; false-belief tracking — representing what the other believes even when that belief is incorrect; and recursive embedding — representing what the other thinks the modeller thinks they think, to some depth k. Every theory-of-mind operation specifies four parameters: the target agent being modelled, the mental-state type (belief, knowledge, intention, desire, attention, emotion), the content (the proposition the state is directed at), and the depth of recursion; many real failures are right about three and wrong about one — correct agent and content but wrong belief, or correct at depth-1 but wrong at depth-2. The pattern is the dual of omniscient or ground-truth-only reasoning: an agent that acts on the world as it is, ignoring what others know, fails predictably at deception, persuasion, instruction, and negotiation. The operational signature of having the machinery at all is the ability to model a state known to be false — to represent the other looking where they believe the object is, not where it actually is.
Forces the question whose mind, in what state of knowledge?, relocating a class of failures from "they're being difficult" to "I modelled their information state incorrectly," and pinpointing which of four parameters (target, state-type, content, recursion depth) went wrong.
Collapses deception detection, instructional design, negotiation, and multi-agent planning into one move — maintain a separate model of another's information state — bounded by the fact that real reasoning bottoms out at depth-2 or depth-3.
Licenses channel-indexed updating (update only agents who had access to the channel) and false-belief reasoning (hold a model known to be false), the operational signature of the machinery.
In the Sally-Anne task, the world changed (the marble moved) and the child saw it, but Sally had no access to that channel — so a child with theory of mind keeps the two models separate and predicts Sally looks in the basket, representing a state known to be false.
Parents (1) — more general patterns this builds on
Theory Of Mindis a kind ofMental Model — Theory of mind is the SPECIFIC case of mental_model where the modelled thing is another agent's hidden mental states, kept SEPARATELY INDEXED from one's own world-model (false-belief capacity). The file: mental_model is 'any internal representation', ToM is the agent-target, dual-indexed specialization.
Children (1) — more specific cases that build on this
Curse Of Knowledgeis a kind of, typicalTheory Of Mind — *** curse_of_knowledge is a CANDIDATE (CAND-R2-154-08), not a canonical prime — recorded as links_to_other_candidates below, NOT a corpus reparent. *** The file: the curse is a FAILURE MODE of theory of mind (a breakdown of the separately-indexed second model), not a parallel prime.
Theory Of Mind is not Mental Model because a mental model is any internal representation of how something works whereas theory of mind is the specific case modelling another agent's hidden mental states, kept separately indexed so it can hold contents known to be false.
Theory Of Mind is not Perspective because perspective is a vantage-dependent view of the world whereas theory of mind is a model of another's mind — beliefs, knowledge, and recursion above mere perception.
Theory Of Mind is not Curse Of Knowledge because the curse is a failure mode of theory of mind (over-attributing one's own knowledge) whereas theory of mind is the capacity whose breakdown produces it.