Skip to content

Optimal Stopping Rule

Prime #
1040
Origin domain
Probability Decision Theory
Subdomain
sequential decisions → Probability Decision Theory
Aliases
Secretary Problem, Marriage Problem, Best Choice Problem

Core Idea

An optimal stopping rule is a structural object that maps a sequence of states or observations generated by an iterative process to a halt decision. The interesting structural questions in any instance are the same: what information structure is available before the decision (full history, noisy proxy, expected future value); what cost structure governs continuing (sampling cost, opportunity cost, regret) versus stopping (lost option value, premature commitment); whether the stop is reversible or final; and what adversarial structure the future obeys (exchangeable, adversarial, strategic counterparty). The optimal stopping rule under any choice of these parameters is a function from observed history to a halt decision — and the same rules recur with identical structural force across domains that look superficially unrelated.

The essential commitment is that when to stop is a structural question with substrate-independent parameters, not a matter of felt judgement. The arrangement factors a continuous decision stream into a single binary at each step — continue or halt — plus a stopping boundary that summarizes the rule, often expressed as a threshold on a running statistic. The boundary collapses the entire history into a sufficient statistic, which sharply reduces the cognitive load of sequential decision-making. The arrangement carries a characteristic dual-failure structure: a stopping rule can err by stopping too early or too late, each with its own cost calculus, and the boundary's design is precisely the trade-off between them. Recognizing a problem as a stopping problem — distinguishing it from a selection problem (which candidate, given all at once) or a search problem (where to look next) — is itself part of the arrangement: stopping is the structure where the order of arrival matters and the future is uncertain.

How would you explain it like I'm…

When To Grab It

Imagine picking the biggest seashell while walking down the beach, but you can't go back for one you passed. You have to decide each time: keep this one, or hope for a better one ahead? A stopping rule is your way of choosing the right moment to say 'this one, I'll stop now.'

Stop Or Keep Looking

An Optimal Stopping Rule is a smart way to decide when to quit looking and take what you have. You see things one at a time, in order, and once you pass one up you usually can't get it back. The rule looks at what you've seen so far and tells you: keep going, or stop now. The hard part is that stopping too early might mean missing something great, while stopping too late costs you time and chances. A good rule is built to balance those two mistakes against each other.

The Stopping Boundary

An Optimal Stopping Rule is a structural object that maps a sequence of states or observations from an iterative process to a halt decision — a function from observed history to 'continue or stop.' The key claim is that WHEN to stop is a structural question with substrate-independent parameters, not a matter of felt judgement: the same rules recur with identical force across domains that look unrelated. The structure has recurring questions — what information is available before deciding, what it costs to continue versus stop, whether the stop is reversible or final, and whether the future is friendly or adversarial. It factors a continuous decision stream into a single binary at each step plus a stopping boundary, often a threshold on a running statistic that collapses the whole history into a sufficient summary. The boundary carries a dual-failure structure: you can stop too early or too late, each with its own cost, and designing the boundary IS trading those off. Recognizing a problem as a stopping problem — distinct from choosing among options given all at once, or from deciding where to look next — is itself part of the arrangement, because stopping is where order of arrival matters and the future is uncertain.

 

An Optimal Stopping Rule is a structural object that maps a sequence of states or observations generated by an iterative process to a halt decision. The interesting structural questions in any instance are the same: what information structure is available before the decision (full history, noisy proxy, expected future value); what cost structure governs continuing (sampling cost, opportunity cost, regret) versus stopping (lost option value, premature commitment); whether the stop is reversible or final; and what adversarial structure the future obeys (exchangeable, adversarial, strategic counterparty). The optimal rule under any choice of these parameters is a function from observed history to a halt decision — and the same rules recur with identical structural force across superficially unrelated domains. The essential commitment is that when to stop is a structural question with substrate-independent parameters, not a matter of felt judgement. The arrangement factors a continuous decision stream into a single binary at each step — continue or halt — plus a stopping boundary that summarizes the rule, often expressed as a threshold on a running statistic; the boundary collapses the entire history into a sufficient statistic, sharply reducing the cognitive load of sequential decision-making. It carries a characteristic dual-failure structure: a rule can err by stopping too early or too late, each with its own cost calculus, and the boundary's design is precisely the trade-off between them. Recognizing a problem as a stopping problem — distinguishing it from a selection problem (which candidate, given all at once) or a search problem (where to look next) — is itself part of the arrangement: stopping is the structure where the order of arrival matters and the future is uncertain.

Structural Signature

the sequence-generating processthe per-step information structurethe continue-versus-halt binarythe continuation-cost against stopping-cost calculusthe reversibility specificationthe adversarial structure of the futurethe stopping boundary as sufficient statisticthe dual-failure (too-early/too-late) invariant

A configuration exhibits an optimal stopping rule when each of the following holds:

  • A sequence-generating process. Some iterative process emits states or observations one at a time, so that order of arrival is load-bearing and not all candidates are present at once. This is what distinguishes stopping from selection and search.
  • A per-step information structure. At each step a defined quantum of information is available — full history, a noisy proxy, or an expected forward value — on which the decision must rest.
  • The continue/halt binary. The arrangement factors a continuous decision stream into a single binary at each step: continue sampling or halt now. There is no third move.
  • An asymmetric cost calculus. Continuing carries a cost (sampling, opportunity, compute) and stopping carries a cost (regret, lost option value, premature commitment); the two are distinct and generally asymmetric.
  • A reversibility specification. The halt is either final or revisable, which changes the value of waiting.
  • An adversarial structure. The future is exchangeable, adversarial, or strategic, fixing which canonical boundary applies.
  • A stopping boundary. A threshold on a running statistic collapses the entire history into a sufficient statistic; once set, the moment-to-moment decision is mechanical. The boundary is the rule.
  • The dual-failure invariant. Every rule can err in exactly two ways — stopping too early or too late — and the boundary's placement is precisely the chosen balance between them.

These components compose into a single tunable object: information, cost, reversibility, and adversarial structure jointly fix a boundary on a running statistic, and that boundary encodes the entire too-early/too-late trade-off.

What It Is Not

  • Not the value of keeping options open (see optionality). Optionality is a state property — the worth of a held right to act later; the optimal stopping rule is the decision policy that says when to exercise or abandon that right. Optionality measures the asset; stopping prices the moment to spend it.
  • Not a one-shot decision (see decision). A decision is a single choice among present alternatives; stopping is the structure where alternatives arrive in sequence and order is load-bearing. The defining move is the continue/halt binary repeated over a stream, not a single committed pick.
  • Not the full sequential-control apparatus of markov_decision_processes_mdps. An MDP optimizes a policy over actions and rewards across many states; optimal stopping is the special case where the only action is "stop or continue." Importing the whole MDP machinery where the action set is binary over-models the problem.
  • Not satisficing (see satisficing). satisficing halts on the first option clearing an aspiration threshold; optimal stopping derives the threshold from the cost and information structure, with the boundary as a tuned sufficient statistic rather than a fixed good-enough bar.
  • Not signal fade (see fading). fading is a passive decay of a quantity over time; stopping is an active, modeled halt governed by a cost calculus, not the erosion of the thing being observed.
  • Common misclassification. Treating a fully-observable batch choice — all candidates present at once — as a stopping problem, and paying option-value premiums for "waiting" the real problem never imposed. Catch it by asking whether candidates genuinely arrive one-by-one and the accept is genuinely irreversible; if everything is simultaneously available, this is selection, not stopping.

Broad Use

The pattern recurs with the same anatomy across substrates. In probability and mathematics, the secretary problem, the optimal stopping theorem, Wald's sequential probability ratio test, and prophet inequalities are the canonical formal results. In statistics, sequential analysis and group-sequential clinical-trial designs with futility and efficacy boundaries operationalize stopping for ethical and resource reasons. In machine learning, early stopping monitors validation loss and halts training when generalization begins to degrade. In operations research, anytime algorithms expose a stop-anytime contract. In labour and search economics, house-hunting, job search, and Weitzman's Pandora's box are classical optimal-stopping problems. In meta-reasoning under resource bounds, "when to stop deliberating" is treated formally. In daily practice — when to stop reading a paper, fixing a bug, or negotiating — practitioners apply implicit stopping rules of varying quality. Drug discovery uses early go/no-go gates; military operations use commit-points; investment uses exit-rule discipline.

Clarity

The arrangement makes visible a question otherwise buried: when should I stop? is treated as felt judgement in most contexts, but optimal-stopping theory recasts it as a structural question with substrate-independent parameters. Naming the four parameters — information, cost, reversibility, adversarial structure — replaces "trust your gut" with "given my history and forward expected value, what does the boundary look like?" Once seen, the secretary problem and clinical-trial futility analysis become recognizably the same shape: optimal stopping over noisy proxies with asymmetric costs of continuation versus premature halt.

The clarifying force is to surface the dual-failure structure that intuition tends to suppress. A stopping rule can fail by stopping too early (forfeiting option value, premature commitment) or too late (paying excess sampling or opportunity cost), and these are governed by different parts of the cost calculus. Treating "stopping well" as a single virtue hides the trade-off; naming the two failure modes makes the boundary's placement an explicit decision about which error is more expensive in the case at hand.

Manages Complexity

The arrangement factors a continuous decision stream into a single binary at each step — continue or halt — plus a stopping boundary that summarizes the rule. The boundary collapses the entire history into a sufficient statistic, dramatically reducing the cognitive load of sequential decision-making. Without the prime, each substrate invents its own ad-hoc stopping discipline — research-coverage heuristics, "give-it-three-tries" rules, intuitive walk-away points. With it, the question becomes structural and the answer becomes computable, or at least systematically reasoned.

The leverage is that the boundary is a single tunable object whose placement encodes the entire dual-failure trade-off. Raise the boundary when a false stop is costly; lower it when a missed opportunity is costly; design for anytime behaviour when commit-time is itself uncertain. A reasoner equipped with the boundary abstraction does not need to re-derive a bespoke rule per domain; the canonical results — the 1/e rule for exchangeable arrivals, the sequential probability ratio test for sequential testing, the Snell envelope for Markovian processes, reservation-value rules for stationary search — provide ready boundaries indexed by the information and adversarial structure.

Abstract Reasoning

Optimal stopping rule trains a reasoner to ask:

  • What iterative process generates the sequence of states or observations?
  • What information is available at each step — full history, noisy proxy, or expected future value?
  • What is the cost of continuing (sampling, opportunity) and the cost of stopping (regret, lost option value)?
  • Is the stop reversible or final, and what adversarial structure does the future obey — exchangeable, adversarial, strategic?
  • Which canonical boundary fits the structure — 1/e for exchangeable arrivals, SPRT for sequential testing, Snell envelope for Markovian, reservation value for stationary search?
  • Is this actually a stopping problem, or a selection problem or a search problem in disguise?

The arrangement surfaces the meta-question explicitly: stopping is the structure where the order of arrival matters and the future is uncertain, distinct from selection (all candidates present at once) and search (where to look next). It also makes the dual-failure calculus a standing discipline — every stopping rule has two ways to be wrong, and the boundary's placement is the chosen balance between them. The deepest inference is that the boundary, as a sufficient statistic on the running history, is the entire content of the rule; once it is set, the moment-to-moment decision is mechanical.

Knowledge Transfer

Role mappings across domains:

  • Iterative process ↔ candidate stream / trial enrolment / training epochs / job offers / search sequence
  • Information structure ↔ full history / running test statistic / validation loss / observed offers
  • Cost of continuing ↔ sampling cost / patient exposure / compute / search effort
  • Cost of stopping ↔ regret / Type I error / under-training / accepting a worse option
  • Stopping boundary ↔ 1/e fraction / futility-efficacy boundary / early-stop criterion / reservation value
  • Dual-failure calculus ↔ stop-too-early versus stop-too-late, each with its own cost

A statistician designing a clinical-trial futility boundary, an ML practitioner watching validation loss, an economist analyzing job search, and a hiring manager interviewing candidates one at a time are solving the same structural problem: a halt decision over a sequence of observations under an information structure, a cost structure, a reversibility specification, and an adversarial structure, with a boundary that trades stopping too early against stopping too late. The cargo is a vocabulary (stopping boundary, patience, reservation value, value-of-information, futility, exploration/exploitation trade-off), a small set of canonical results (the 1/e rule, the sequential probability ratio test, the prophet bound), and an intervention catalogue (raise the boundary when a false stop is costly, lower it when missed opportunity is costly, design for anytime behaviour when commit-time is uncertain). The transfer is unusually clean: a statistician who learns the prime in sequential analysis can apply it to bug-fixing within a single sitting; an economist who learns it in job search can apply it to clinical trials. The reason is the high abstraction of the four parameters — information, cost, reversibility, adversarial structure — which are substrate-independent. What moves between fields is not a metaphor but the literal stopping object: a function from observed history to a halt decision, carrying its dual-failure calculus and its canonical boundaries, recognizable wherever order of arrival matters and the future is uncertain.

Examples

Formal/abstract

Consider the secretary problem in its pure form. The sequence-generating process presents \(n\) candidates one at a time in random order; each must be accepted or rejected irreversibly on arrival, and only relative rank against those already seen is observable. The per-step information structure is exactly that relative rank — never the absolute quality — so the decision rests on a thin proxy. The continue/halt binary is "reject and keep looking" versus "accept now"; the reversibility specification is final, which is what gives waiting its option value and its risk. The adversarial structure is exchangeable: order is uniformly random, no candidate is strategically placed. Solving for the policy that maximizes the probability of selecting the single best candidate yields the stopping boundary as a sufficient statistic: reject the first \(n/e\) candidates outright (a pure-observation phase that calibrates the reference), then accept the first one thereafter that beats every candidate seen so far. As \(n\) grows this rule selects the best candidate with probability \(1/e \approx 0.368\), and the threshold fraction itself converges to \(1/e\). The dual-failure invariant is visible in the boundary's placement: set the observation window too short and you stop too early on a merely-good candidate (premature commitment); set it too long and you run out of sequence having rejected the best one (waited too late). Shifting the boundary trades one error against the other, and no rule eliminates both — the \(1/e\) point is precisely the balance under exchangeability.

Mapped back: The \(1/e\) window is the stopping boundary collapsing the full arrival history into one threshold-on-rank; the irreversible accept and the random order are the reversibility and adversarial parameters, and the two ways the rule fails are exactly the too-early/too-late invariant.

Applied/industry

A group-sequential clinical trial instantiates the same object under a different cost calculus. The sequence-generating process is patient enrolment over interim analyses; the per-step information structure is the accumulating test statistic (e.g., a log-rank Z-score) computed at each interim look. The continue/halt binary is "enroll the next cohort" versus "stop the trial," but here it resolves into two engineered stopping boundaries rather than one: an efficacy boundary (stop early because the treatment is convincingly working) and a futility boundary (stop early because it convincingly is not). The cost of continuing is patient exposure to an inferior arm plus trial cost; the cost of stopping is a wrong call — declaring efficacy on noise (Type I error) or abandoning a real effect (Type II). The reversibility is effectively final: once halted, the cohort is not re-randomized. Designers compute boundaries (O'Brien–Fleming, Pocock) that spend the total error budget across looks so that the cumulative false-positive rate stays controlled despite repeated peeking. The dual-failure invariant is the explicit design target: a conservative early-efficacy boundary guards against stopping too early on a lucky interim; a futility boundary guards against running too long on a dead treatment. The same structure governs machine-learning early stopping — validation loss is the running statistic, the boundary is "halt when validation loss has not improved for \(k\) epochs," continuing costs compute and risks overfitting (too late), halting costs under-training (too early). And in stationary job search, the reservation wage is the boundary: accept the first offer above it, where the threshold is set so the marginal cost of one more search round equals its expected gain.

Mapped back: Trial boundaries, the early-stopping patience window, and the reservation wage are all the same stopping boundary indexed by different information and cost structures; in each, the engineered threshold is the chosen point on the too-early/too-late trade-off, and verifying it means asking whether continuation cost or premature-commitment cost dominates.

Structural Tensions

T1 — Too-Early versus Too-Late (Sign/Direction). The boundary is a single dial whose two ends are opposing errors: raise it and you forfeit good early candidates; lower it and you pay excess sampling cost or run out of sequence. The two errors trade against each other and no boundary kills both. The failure mode is collapsing the dual error into a single "stop well" virtue and then tuning toward whichever error was salient last — overreacting to a recent premature stop by stopping ever later, ratcheting the boundary in one direction. Diagnostic: ask which error is actually more expensive in this case; if you cannot say, the boundary is being set by recency, not by the cost calculus.

T2 — Optimality versus Model Fidelity (Scopal). The canonical boundaries — 1/e, SPRT, Snell envelope, reservation value — are optimal only under their assumed information and adversarial structure. Optimality is a within-model guarantee; it says nothing when the real arrival process violates the assumption. The failure mode is importing the 1/e rule (which assumes exchangeable, uniformly-random order) into a setting where order is non-stationary or adversarial, and trusting the optimality label rather than the premise. Diagnostic: before invoking a named boundary, name the exchangeability/stationarity assumption it rests on and check it against the actual sequence generator. Where the premise fails, robustness takes over from optimality.

T3 — Boundary as Sufficient Statistic versus Discarded History (Measurement). The boundary's power is that it collapses the entire observed history into one threshold-on-a-statistic — but that compression is only valid if the running statistic is genuinely sufficient. When the process has hidden state the statistic does not capture, the boundary discards predictive information and stops on a blind summary. The failure mode is treating a convenient proxy (validation loss, running max, a Z-score) as sufficient when it is merely available, so the rule halts on a statistic that has thrown away the signal that mattered. Diagnostic: ask what a fuller history would tell you that the running statistic cannot — if the answer is non-trivial, the boundary is lossy.

T4 — Reversibility Assumption versus Real Recoverability (Coupling). The reversibility parameter is treated as a binary input — final or revisable — but real stops are often partially reversible at a cost, and the rule's option value depends sharply on which assumption you encode. Modeling a costly-but-recoverable stop as strictly final inflates the apparent value of waiting; modeling a truly irreversible commit as revisable invites reckless early halts. The failure mode is inheriting the reversibility flag from the canonical problem (the secretary problem's irreversibility) rather than measuring the actual recovery cost. Diagnostic: ask what it would cost to un-stop; if that cost is finite and moderate, neither canonical extreme applies and the boundary needs a recovery term.

T5 — Per-Step Locality versus Global Horizon (Scalar). Each step presents a clean local binary — continue or halt now — but the optimal local choice depends on the global horizon (how much sequence remains, total budget). A rule tuned to look right at each step can be globally wrong when the horizon is mis-estimated or shifts mid-stream. The failure mode is greedy local stopping: applying a stationary reservation value when the remaining horizon is shrinking, so a threshold that was optimal early becomes far too patient near the end. Diagnostic: check whether the boundary is conditioned on remaining horizon; a horizon-blind threshold on a finite sequence is stopping too late by construction near the tail.

T6 — Recognizing the Problem Type versus Forcing the Frame (Scopal). Stopping is the structure where order of arrival matters and the future is uncertain — distinct from selection (all candidates present at once) and search (where to look next). Mis-typing the problem imports the wrong machinery wholesale. The failure mode is forcing a selection or search problem into a stopping frame: treating a fully-observable batch choice as if candidates arrived irreversibly one-by-one, and paying option-value premiums for waiting that the real problem does not impose. Diagnostic: ask whether the candidates are genuinely sequential and the accept genuinely irreversible; if all options are simultaneously available, this is selection, and the stopping boundary is solving a problem you do not have.

Structural–Framed Character

Optimal Stopping Rule sits firmly at the structural end of the structural–framed spectrum, consistent with its frontmatter label and an aggregate of 0.0: every diagnostic points the same way. The prime is a function from an observed history to a halt decision, parameterized by information structure, cost calculus, reversibility, and the adversarial structure of the future — a relational object with no home domain whose meaning depends on a discipline's vocabulary.

Walking the five diagnostics confirms the reading. The vocabulary travels freely: the secretary problem, Wald's sequential probability ratio test, the Snell envelope, and reservation-value search are stated in pure mathematical terms — boundary, patience, reservation value, value-of-information — that each substrate (clinical trials, ML early stopping, job search, bug-fixing) renders in its own words while the canonical results carry over unmodified, so no heavy home lexicon must be imported. The prime is value-neutral: a stopping boundary is neither good nor bad until the cost structure specifies whether stopping too early or too late is the more expensive error. Its origin is formal, in probability and decision theory, not in any human institution; the dual-failure invariant and the sufficient-statistic boundary are properties of sequential structure, not of social practice. It runs indifferently in non-human substrates — drug-discovery go/no-go gates, anytime algorithms, a statistician's interim look — wherever order of arrival matters and the future is uncertain. And invoking it merely recognizes a halt-decision-over-a-sequence pattern already present rather than importing an interpretive frame: naming a problem as a stopping problem distinguishes it from selection or search without adding any normative overlay. On every axis the prime reads structural, exactly as the grade records.

Substrate Independence

Optimal Stopping Rule is about as substrate-independent as a prime can be — composite 5 / 5 on the substrate-independence scale. Its signature is a function from an observed history to a halt decision, parameterized by information structure, cost calculus, reversibility, and the adversarial structure of the future — a wholly relational object with no home medium, which is why its structural abstraction is maximal. The domain breadth is equally wide: the identical halt-over-a-sequence shape recurs in probability and mathematics (the secretary problem, Wald's sequential probability ratio test, prophet inequalities), statistics (group-sequential clinical trials with futility and efficacy boundaries), machine learning (early stopping on validation loss), operations research (anytime algorithms), labour and search economics (job search, Weitzman's Pandora's box), meta-reasoning under resource bounds, and everyday practice (when to stop reading, debugging, or negotiating), as well as drug-discovery go/no-go gates and military commit-points. The transfer evidence is exceptionally concrete because what moves between fields is not a metaphor but the literal stopping object: a statistician's reservation-value rule, an ML practitioner's patience window, and a hiring manager's 1/e threshold are recognizably the same boundary indexed by different cost and information structures, carrying its dual-failure calculus unchanged.

  • Composite substrate independence — 5 / 5
  • Domain breadth — 5 / 5
  • Structural abstraction — 5 / 5
  • Transfer evidence — 5 / 5

Relationships to Other Primes

One-hop neighborhood: parents above, mutual partners to the right, children below.Optimal Stopping Rulecomposition: DecisionDecision

Parents (1) — more general patterns this builds on

  • Optimal Stopping Rule presupposes, typical Decision

    An optimal stopping rule repeats a continue/halt DECISION over a stream under uncertainty and trade-off; it presupposes the decision prime (committing to one alternative under uncertainty) and specializes it to the sequential, irreversible, order-matters halt structure. The file: distinct from a one-shot decision because alternatives arrive in sequence.

Path to root: Optimal Stopping RuleDecisionConstraint

Neighborhood in Abstraction Space

Optimal Stopping Rule sits among the more crowded primes in the catalog (12th percentile for distinctiveness): several abstractions describe nearly the same structure, so a description that fits it will tend to fit its neighbors too — transporting it usually means disambiguating within this family rather than landing on it exactly.

Family — Staged Processes & Drift (32 primes)

Nearest neighbors

Computed from structural-signature embeddings · 2026-06-14

Not to Be Confused With

The optimal stopping rule is most often conflated with optionality, and the embedding places them nearest (similarity 0.894) for good reason — both turn on the value of waiting. But they sit on opposite sides of the same transaction. optionality is a state variable: the worth, right now, of holding a right whose exercise can be deferred, a measure of how much flexibility a position carries. The optimal stopping rule is the policy that consumes that flexibility — the function from observed history to a halt decision that says precisely when the held option should be exercised or allowed to lapse. One can possess high optionality and still stop badly (exercising too early forfeits the very value optionality measures); one can face low optionality and still need a sharp stopping rule (a near-irreversible commit with a thin margin for error). The practitioner who confuses the two will try to price the asset when the task is to time its use, or will assume that simply having options guarantees good outcomes, when the boundary placement — the stopping rule — is what realizes or squanders the option's value.

It is also worth distinguishing optimal stopping from markov_decision_processes_mdps, the general sequential-control framework of which stopping is a degenerate but important special case. An MDP optimizes a policy over a rich action set, where each action shapes both immediate reward and the distribution over next states; the agent is steering a trajectory. Optimal stopping collapses the action set to a single binary — continue or halt — with no steering: the sequence-generating process runs on its own and the only lever is when to get off. This restriction is what makes stopping tractable where the full MDP is not: the boundary becomes a sufficient statistic precisely because there is no state to drive, only a moment to choose. The distinction matters because importing MDP machinery (value iteration over a large state-action space, reward shaping) where the real action is binary over-engineers the problem, while forcing a genuine multi-action control problem into a stopping frame under-models it — the analyst must first ask whether the future can be steered or only exited.

A subtler confusion is with satisficing, since both produce a "good enough, stop now" verdict over a sequence. The difference is where the threshold comes from. satisficing posits an aspiration level exogenously — a fixed bar set by habit, norm, or bounded rationality — and halts at the first option clearing it, without claiming the bar is optimal. Optimal stopping derives its boundary from the information structure, the asymmetric cost calculus, and the adversarial structure of the future, so the threshold is a tuned object that encodes the entire too-early/too-late trade-off and shifts as those parameters change. Satisficing is content with a defensible rule of thumb; optimal stopping insists the rule of thumb be the right rule given the cost structure. A practitioner who treats a satisficing aspiration as if it were an optimal boundary will trust a habit where the cost calculus may demand a quite different cutoff; one who insists on full optimality where data is too thin to estimate the parameters may be better served by an honest satisficing bar.

These distinctions matter because each names a different load-bearing question. Confusing stopping with optionality swaps timing the exercise for valuing the right; confusing it with markov_decision_processes_mdps swaps a binary exit for a steerable trajectory; confusing it with satisficing swaps a derived boundary for an inherited bar. Naming the boundary as a sufficient statistic on a sequential, irreversible, costly stream keeps the analyst from solving a selection problem with stopping machinery, or a steering problem with a halt rule.

Solution Archetypes

No catalogued solution archetypes reference this prime yet.