Wargaming¶
Core Idea¶
Wargaming is the structured rehearsal of a plan against an adversary that adapts, played out in turns or rounds so that move and counter-move can surface assumptions the plan did not know it was making. The pattern has four load-bearing roles — a plan or hypothesis under test, a blue cell representing the friendly actor, a red cell representing an adversary or any adaptively hostile environment, and an adjudicator or rule-set that resolves moves against state. Unlike static scenario analysis or single-step risk assessment, wargaming threads adaptation into the simulation itself: the red cell sees blue's move and chooses its own in response, and so on, until the plan either survives a worthwhile sequence of contact or fails in a way that exposes which assumption it depended on.
The point is not prediction; the point is contact — the cheapest available substitute for finding out the hard way. The central structural move is to substitute an adaptive role for a probability distribution: where a Monte Carlo simulation rolls dice, a wargame rolls another mind. That substitution is what makes the technique useful in environments where the adversary is intelligent — where the future move depends on what move you just made, and so cannot be sampled from a fixed distribution. A good wargame's output is correspondingly distinctive: not a forecast of what will happen, but a prioritised list of the assumptions the plan is exposed to and the moves the adversary has that the plan did not account for. Treating that output as a prediction is the classic failure mode of the pattern.
How would you explain it like I'm…
Practice Against a Sneaky Friend
The Back-and-Forth Test
Rolling Another Mind
Structural Signature¶
the plan or hypothesis under test — the protagonist role enacting it — the adaptive adversary role — the adjudicator resolving moves against state — the turn-structured move/counter-move sequence — the substitution of an adaptive agent for a probability distribution — the output as exposed assumptions, not a forecast
A configuration exhibits wargaming when each of the following holds:
- A plan under test. Some plan, strategy, or hypothesis is the object of evaluation; the exercise exists to stress it, not to enact it for real.
- A protagonist role. A party enacts the plan, making the moves the plan prescribes and adapting within its bounds.
- An adaptive adversary role. A second party represents an opponent or adaptively hostile environment that observes the protagonist's move and chooses its own in response. Its responsiveness — not mere randomness — is what distinguishes the pattern.
- An adjudicator. A referee, rule-set, or model resolves each pair of moves against the evolving state, determining outcomes the players do not control unilaterally.
- A turn-structured sequence. Play proceeds in rounds of move and counter-move, threading adaptation into the simulation so that later turns — where plans characteristically break — are reached.
- An adaptive-agent-for-distribution substitution. Where a sampling method would roll dice from a fixed distribution, this pattern rolls another mind; it is the right move precisely when the binding uncertainty is strategic (the adversary's response depends on your choice) rather than parametric.
- Assumption-exposure output. The product is a prioritised list of the assumptions the plan was exposed to and the adversary moves it failed to account for — not a probability of success. Treating the residue as a forecast is the pattern's defining failure mode.
Composed, these substitute rehearsed adaptive contact for both static analysis and costly first-contact-with-reality, surfacing under adversarial pressure the silent assumptions a plan could not otherwise know it held.
What It Is Not¶
- Not
scenario_planning. Scenario planning enumerates a set of plausible futures and prepares against each; wargaming threads an adaptive adversary into the simulation so the future responds to your moves — it rolls a mind, not a fixed set of branches. - Not
monte_carlo_simulation. Monte Carlo samples outcomes from a fixed probability distribution; wargaming applies where the binding uncertainty is strategic — the opponent's distribution is not fixed because it depends on your choice — and its output is exposed assumptions, not a win probability. - Not
competition. Competition is real contest for a real stake; wargaming is rehearsal of contest whose purpose is to stress a plan before enactment, harvesting assumptions, not to actually win anything. - Not
learningin general. Learning updates a model from experience; wargaming is the specific structured rehearsal-against-an-adaptive-opponent that produces the experience — a method, not the model update it may feed. - Not
red_teaming_in_strategyalone. Red-teaming is the adversary role; wargaming is the full apparatus — protagonist, adversary, adjudicator, turn structure — of which an adaptive red is one load-bearing component. - Not
delphi_methodorenvironmental_scanning. Those harvest expert judgement or scan for signals without adaptive move/counter-move; wargaming's distinctive content is the turn-structured contact that surfaces what static elicitation cannot. - Common misclassification. Reading a wargame's output as a forecast — "we won 7 of 10 plays." That treats an adaptive exercise as a Monte Carlo, overclaiming precision the small sample cannot support and burying the actual product, the exposed assumptions.
Broad Use¶
- Military doctrine (named form): from the Prussian Kriegsspiel to modern campaign-level and tactical exercises.
- Cybersecurity: red-team/blue-team and purple-team engagements, tabletop incident-response drills.
- Policy and diplomacy: crisis simulations, Track II exercises, sanctions tabletops.
- Financial regulation: stress-testing scenarios where the adversarial element is a hostile market regime.
- Negotiation training: devil's-advocate role-play and rehearsal against a sparring partner who attacks the deal.
- Public-health preparedness: surge tabletops and pandemic exercises.
- Product strategy: pre-mortems with a competitor-in-the-room role and "kill the product" exercises.
- Legal preparation and software resilience: moot court and mock trial; chaos engineering and fault-injection campaigns where the chaos is the adaptive red. The shared structure is the same in each: an adaptive opposition is instantiated as a role and given moves, the plan is forced through contact with it, and the residue is a list of assumptions the plan had silently made.
Clarity¶
Wargaming clarifies that there is a distinct mode of plan-evaluation that neither static analysis nor first-contact-with-reality can substitute for: the interactive, adversarially adaptive rehearsal. Static analysis enumerates risks but does not sequence them; expert review critiques in the abstract but does not react to your counter-move; running the plan for real exposes flaws at maximum cost. Wargaming sits between, exposing flaws under adaptive pressure at rehearsal cost.
It also clarifies what a good wargame outputs: not a prediction of what will happen, but a list of assumptions the plan is exposed to and a sense of which moves the adversary holds that the plan did not account for. Keeping this output type straight is the prime's central discipline, because treating the wargame's residue as a forecast — "the model says we win" — both overclaims and misses the point. The value is in the exposed assumptions, which are actionable, not in an illusory probability, which is not.
Manages Complexity¶
The construct compresses a sprawling design problem — how to evaluate a plan in an adversarial environment — into a small inventory of design choices: how rich the red model is (mirror image, stylised doctrine, dedicated team), how adjudication works (free-form umpire, rule-based resolution, computed model), how many turns to run, what to measure, and whether to play with or without fog of war. Each choice is a knob with known trade-offs: a more realistic red is more expensive and harder to staff; rule-based adjudication is cheaper but constrains creativity; longer games surface emergent dynamics but lose focus.
Because the wargaming literature names these knobs explicitly, an analyst stepping into a new domain — a hospital running a surge tabletop, a product team running a competitor-response drill — can borrow the design vocabulary directly rather than reinventing it. The complexity of "how do we test this plan against a thinking opponent?" collapses into the bounded problem of setting a handful of well-understood knobs, each with a documented cost on one side and a documented benefit on the other.
Abstract Reasoning¶
Reasoning with wargaming proceeds by asking what would a thinking opponent do here, and how would my plan react? This is structurally different from "what could go wrong?" because it forces a model of an agent with its own objectives, information, and resources rather than a flat hazard list. It also forces sequential thinking — not just the first move and the first response, but the third and fourth turns, where plans tend to break. The abstract move is the substitution of an adaptive role for a probability distribution, which is the right move exactly when the binding uncertainty is strategic rather than parametric: when the opponent's distribution is not fixed because it depends on your own choices.
This frames a clean decision rule about when to wargame at all. Where the binding uncertainty is parametric — variation around a base case with a stable distribution — sensitivity analysis or Monte Carlo suffices. Where the binding uncertainty is strategic — an intelligent adversary whose move responds to yours — only an adaptive agent can surface the failure, because no fixed distribution captures the adversary's responsiveness. Recognising which kind of uncertainty dominates a problem is itself the abstract payoff of holding the prime, and it prevents the common error of reaching for a probabilistic tool where an adaptive one is required.
Knowledge Transfer¶
The roles map across substrates: the plan under test is the campaign plan, the incident-response plan, the deal, the product strategy; the blue cell is the friendly team enacting it; the red cell is the adversary, threat actor, hostile market, surge of patients, or competitor; and the adjudicator is the exercise control or observer-controllers. A cyber-defence purple-team exercise and a hospital mass-casualty tabletop are doing structurally the same thing: instantiating an adversarial role against a defending role under a referee, running moves and counter-moves under fog of war, and harvesting the post-action report for assumptions the plan did not know it had.
The vocabulary ports with substance. "Fog of war" becomes "incomplete visibility into the incident"; "red flexibility" becomes "patient case-mix variance"; the after-action review is the same artefact in every domain. The interventions port too — the move list, the white-card injects, the adjudication rules, the post-action template — which is why cyber tabletop exercises borrowed almost their entire format from military exercises, and pandemic-preparedness exercises later borrowed from cyber. The structural prime is what is being copied; the domain dressing is what changes. A single worked instance shows the substance: a hospital network wargaming a multi-day ransomware event, with blue as the incident-response and clinical teams, red as offensive-security consultants instructed to exfiltrate before encrypting, and control running a clock and a white-card inbox, surfaces by the third turn that the paper downtime procedures assume a printer network that the same compromised domain controllers have taken down, that clinicians have no offline medication lists, and that the prepared press statement assumes data was not exfiltrated — none of which a static walk-through would have produced, because only the red cell's adaptive moves created the specific contact conditions under which those silent assumptions broke. The same logic, exercise design, and output template serve an air-force exercise, a diplomatic crisis simulation, and a site-reliability game day — different substrates, identical prime.
Examples¶
Formal/abstract¶
The Prussian Kriegsspiel is the rigorously formalised origin instance and exposes every role of the prime as an explicit rule. The plan under test is a commander's operational scheme; the blue cell is the officer enacting it; the red cell is a second officer commanding the opposing force; and the adjudicator is an umpire equipped with a rule-set, dice for residual chance, and tables converting move pairs into casualties and ground gained. The defining structural move — substituting an adaptive agent for a probability distribution — is visible in the rules' treatment of fog of war: blue and red each see only their own situation map, and the umpire holds the true state, so red's next move is computed from red's own appreciation of blue's observed action, not drawn from a fixed table. This is exactly why a static casualty distribution cannot stand in: red's response is conditional on blue's choice, making the uncertainty strategic rather than parametric. The turn structure forces play past the first exchange into the third and fourth rounds where a plan's reserve commitments and supply assumptions break. And the output is the prime's signature artifact, not a win probability but an after-action list of the assumptions blue's plan silently held — that a bridge would be intact, that a flank was secure — surfaced only because an adapting red created the contact conditions that tested them.
Mapped back: The operational scheme is the plan, the two commanding officers are blue and red, the rule-and-dice umpire is the adjudicator, the hidden situation maps are fog of war, and the after-action assumption list is the assumption-exposure output rather than a forecast.
Applied/industry¶
A hospital network rehearsing a multi-day ransomware event runs the prime in a public-health-preparedness substrate. The plan under test is the cyber incident-response and clinical-continuity plan; the blue cell is the incident-response and clinical teams; the red cell is a team of offensive-security consultants instructed to behave like a real adversary — exfiltrate data before encrypting, and adapt to blue's containment moves; and the adjudicator is exercise control, running a clock and a white-card inbox that injects consequences. Because red adapts to blue's actions rather than following a script, the exercise reaches turn-three contact conditions a static walk-through never would: blue's paper downtime procedures assume a printer network that the same compromised domain controllers have taken down, clinicians discover they hold no offline medication lists, and the prepared press statement assumes data was never exfiltrated — three silent assumptions exposed as the output, not a probability of recovery. A structurally identical applied instance is a financial regulator's stress test where the adaptive red is a hostile market regime, or a product team's competitor- response drill where a colleague is assigned to play the rival and counter each go-to-market move. The intervention is the same in each: harvest the exposed assumptions and repair the plan before first contact with reality.
Mapped back: The continuity plan is the plan under test, the clinical teams and the offensive-security consultants are blue and red, exercise control is the adjudicator, the adaptive multi-day play is the turn structure, and the exposed printer/medication/press assumptions are the prime's assumption- exposure output.
Structural Tensions¶
T1 — Assumption Exposure versus Forecast (output-type/scopal). The prime insists its output is a list of exposed assumptions, not a probability of success — this is its central discipline. But the boundary is fragile: a wargame run many times with quantified adjudication starts to look like a Monte Carlo, and stakeholders crave a number. Where the temptation to read a win-rate off the exercise wins, a competing prime — probabilistic simulation — silently displaces this one. Failure mode: reporting "we won 7 of 10 plays" as a forecast, both overclaiming precision the small adaptive sample cannot support and burying the actual product, the exposed assumptions. Diagnostic: ask what the exercise's deliverable is; if it is a probability rather than an assumption inventory, the prime has been misused as a predictor.
T2 — Red Realism versus Red Tractability (cost/fidelity). The technique's value rises with how faithfully red adapts, but a more realistic red is more expensive, harder to staff, and harder to control within the exercise. The prime's defining move — substitute a mind for a distribution — is only as good as the mind substituted. Failure mode: a red cell too weak or too scripted to surface the non-obvious moves, so the wargame validates the plan against a strawman and breeds false confidence — worse than no exercise, because it certifies the untested. Diagnostic: ask whether red ever genuinely beat blue or forced a re-plan; a red that only ever loses is a mirror, not an adversary.
T3 — Strategic versus Parametric Uncertainty (boundary with a competing prime). The prime is the right tool exactly when the binding uncertainty is strategic — the adversary's move depends on yours. Where the dominant uncertainty is parametric, sensitivity analysis or Monte Carlo is correct and wargaming is wasted ceremony. Misreading which uncertainty binds picks the wrong tool. Failure mode: convening an elaborate adversarial exercise for a problem whose real risk is parameter variance around a stable base case, spending adaptive-game cost on uncertainty a distribution would have captured cheaply. Diagnostic: ask whether the threat's behaviour responds to your choices; if it is indifferent to your moves, the uncertainty is parametric and the wargame is overkill.
T4 — Adjudication Authority versus Player Agency (coupling/control). The adjudicator must resolve moves the players cannot control unilaterally — but the adjudicator's rule-set encodes its own assumptions, and a heavy-handed umpire can script outcomes that foreclose exactly the surprises the exercise exists to find. Adjudication that is too tight suppresses emergence; too loose, it produces incoherent results. Failure mode: an adjudicator whose rulings railroad play toward a predetermined narrative, so the "exposed assumptions" are the umpire's, not the plan's. Diagnostic: ask whether any exercise outcome surprised the control cell itself; an adjudication that only ever confirms control's prior is scripting, not resolving.
T5 — Rehearsal Cost versus First-Contact Cost (temporal/economic). The prime sits between cheap static analysis and costly real contact, justified because it exposes flaws at rehearsal cost. But rehearsal is not free, and an over-invested wargaming programme can consume the resources and calendar that the actual plan needs — the rehearsal crowding out the performance. Failure mode: gold-plating the exercise (months of design, huge cells, elaborate injects) until its cost approaches the cost of the contingency it rehearses, or delays readiness past the threat's arrival. Diagnostic: compare the exercise's cost and lead time to the expected cost of the failure it guards against; when rehearsal cost rivals first-contact cost, the economic premise has inverted.
T6 — Surfaced Assumptions versus Acted-Upon Assumptions (temporal/follow-through). The exercise's product is a prioritised list of exposed assumptions — but the prime says nothing about whether that list is ever fixed. The structural gap is between surfacing a vulnerability and closing it, and a wargame discharges only the first. Failure mode: a ritual cycle of well-run exercises that reliably expose the same assumptions year after year because the after-action findings are filed and never remediated, converting insight into theatre. Diagnostic: trace last cycle's top exposed assumptions; if they reappear unaddressed in this cycle, the exercise is producing knowledge the organisation does not consume.
Structural–Framed Character¶
Wargaming sits on the framed side of the structural–framed spectrum. There is a genuine relational move underneath — substitute an adaptive agent for a probability distribution and iterate move/counter-move until a plan's hidden assumptions surface — but the prime is bound to a human role-play practice tightly enough that invoking it imports an adversarial frame rather than merely recognising a pattern already running in a physical system.
The two diagnostics pinned high drive the grade. Institutional origin and human-practice-bound both read fully framed: wargaming is a military-strategic practice, and its constitutive roles — a blue cell, a red cell, an adjudicator with a rule-set, played in turns — are humans (or designed agents) enacting roles. There is no inanimate substrate in which a "red cell" spontaneously rehearses against a "plan"; the technique exists only where actors deliberately stand up an opposing role and convene a structured exercise. The remaining diagnostics sit at the middle. Vocabulary travels only with translation (0.5): "blue/red cell," "adjudication," "move and counter-move," "contact" carry an adversarial home lexicon that cyber, finance, negotiation, and resilience contexts adopt rather than retell freely. Evaluative weight is half-loaded (0.5): wargaming arrives with an implicit endorsement — surfacing assumptions cheaply is good, treating the output as a prediction is bad — so it is not value-neutral until specified. Import-vs-recognise (0.5) reflects that invoking the prime half-imports the stance of adversarial rehearsal rather than naming a structure that is simply present.
The honest concession is that the structural insight — replace a die-roll with another mind because an intelligent adversary's next move depends on yours — is real and portable, which is what gives the prime a transfer record across domains. But that insight only instantiates inside a deliberately convened, role-played, adjudicated human exercise. With institutional origin and human-practice-binding both fully framed and the rest at the middle, the aggregate sits firmly framed, matching the assigned grade.
Substrate Independence¶
Wargaming is a moderately substrate-independent prime — composite 3 / 5 on the substrate-independence scale. The structural insight is real and portable: replace a die-roll with another mind, because an intelligent adversary's next move depends on yours. That insight gives the prime a genuine transfer record (transfer evidence 4 / 5) across military planning, cyber red-teaming, policy exercises, financial stress tests, negotiation rehearsal, software resilience drills (chaos engineering, game days), and competitive sports. But domain breadth (3 / 5) and structural abstraction (3 / 5) are both capped by the same ceiling: every instance only instantiates inside a deliberately convened, role-played, adjudicated human exercise. There is no physical or biological substrate where adversarial rehearsal happens of itself — institutional origin and human-practice-binding both read fully framed. The strength and concreteness of the cross-domain transfer within that band lifts the composite to a 3, but the constitutive dependence on human role-play keeps it from climbing further.
- Composite substrate independence — 3 / 5
- Domain breadth — 3 / 5
- Structural abstraction — 3 / 5
- Transfer evidence — 4 / 5
Relationships to Other Primes¶
Parents (1) — more general patterns this builds on
-
Wargaming is a kind of, typical Foresight
Wargaming is a foresight method — disciplined anticipation of plural futures via adaptive move/counter-move rehearsal. A specialization of foresight whose distinctive move is substituting an adaptive agent for a probability distribution. Owner may prefer no hard parent for a framed practice.
Children (1) — more specific cases that build on this
-
Red Teaming In Strategy decompose Wargaming
The red cell (adaptive adversary role) is one load-bearing COMPONENT of the wargaming apparatus; the file: 'red-teaming is the adversary role; wargaming is the full apparatus.' red_teaming_in_strategy is a candidate.
Path to root: Wargaming → Foresight
Neighborhood in Abstraction Space¶
Wargaming sits in a sparse region of abstraction space (90th percentile for distinctiveness): few abstractions share its structure, so a faithful description tends to retrieve it precisely rather than landing on a neighbor.
Family — Adaptation Under Adversarial Pressure (14 primes)
Nearest neighbors
- Adversarial Boundary Navigation — 0.69
- Mixed Strategy — 0.68
- Stage Gate Process — 0.68
- Game-Theoretic Strategy — 0.67
- Counterfactual Reasoning — 0.67
Computed from structural-signature embeddings · 2026-06-14
Not to Be Confused With¶
The most consequential confusion is with scenario_planning. Both are
foresight methods that explore how a plan fares against possible futures, and
both are deployed by the same strategy teams, which makes them easy to merge.
The structural difference is whether the future adapts. Scenario planning
constructs a small set of internally coherent futures — high-growth,
stagnation, disruption — and pressure-tests the plan against each as a fixed
backdrop; the scenarios do not respond to the protagonist's moves.
Wargaming's defining move is to substitute an adaptive agent for a fixed
backdrop: the red cell observes blue's choice and chooses its counter, so the
"future" is generated turn by turn in response to the plan, reaching the third
and fourth moves where plans characteristically break. The discriminating
question is whether the binding uncertainty is parametric (variation around a
base case — scenario planning's domain) or strategic (the adversary's move
depends on yours — wargaming's). Choosing wrong wastes adaptive-game cost on
parametric uncertainty, or under-prepares a strategic problem with static
scenarios.
A closely related confusion is with monte_carlo_simulation. The boundary
is fragile precisely because a wargame run many times with quantified
adjudication starts to look like a Monte Carlo, and stakeholders crave the
single number a Monte Carlo provides. But Monte Carlo's entire validity rests
on sampling from a fixed distribution — it is the right tool exactly when the
uncertainty is parametric and the underlying process indifferent to the
analyst's choices. Wargaming is the right tool exactly when that assumption
fails: the adversary's response distribution is not fixed, because it is
conditioned on your own move, so no amount of sampling from a stationary
distribution captures it. Equally important, the two have different outputs —
Monte Carlo yields a probability, wargaming yields a prioritised inventory of
exposed assumptions. Reporting a wargame's play-tally as a win-rate is the
characteristic error: it misuses the prime as a predictor and discards its
actual product.
Finally, wargaming is distinct from competition. Both involve adversaries
contending under rules, and a wargame can feel indistinguishable from a real
contest in the moment. But competition is contest for a real stake whose
outcome is the point; wargaming is rehearsal whose stake is fictitious and
whose point is to surface, cheaply and before real contact, the assumptions a
plan silently holds. The red cell is instructed to play hard not to win but to
break the plan instructively. Conflating the two leads to two errors: treating
a rehearsal as if its result were a real victory (false confidence from beating
a strawman red), or treating a real competition as merely a rehearsal (failing
to act on a genuine loss). The diagnostic is whether anything is actually at
stake in the outcome, or whether the outcome is a means to expose assumptions.
Solution Archetypes¶
No catalogued solution archetypes reference this prime yet.