Experimental Design¶
Core Idea¶
The deliberate planning of an experiment to maximize causal-inference power and minimize confounding, given resource and ethical constraints.
How would you explain it like I'm…
How to Test Fairly
Planning a Fair Test
Designing Causal Studies
Broad Use¶
- Experimental science: Fisher's randomized controlled trials (RCTs), blocking, factorial designs, Latin squares.
- Software engineering: A/B testing, multi-armed bandits, canary deployments, feature flag rollouts.
- Clinical medicine: RCT protocols, blinding (single/double), placebo controls, stratification.
- Psychology: within-subject designs, between-subject designs, counterbalancing, order effects.
- Agriculture: field trials, crop rotation studies, soil amendment testing.
- Operations research: experimental simulation, DOE (Design of Experiments) frameworks.
Clarity¶
Names the bridge between research questions and data collection. Surfaces the tension between internal validity (did the treatment cause the effect?) and external validity (does it generalize?). Distinguishes experimental design as a planning phase from randomization (a technique) and statistical inference (the analysis phase).
Manages Complexity¶
Reduces an open-ended research problem into a structured protocol: identify causal question, define treatments and outcomes, eliminate or control confounders, allocate units to treatments, specify measurement plan. Bounds scope by forcing explicit choices about sample size, randomization mechanism, and blinding.
Abstract Reasoning¶
Encourages thinking in counterfactuals and potential outcomes: what would have happened if the unit received the other treatment? Frames all observed data as one realization of many possible experiments, sharpening focus on design robustness rather than luck.
Knowledge Transfer¶
The same structural principles—randomization, blocking, balance, replication—recur across clinical trials, software experiments, agricultural trials, and manufacturing process optimization. Tools developed in one domain (matched pairs, fractional factorials, sequential testing) transfer to others.
Example¶
A software team wants to know if a new search algorithm reduces latency. Rather than deploying to all users, they randomly assign half to the new algorithm and half to the control. They stratify by region to ensure geographic balance, measure median latency across a 48-hour window, and pre-specify a non-inferiority threshold. This mirrors a clinical trial comparing two drugs: randomization ensures exchangeability, stratification controls for a known confounder, and pre-specification prevents p-hacking.
Relationships to Other Primes¶
Parents (1) — more general patterns this builds on
- Experimental Design is a decomposition of Comparison — Experimental design is the specific shape comparison takes when it becomes a controlled, intervention-based architecture for causal inference.
Children (6) — more specific cases that build on this
- Confounding presupposes Experimental Design — Confounding presupposes Experimental Design: identifying and controlling third-variable common causes is the central problem the design must address.
- Statistical Power presupposes Experimental Design — Statistical power presupposes experimental design because its computation requires the pre-specified architecture of treatment assignment, sample size, and outcome measurement.
- Blocking (In Experimental Design) is a decomposition of Experimental Design — Blocking is the specific shape experimental design takes when known nuisance variability is absorbed by stratifying units before randomization.
- Factorial Design is a decomposition of Experimental Design — Factorial design is the specific shape experimental design takes when multiple factors are varied simultaneously to reveal main effects and interactions.
- Randomization is a decomposition of Experimental Design — Randomization is the specific shape experimental design takes when treatment assignment is made stochastic to neutralize observed and unobserved confounders.
- Sampling (Representativeness) is a decomposition of Experimental Design — Sampling representativeness is the specific shape experimental design takes when inference from observed units must generalize to a defined target population.
Path to root: Experimental Design → Comparison
Not to Be Confused With¶
- Experimental Design is not Design Prototyping because Experimental Design involves controlled assignment of units to treatments to establish causality, whereas Design Prototyping materializes design decisions into tangible learning instruments without assignment of causal conditions.
- Experimental Design is not Factorial Design because Experimental Design is the broader architecture encompassing treatment assignment, outcome measurement, and analysis planning, whereas Factorial Design is a specific technique that simultaneously varies multiple factors.
- Experimental Design is not Hypothesis Testing (Null vs. Alternative) because Experimental Design is the framework for collecting data so causal claims are valid, whereas Hypothesis Testing is the post-collection statistical procedure applied to evaluate evidence against a null model.