Skip to content

Probability

Core Idea

Quantifying uncertainty and likelihoods.

How would you explain it like I'm…

How Likely Something Is

When you flip a coin, you do not know if it will land on heads or tails. But you can say there is about a one-out-of-two chance for heads. Probability is just a number, between zero (it will never happen) and one (it will definitely happen), that says how likely something is. It is a way to put a size on what you are not sure about.

Measuring how likely things are

Probability is a way of putting numbers on how sure or unsure you are about something. The number is always between zero and one, where zero means no chance, one means certain, and one-half means a fifty-fifty chance. There are rules for how to combine these numbers: if two things cannot both happen, you add their chances; if you learn new information, you can update the chances. These rules turn vague guesses like probably into something you can calculate, compare, and check against what actually happens.

Measuring Uncertainty

Probability is the way mathematics measures uncertainty by attaching numbers to events. Each event gets a number between zero and one, with zero meaning the event cannot happen and one meaning it is certain. The numbers follow strict rules. The total across all possible outcomes is one. The chance of either of two non-overlapping events is the sum of their individual chances. Conditional probability lets you update a chance once you learn something new. These rules turn vague language like likely or rare into a system you can calculate with. A claim about probability is not complete unless it names the set of possible outcomes, the event in question, the assigned number, and how that number should be interpreted: a long-run frequency, a degree of belief, or a physical tendency.

 

Probability is the calibrated quantification of uncertainty: a numerical assignment to events or propositions that obeys a fixed set of coherence rules and supports consistent reasoning and decision-making under incomplete information. The modern formal foundation is the measure-theoretic axiomatization given by Kolmogorov in 1933, which fixes a sample space of possible outcomes, a collection of events (a sigma-algebra), and a probability measure that assigns to each event a number in [0, 1] satisfying normalization (the whole sample space gets probability one) and countable additivity (the probability of a disjoint union is the sum of probabilities). From these axioms follow expected value, conditional probability, independence, and the full apparatus of stochastic reasoning. Every well-formed probability claim names four things: the sample space, the event whose probability is asserted, the measure that assigns the number, and an interpretation, frequentist (long-run relative frequency), Bayesian (degree of rational belief), or propensity (physical tendency), that fixes what the number means and how it can be tested. Without all four parts the claim is incoherent; with them, probabilities can be combined, conditioned, and compared by anyone who accepts the same axioms.

Broad Use

Helps manage risk, predict outcomes, and make decisions in data-rich, uncertain environments.

Clarity

Quantifies uncertainty, e.g., predicting weather or medical risks.

Manages Complexity

Quantifies uncertainty, allowing reasoning under incomplete information.

Abstract Reasoning

Encourages probabilistic thinking and scenario evaluation.

Knowledge Transfer

Widely used in data science, game theory, and decision-making under uncertainty.

Example

Weather forecasts use probability to predict the likelihood of rain based on historical and current data.

Relationships to Other Primes

Foundational — no parent edges in the catalog.

Children (16) — more specific cases that build on this

  • Ensemble is a kind of Probability — An ensemble is a specific kind of probability object, treating realizations as draws from a distribution to be characterized.
  • Regression to the Mean is a kind of Probability — Regression to the mean is a kind of probability phenomenon in which extreme observations re-measure closer to the population mean due to transient noise.
  • Bayesian Updating presupposes Probability — Bayesian updating presupposes probability because the prior-times-likelihood-equals-posterior rule operates on probability distributions over hypotheses.
  • Distributional Assumption presupposes Probability — A distributional assumption presupposes probability because it commits to a specific probability distribution shape for uncertain quantities.
  • Expected Utility is part of Probability — Expected utility is a constituent piece of probability reasoning; it provides the probability-weighted aggregation of value over outcomes.

Not to Be Confused With

  • Probability is not Uncertainty because Probability is the calibrated quantification of uncertainty via a measure obeying additivity and normalization, while Uncertainty is the broader structural condition of incomplete knowledge—Probability is a specific formalization, Uncertainty encompasses cases without clean probabilistic measures.
  • Probability is not Statistical Significance (p-Value) because Probability is the foundational quantification of uncertainty for all events, while p-Value is a tail-probability statement testing data incompatibility with a null hypothesis—the first is general quantification, the second is a specific evidential test.
  • Probability is not Variability because Probability quantifies the uncertainty about unknown outcomes before they occur, while Variability describes the observable spread in outcomes after the fact—the first is predictive, the second is descriptive.
  • Probability is not Statistical Inference because Probability is the formalization of uncertainty via a measure obeying composition rules, while Statistical Inference uses probability models to draw conclusions about parameters from data—the first is the formal apparatus, the second is its application to knowledge extraction.
  • Probability is not Confidence Intervals because Probability is the general quantification of uncertainty, while Confidence Intervals are a specific frequentist procedure providing calibrated coverage of unknown parameters—the first is foundational, the second is an inferential tool.