Convexity¶
Core Idea¶
A set is convex when the straight line between any two members stays inside it; a function is convex when the chord between two points on its graph lies on or above the graph. Both express one shape: mixtures preserve membership, and the average of values dominates the value of the average — from which an unusual number of well-behaved properties follow.
How would you explain it like I'm…
The Marble Bowl
The Smiling Bowl
Chord Above the Graph
Broad Use¶
- Optimization: convex sets and functions are the dividing line between tractable and intractable problems.
- Economics: convex preferences and production sets underlie general-equilibrium existence; risk-aversion is convexity of disutility.
- Statistics and ML: concave log-likelihoods make maximum-likelihood convex, and convex relaxations (LP, Lasso, nuclear norm) make hard tasks tractable.
- Probability: the set of distributions is convex, and Jensen's inequality is the canonical bound behind many results.
- Decision theory: utilitarian aggregation, mixed strategies, and veil-of-ignorance arguments all use convex combinations.
- Engineering and physics: Lyapunov functions are convex stability certificates; thermodynamic free energy is convex in extensive variables.
Clarity¶
Relocates the question of difficulty from "is the data noisy?" to a structural property of the feasible set and objective — once a problem is seen as convex, the local-versus-global distinction collapses and "stuck in a local optimum" stops being a worry.
Manages Complexity¶
Converts a global question ("is this the best solution anywhere?") into a local one ("does any nearby direction improve?"), answerable by inspection at a single point — the hard accounting of trajectories and competing basins replaced by one shape check.
Abstract Reasoning¶
Unlocks reusable templates: Jensen's inequality (source of AM–GM, entropy bounds, risk premia), the separating hyperplane (dual certificates, Farkas, supporting prices), the convex hull as feasibility envelope, Carathéodory's bound, and convex relaxation.
Knowledge Transfer¶
- Optimization → policy: "local incentives reach a global optimum when the welfare function is concave" tells when decentralized markets succeed.
- Statistics → diagnosis: "this objective is non-convex" warns that estimation is sensitive to initialization, in econometrics and deep learning alike.
- Thermodynamics → economics: the common-tangent construction resolving a non-convex free energy is the same convexification turning a non-convex production set into two-firm specialization.
Example¶
Minimizing portfolio variance over the probability simplex is convex (any mixture of two allocations is an allocation), so the optimum is unique and found by any gradient method regardless of starting point, with dual prices on the constraints.
Relationships to Other Primes¶
Parents (1) — more general patterns this builds on
- Convexity presupposes Optimization — Convexity is the STRUCTURAL PROPERTY of a feasible set + objective that makes optimization tractable (local optimum = global); the file: 'Not optimization itself — optimization is the activity, convexity the property that governs it.' It presupposes an optimization setting to be load-bearing, but is a property OF it, not an is-a child of the activity.
Path to root: Convexity → Optimization
Not to Be Confused With¶
- Convexity is not Optimization because optimization is the activity of finding a best feasible point, whereas convexity is a structural property of the set and objective that guarantees the search succeeds globally.
- Convexity is not Linearity because linearity demands the chord lie on the graph (equality), whereas convexity demands only on or above (inequality) — admitting curvature like diminishing returns that linearity forbids.
- Convexity is not Pareto Efficiency because Pareto efficiency is a property of an outcome, whereas convexity is a property of the feasible region's shape — a convex set makes the frontier well-behaved, but the two are distinct.