Skip to content

Information Set Specification And Completeness Verification

Essence

Information Set Specification and Completeness Verification turns vague market-efficiency language into a scoped test. Instead of asking whether a price is simply efficient, it asks: efficient relative to which information, available to whom, through which pathway, and within what time window?

The archetype is most visible in finance, but the structure also applies to prediction markets, marketplace rankings, dynamic pricing systems, indices, benchmarks, and other market-like signals. A signal can be informationally complete relative to public information and incomplete relative to local, costly, private, delayed, or cross-market information.

Compression statement

Efficient-market claims become ambiguous when the information set is undefined. This archetype makes the claim explicit: identify which information is in scope, who could access it, when it became available, how it should enter prices or market-like signals, and how quickly the signal should adjust. Then test for incomplete incorporation by comparing signal response, residual opportunities, lagged predictability, cross-market leakage, stale quotes, or arbitrage gaps against the declared information set. The result is a scoped completeness judgment rather than a vague assertion that the market already knows everything.

Canonical formula: information_incorporation_claim = (signal, information_set, access_population, availability_time, incorporation_pathway, latency_window); completeness_gap = available_in_scope_information - information_reflected_in_signal

Problem pattern

Decision-makers often say that information is “priced in” without specifying the information set. That phrase can hide very different claims: all public news, all analyst expectations, all order-flow information, all private information, or merely all information that the speaker personally knows. Without a boundary, the claim cannot be tested.

The result is overtrust in prices and market-like signals. Stale local information may persist in real estate prices. Earnings surprises may produce delayed drift. Related instruments may incorporate the same information at different speeds. Dynamic prices may omit current inventory or competitor information. The issue is not that markets never aggregate information; it is that aggregation must be specified and verified.

Intervention pattern

The intervention begins with an information-set boundary statement. It then inventories what was available, when it became available, who could access it, and how it could enter the signal. The analyst sets a latency window, tests the observed response, probes residual opportunities, and revises the claim when the evidence shows incomplete incorporation.

A useful output is not a universal verdict that markets are efficient or inefficient. It is a scoped judgment such as: public earnings information is reflected in large-cap prices within one day, but local zoning information is not reliably reflected in thinly traded real-estate comparables for several months.

Key components

This archetype converts a vague "the market has priced it in" claim into a scoped, testable exercise: efficient relative to which information, available to whom, through which pathway, and within what window. The first three components specify the claim before any testing begins. The Information-Set Boundary Statement declares the class of information in scope, separating public, private, costly, delayed, and excluded information so a global verdict cannot be smuggled in on narrow evidence. The Available Information Inventory records what was actually accessible at the relevant time, guarding against hindsight that treats today's knowledge as available before disclosure. The Incorporation Pathway Model names how the information should reach the signal — trading, quoting, arbitrage, disclosure, or platform updates — because a completeness claim is weak if no plausible pathway exists even when the information does.

The remaining components index the claim to time and put it to the test. The Latency and Update Window makes completeness explicitly time-bound, defining when a delayed response becomes evidence of incompleteness rather than normal lag. The Response Completeness Test then checks whether the signal actually moved in the expected direction, magnitude, timing, and cross-signal consistency once information became available. Finally, the Residual Opportunity Probe searches for predictable excess returns, arbitrage gaps, or stale quotes that would indicate unincorporated information — treated as evidence to investigate rather than automatic proof of inefficiency, since a residual return may simply compensate risk or be too small to exploit after costs. Together the components yield a scoped judgment about a specific signal rather than a blanket efficiency story.

ComponentDescription
Information-Set Boundary Statement The boundary statement defines the class of information in scope. It distinguishes public, semi-public, private, costly, delayed, local, order-book, and excluded information. This prevents global claims from being smuggled into narrow evidence.
Available Information Inventory The inventory records which information was actually available at the relevant time. It guards against hindsight: information known today should not be treated as available before disclosure.
Incorporation Pathway Model Information reaches a signal through mechanisms such as trading, quoting, arbitrage, disclosure, analysis, search, market making, or platform updates. If the pathway is implausible, the completeness claim is weak even if the information existed somewhere.
Latency and Update Window Completeness is time-indexed. Some signals may update in seconds; others in days, weeks, or reporting cycles. The latency window defines when a delayed response becomes evidence of incompleteness.
Response Completeness Test The response test asks whether the signal changed in direction, magnitude, timing, and cross-signal consistency after information became available.
Residual Opportunity Probe Residual opportunities include predictable excess returns, arbitrage gaps, stale quotes, underpriced risk, or exploitable decision gaps. They are evidence to investigate, not automatic proof of inefficiency.

Common mechanisms

Event studies test response around timestamped information releases. Post-announcement drift analysis looks for delayed incorporation. Abnormal-return or residual models compare observed movement to a baseline. Arbitrage scans look for inconsistent related prices. Source inventory matrices map availability and access. Order-book probes examine how microstructure reflects information. Lagged response regressions test whether old information still predicts later movement.

Parameters and calibration

The most important parameter is information-set scope. A narrow public-information claim is easier to verify than a broad all-information claim. A second parameter is latency: a market may be complete after a day but incomplete after a minute. A third parameter is access: information may be available to insiders, subscribers, locals, specialists, or high-speed traders but not to the public.

Completeness judgments should be risk-, cost-, liquidity-, and transaction-cost aware. A residual return is not necessarily an information gap; it may compensate risk or be too small to exploit after costs.

Neighbor distinctions

Completeness Audit checks whether records, cases, stakeholders, or scenarios are missing. This archetype checks whether a price or market-like signal reflects a declared information set. Price Signal Design uses price as an intervention; this archetype tests an existing signal’s information incorporation. Independent Verification Oversight concerns who validates; this archetype concerns the structure of the information-set claim being validated.

Examples

An analyst tests whether earnings announcements are fully incorporated by checking immediate response and post-announcement drift. A real estate investor tests whether transit announcements entered neighborhood prices. A prediction market platform checks whether new polling releases are reflected in contract prices. A dynamic pricing platform audits whether inventory and competitor information are reflected before customers see offers.

Failure modes

The major failure is leaving the information set undefined. Other failures include hindsight availability error, latency mismatch, mistaking risk premia for information gaps, and boundary laundering after inconvenient anomalies appear. The mitigation is to predeclare the boundary, reconstruct availability timing, adjust for costs and risks, and revise the claim rather than forcing every observation into a global efficiency story.

Non-examples

A spreadsheet completeness check is not this archetype. A city designing congestion pricing is price signal design, not information-set verification. A trader claiming mispricing without specifying what available information the market missed is making an unscoped opinion, not applying this pattern.

Review notes

This draft is merge-sensitive with completeness, price-signal, arbitrage, and validation families. It should remain distinct only when the central reasoning is an information-set-relative market-efficiency claim tested through availability, incorporation, latency, and residual-opportunity evidence.