Control / Data Channel Confusion¶

Prime #: 746
Origin domain: Information Security
Subdomain: protocol design → Information Security
Aliases: Sql Injection, Injection Attack, Command Injection

Core Idea¶

A receiver interprets content meant to be inert data as if it were authoritative instructions, because the protocol does not enforce a structural separation between the two. The boundary is marked by content cues the parser re-inspects rather than by construction it cannot be tricked into crossing — so an adversary crafts data to be re-parsed as control. The historical name is in-band signalling.

How would you explain it like I'm…

Words Pretending To Be Orders

Imagine you tell your robot, "only do what's written in RED." A trickster writes a sneaky order in regular ink but colors it red, and the robot obeys it! The mix-up happens because orders and ordinary words got written on the same paper, and the robot can be fooled into thinking ordinary words are orders.

Sneaky Data Faking Commands

Some systems carry two kinds of stuff on the same line: instructions (commands the machine should obey) and data (plain content it should just hold, like a name or a message). Trouble happens when the machine can't structurally tell them apart and decides which is which by looking at clues inside the content. A sneaky person who controls some of the data shapes it to look like a command, and the machine obeys it. The old name is in-band signalling, from old phones where the right whistle tone on the talking line could secretly control the phone switch. The real fix is to build the system so no content can ever be mistaken for a command, not just to keep blocking bad words one at a time.

In-Band Signalling Trap

Control/Data Channel Confusion is when a receiver treats content it was meant to handle as inert data as if it were authoritative instructions, because the protocol doesn't structurally separate the two. The control channel (instructions, authorization, code) and the data channel (content, parameters, payload) share a substrate, and the boundary between them is marked, if at all, by content cues the receiver re-parses rather than by structure it can't be tricked into crossing. An adversary who controls part of the data shapes it to resemble a control token, and the downstream parser, separating the two logical channels out of the shared stream, mis-classifies the crafted data as control and acts on it. The historic name is in-band signalling vulnerability, after the telephony case where audible tones on the speech channel could trigger switch control. The decisive feature is where the boundary marker lives: in the content, where the adversary can reach it, versus in the construction, where they cannot, which is why structural fixes like prepared statements or capability tokens make the attack impossible by design rather than merely filtered.

Control/Data Channel Confusion is the failure in which a receiver interprets content it was meant to treat as inert data as if it were authoritative instructions, because the protocol does not enforce a structural separation between the two. The control channel (instructions, authorization, code) and the data channel (content, parameters, payload) share a substrate, and the boundary between them is marked, if at all, by content cues the receiver re-parses rather than by structure the receiver cannot be tricked into crossing. An adversary who controls part of the data shapes it to resemble a control token; the downstream parser, separating the two logical channels out of the shared stream, mis-classifies the crafted data as control and acts on it. The historical name is in-band signalling vulnerability, after the telephony case in which audible tones carried on the speech channel could trigger switch control. Four structural commitments compose it: two distinct logical channels, one carrying directives and one carrying content; a shared physical or logical substrate through which both flow; a parser that consumes the substrate and re-separates the channels downstream; and insufficient structural enforcement of the separation, so the parser distinguishes control from data by content (keywords, escape sequences, shape) rather than by structure (sealed envelopes, pre-bound parameters, capability tokens). The decisive feature is the location of the boundary marker, in the content where the adversary can reach it rather than in the construction where they cannot, which makes the failure qualitative rather than quantitative: when control and data share a substrate without structural separation, the probability that crafted data is re-interpreted as control is non-zero and adversarial optimization drives it toward one over time, while structural separation makes that probability zero by construction.

Broad Use¶

SQL injection: a query parser conflates template with input — '; DROP TABLE users; -- is parsed as additional SQL.
Command injection: a shell conflates command and argument — a filename containing ; rm -rf / becomes a second command.
Cross-site scripting: a browser conflates markup with content — input containing <script> becomes executable code.
Return-oriented programming: the CPU conflates code and data on the stack, reading attacker content as return addresses.
Prompt injection: a language model conflates retrieved content with system instructions, obeying "ignore your previous instructions."
Organizational forgery: a forged memo on letterhead is treated as valid authorization because instruction-shaped artifacts are read as control.

Clarity¶

Replaces a long list of substrate-specific vulnerabilities with one structural diagnosis, and exposes the category error: trying to detect bad content when the problem is the absence of channel separation.

Manages Complexity¶

Collapses many vulnerability classes into one frame with one intervention family — separate by structure, not by content inspection — so a designer recognizes which structural mechanism a substrate affords rather than re-deriving a defense for each.

Abstract Reasoning¶

The conditional probability that crafted data is read as control is strictly positive under content separation and driven toward certainty by adversarial search, while structural separation makes it zero by construction — the diagnostic is "separation by construction or by convention?"

Knowledge Transfer¶

Databases → shells → browsers: prepared statements, argument arrays, and text-only DOM APIs are the same structural move in different costumes.
Security → LLM safety: demoting retrieved content to a sealed role and granting tool capabilities explicitly is the prepared-statement pattern for models.
Engineering → institutions: signed authorizations carrying a verifiable authority chain are the same fix as parameter binding, replacing inspectable shape with construction.

Example¶

SQL injection relocates the boundary into the content: a concatenated query lets a quote close the literal so everything after is re-parsed as SQL, while a prepared statement compiles the template before binding so no input can be read as command.

Relationships to Other Primes¶

Parents (2) — more general patterns this builds on

Control / Data Channel Confusion is a kind of Untrusted Input Execution — child of emergent untrusted_input_execution
Control / Data Channel Confusion presupposes, typical Interface — An in-band-signalling FLAW of a contracted boundary where control and data share a substrate; presupposes an interface (the meeting point) whose control/data separation is marked by content not construction. Tentative parent — the prime is largely foundational/structural.

Path to root: Control / Data Channel Confusion → Untrusted Input Execution

Not to Be Confused With¶

Control / Data Channel Confusion is not Form and Content because form-and-content is the general distinction between an expression's shape and meaning, whereas this adds an adversary plus a parser that mis-classifies crafted content as control.
Control / Data Channel Confusion is not Multiplexing because multiplexing is deliberate, well-defined sharing with a reliable demultiplexing scheme, whereas this is the failure of demultiplexing under adversarial input.
Control / Data Channel Confusion is not Interface because an interface is the general notion of a contracted boundary, whereas this is the specific way the control/data boundary fails when enforced conventionally rather than structurally.