Pipeline¶

Prime #: 168
Origin domain: Computer Science & Software Engineering
Also from: Operations Research, Engineering & Design
Aliases: Workflow Stages, Sequential Processing, Staged Execution
Related primes: Concurrency, throughput, Latency, Buffering

Core Idea¶

A pipeline is a sequence of stages through which work items or data flow in an ordered manner, often allowing concurrent processing of different stages to increase throughput ^[1]. The essential commitment is staging: dividing a workflow into discrete, separable steps such that each stage accepts outputs from the prior stage and produces inputs for the next, enabling overlap and parallelism without requiring true simultaneity within a single stage ^[1].

How would you explain it like I'm…

Assembly Line

Think about washing dishes with friends in a line: one person rinses, one scrubs, one dries, one stacks. Each person keeps working on a new dish while the others handle theirs. The dishes flow down the line, and you finish way more than if one person did every step. That line of stages is a pipeline.

Stage-by-Stage Flow

A pipeline is a way of getting work done by splitting it into a fixed sequence of stages, with each stage doing one piece and passing the result to the next. Because every stage is working on a different item at the same time, the whole line finishes far more items per minute than a single worker doing everything would. It is the same idea behind a factory assembly line, and it shows up in computer chips, data processing, and software build systems.

Staged Workflow

A pipeline is a sequence of stages through which work items or data flow in order, where each stage takes the output of the previous one as its input. The key advantage is throughput: because the stages are separable, different items can occupy different stages simultaneously, giving you parallelism without needing each stage to run multiple copies. Computer processors use pipelines to overlap instruction fetching, decoding, and execution; factories use them on assembly lines; software build systems use them to overlap compiling, testing, and deploying. The structure trades a small per-item latency cost for a much higher overall rate.

A pipeline is a structural pattern in which a workflow is decomposed into an ordered series of discrete stages, each consuming the output of the previous stage and producing input for the next. The key engineering payoff is pipelined parallelism (overlapping execution of different items at different stages simultaneously), which raises throughput in proportion to depth without requiring true intra-stage parallelism. Pipelines impose three design constraints: stage isolation (no stage reaches across boundaries), stage balance (the slowest stage sets the rate, so unbalanced pipelines waste capacity), and buffering between stages (to absorb variance). The pattern was formalized for CPU instruction execution by Ramamoorthy and Li in 1966 and generalized to data processing, manufacturing, build systems, and biological signaling cascades.

Structural Signature¶

Discrete sequential stages with well-defined entry and exit points ^[1]
Stage-to-stage data flow: outputs of stage N become inputs to stage N+1 ^[1]
Concurrent execution of multiple items across different stages simultaneously ^[2]
Buffering capacity between stages to decouple and smooth flow ^[3]
Throughput determined by the slowest stage (bottleneck) ^[4]
Latency as the cumulative time for one item to traverse all stages ^[4]

What It Is Not¶

Not parallel processing. Parallel processing executes independent tasks simultaneously on multiple processors; pipelining stages are sequential and dependent — each stage cannot begin until the prior stage completes (on that particular item), but multiple items can be in different stages concurrently. True parallelism requires multiple processors; pipelining can occur on a single processor via time-slicing or through physical separation of stages.
Not batch processing. Batch processing accumulates all items, processes them through all stages as a cohort, then outputs the cohort. Pipelining continuously accepts items and pushes them through as soon as the stage is ready, even while later items are earlier in the pipeline. Batch is all-or-nothing; pipeline is continuous.
Not just task decomposition. Dividing a task into subtasks (decomposition) is necessary but not sufficient for pipelining; pipelining further requires that multiple items move through those stages concurrently, gaining throughput by overlapping.
Not asynchronous execution. Asynchronous execution means a call returns before completion; pipelining is a structural pattern for organizing workflows, which may or may not be asynchronous in implementation.

Broad Use¶

Pipelines organize staged workflows wherever throughput and latency can be optimized by overlapping:

Software development: build pipelines (compile, test, package, deploy) process multiple commits through stages; CI/CD pipelines gate each change through multiple approval stages.
Manufacturing: assembly lines move products through sequential stations (welding, painting, inspection, packaging), with multiple products in flight simultaneously.
Data processing: ETL (Extract, Transform, Load) pipelines move batches through successive stages; data lakes feed warehouses via layered pipelines.
Microservices: request routing through API gateway → authentication → business logic → persistence → response serialization; multiple requests in flight.
Publishing: editorial workflow (submission → review → revision → copy-editing → typesetting → printing) processes multiple articles concurrently.
Refining and processing: oil refining, water treatment, and beverage production all use physical pipelines with sequential conversion stages.

Clarity¶

Pipeline clarifies by making stage dependencies and throughput bottlenecks visible. Vague goals like "faster processing" resolve into questions of which stage is slowest (bottleneck) and which dependencies prevent parallelism. The clarifying force is to make each stage's input contract and output format explicit, exposing mismatches between stages and identifying where buffering or re-work occurs ^[3].

Manages Complexity¶

Enables decomposition: each stage can be designed, tested, and optimized independently as long as the stage contract (input/output format) is maintained.
Makes bottlenecks visible: with metrics per stage (items queued, processing time, throughput), the slowest stage becomes obvious, directing optimization effort.
Supports scalability: stages can be replicated (multiple parallel instances of the slowest stage, load-balanced by a queue) to increase throughput without redesigning the entire pipeline.
Decouples timing: buffering between stages means a slow stage does not block input (upstream keeps producing) or starve output (downstream keeps consuming), enabling asynchronous flow.
Simplifies failure isolation: if stage N fails, stages 1..N-1 drain, stage N is debugged, and stages N+1.. resume once stage N recovers. Stages upstream are not blocked indefinitely.

Abstract Reasoning¶

Pipeline trains a reasoner to ask:

What are the stages? Are they truly sequential (output of one is input to the next) or can some be parallelized or merged?
What is the throughput of each stage (items per unit time)? Which stage is the bottleneck (slowest)?
What is the latency (total time for one item to traverse all stages)? Can latency be reduced without reducing throughput?
How much buffering (intermediate queue capacity) is needed between stages to smooth flow?
What happens when a stage fails, stalls, or is slower than expected? Does the pipeline degrade gracefully or cascade?
Can stages be load-balanced (replicated and fed by a queue) to reduce bottleneck effect ^[5]?

Knowledge Transfer¶

Role mappings across domains:

Stage ↔ step / phase / station / process / transformation / approval gate
Item ↔ workpiece / data record / request / task / document
Flow ↔ throughput / progress / advancement / queuing
Buffering ↔ queue / staging area / inventory / backlog
Bottleneck ↔ constraint / limiting stage / capacity bottleneck / slowest link
Latency ↔ cycle time / lead time / time-to-completion
Throughput ↔ items per unit time / processing rate / goodput / yield

A compiler's pipeline (lexical analysis → parsing → semantic analysis → code generation), a factory assembly line, and a web request's journey through an API gateway are all organizing the same structural pattern: sequential stages, concurrent items, buffering, and bottleneck management ^[6].

Examples¶

Formal/abstract¶

Intel's instruction pipeline (Hennessy & Patterson 2011) exemplifies the abstraction: fetch (retrieve next instruction from memory), decode (identify opcode and operands), execute (perform ALU operation), memory (access cache or main memory), write-back (store result in register). Each stage takes one cycle, but with pipelining, five instructions can be in flight simultaneously. Item N finishes write-back while item N+1 executes, N+2 accesses memory, and so on. Throughput is one instruction per cycle (in the ideal case); latency is five cycles per instruction. Pipeline hazards (data dependencies, branch mispredictions) create stalls, blocking the pipeline and reducing throughput, illustrating the tension between latency and robustness. This formal pipeline is embedded in every modern processor ^[7].

Mapped back: This instantiates the structural signature directly — discrete sequential stages, item-to-stage flow, concurrent execution across multiple items, buffering (instruction cache), throughput bottleneck (stage with highest latency), and per-item latency (5 cycles).

Applied/industry¶

A cloud CI/CD pipeline for software deployment stages a code change through: git commit → build (compile, unit tests) → test (integration tests, security scans) → staging (deploy to staging environment, smoke tests) → production (deploy to live servers, monitor). Each stage processes the same artifact (the built binary) and produces outputs (build artifacts, test reports, deployment logs). Multiple commits are in flight: commit A in production monitoring, commit B in staging smoke tests, commit C in the test stage, commit D in the build stage. Throughput is determined by the slowest stage (often integration tests or security scans); latency is measured from commit to live deployment (typically 30 minutes to several hours). Bottlenecks (slow security scans) motivate parallelization (running scans on multiple cores), and buffering (allowing builds to start even if tests are running) decouples stages. Failure in one stage (tests fail) stops that commit but does not block earlier commits from proceeding into the test stage, enabling failure isolation and recovery ^[3].

Mapped back: This shows the same structural commitments (sequential stages, item flow, concurrent processing, buffering, bottleneck visibility, failure isolation) at production scale, enabling organization of complex processes across teams and services.

Structural Tensions¶

T1: Latency vs Throughput. Longer pipelines (more fine-grained stages) can increase throughput (more opportunity for parallelism) but increase latency (more stages for each item to traverse). Short pipelines (few coarse stages) reduce latency but limit parallelism and throughput. The trade-off is fundamental: a one-stage pipeline has minimal latency but zero parallelism. A 100-stage pipeline can maximize parallelism but adds overhead. A common failure is over-pipelining (latency unbounded) or under-pipelining (throughput limited) ^[7].
T2: Simplicity vs Flexibility. Simple pipelines (few stages, rigid structure) are easy to reason about and implement but cannot adapt to varying item characteristics. Flexible pipelines (many stages, conditional routing) can adapt but become fragile and harder to optimize. A common failure is building a simple pipeline that works for the happy path, then discovering that error cases bypass stages or require special handling, compromising the pipeline's clarity.
T3: Decoupling vs Coherence. Buffering between stages decouples them, allowing asynchronous flow and tolerating speed mismatches. But large buffers can hide problems (a slow stage is masked by a large queue) and complicate coordination (which stage is responsible for a buffered item if it fails?). Too much decoupling (infinite buffers, pure fire-and-forget) makes the system uncontrollable; too little (synchronous handoff) serializes everything. A common failure is over-decoupling and losing end-to-end visibility.
T4: Static vs Dynamic Bottleneck. The bottleneck (slowest stage) is static in simple pipelines (one stage is always slowest) but dynamic in complex systems (the bottleneck shifts depending on input distribution). A common failure is optimizing for the static bottleneck without realizing it will shift once that stage is improved, squandering effort on already fast stages.
T5: Fairness vs Efficiency. Pipelines can process items in FIFO order (fair: first in, first out) or by priority (efficient: high-priority items skip ahead). FIFO ensures predictability but can be slow if a low-priority item blocks a high-priority one. Priority enables efficiency but can starve low-priority items. A common failure is implementing priority without feedback mechanisms, allowing high-priority items to monopolize stages.
T6: Error Handling vs Progress. A pipeline that stops on the first error is safe (no corrupted downstream results) but halts throughput. A pipeline that skips errors and continues risks propagating bad data downstream. A common failure is designing happy-path pipelines without addressing errors, then deploying fragile systems that crash unpredictably or silently corrupt data ^[8].

Structural–Framed Character¶

Pipeline sits at the structural end of the structural–framed spectrum: it is a pure relational pattern, the same in any domain where it appears, and nothing about its meaning depends on a particular field's vocabulary or assumptions. At its core it is just the arrangement of work into discrete sequential stages, where each stage's output feeds the next and different items can be processed at different stages at the same time.

Though the term is most familiar from software and computing, the pattern owes nothing to that origin: the same staging describes a factory assembly line, the steps of refining crude oil, or a multi-step approval workflow in an organization, and in each case you are simply seeing ordered stages with overlap. It carries no evaluative weight, it is defined by a formal flow relation rather than by any institution, and it can be described without invoking any human practice. Identifying a pipeline is recognizing a structure already present in how work moves through a process, not importing an outside frame. On every diagnostic, it reads structural.

Substrate Independence¶

Pipeline is a highly substrate-independent prime — composite 4 / 5 on the substrate-independence scale. Its signature is crisp and neutral — discrete sequential stages, stage-to-stage flow, concurrent execution, and a governing bottleneck — and it appears across instruction pipelines and CI/CD in computing, assembly lines in manufacturing, workflow staging in operations research, and engineering design. The transfer evidence is real and concrete, with examples reaching from CPU pipelines to cloud CI/CD, showing the same structure recognized across computing and operations. It earns a strong 4 on the back of genuine cross-substrate use and good examples, just shy of the universal spread that defines the top tier.

Composite substrate independence — 4 / 5
Domain breadth — 4 / 5
Structural abstraction — 4 / 5
Transfer evidence — 4 / 5

Relationships to Other Abstractions¶

Current abstraction Pipeline Prime

Parents (3) — more general patterns this builds on

Pipeline is a kind of Decomposition Prime

A pipeline is a specialization of decomposition that breaks a workflow into ordered stages whose outputs feed the next.
Pipeline presupposes Iteration Prime

A pipeline presupposes iteration because work items advance through repeated stage transitions, each consuming the previous stage's output.
Pipeline presupposes Modularity Prime

A pipeline presupposes modularity because the discrete separable stages with well-defined interfaces are exactly modular components.

Children (7) — more specific cases that build on this

MapReduce Domain-specific is a kind of Pipeline

MapReduce is a pipeline specialized to map, shuffle, and reduce stages with a rigid interface and one cost-governing synchronization boundary.
Saga Pattern Domain-specific is a kind of Pipeline

A saga is a pipeline specialized to ordered local transactions with a compensating reverse path for partial failure.
Stream Processing Domain-specific is a kind of Pipeline

Stream processing is a pipeline specialized to continuous record arrival, stateful transforms, event-time windows, and recoverable low-latency output.

▸ Show 4 more

Medication Error Domain-specific is part of Pipeline
Medication Error contains the ordered prescribing-to-monitoring Pipeline on whose stages deviations originate and later defenses operate.
Pharmacokinetic Interaction Domain-specific presupposes Pipeline
Pharmacokinetic Interaction presupposes the staged ADME pipeline that lets a perturbation be localized to absorption, distribution, metabolism, or excretion.
Funnel Analysis Prime presupposes Pipeline
Funnel analysis presupposes an ordered pipeline whose stage boundaries let entrants, exits, and conditional survival be attributed to particular transitions.
Serial Local Optimization Failure Prime is part of Pipeline
A serial local optimization failure contains a pipeline because its locally optimizing stages must be arranged in an ordered chain whose outputs or decisions become the next stage's conditions.

Hierarchy paths (3) — routes to 2 parentless roots

Pipeline → Decomposition

Show alternative paths (2)

Neighborhood in Abstraction Space¶

Pipeline sits in a moderately populated region (40^th percentile for distinctiveness): it has near-neighbors but no dense thicket of synonyms.

Family — Drift, Decay & Record Fidelity (17 primes)

Nearest neighbors

Computed from structural-signature embeddings · 2026-07-26

Not to Be Confused With¶

Pipeline must be distinguished from Flow, though the two are closely related concepts both concerned with movement through systems. Flow is the more general concept describing the smooth, continuous movement of items or substances through a system without particular emphasis on the discrete stages or structure. Flow emphasizes the smoothness and absence of obstruction — flow optimizes for continuity and minimizes impedance to movement. A pipeline, by contrast, is a specific structural pattern that explicitly relies on discrete, separable stages and the overlap of processing across multiple items at different stages. Where flow asks "how do we remove obstacles to smooth movement?", a pipeline asks "how do we structure stages to maximize concurrent processing?" A river flows smoothly, but a water treatment facility is a pipeline — distinct stages (intake, coagulation, settling, filtration, chemical treatment, distribution) process water through sequential steps, with multiple batches of water at different stages simultaneously. A service's operational flow might refer to the overall smoothness of operations (requests handled without delay); a software pipeline refers to the specific sequence of build, test, and deployment stages. Flow is about absence of friction; pipeline is about structured parallelism. Systems can have both: a well-designed pipeline maintains flow within and between its stages, while a smooth-flowing system might lack the staging structure of a pipeline. The distinction matters because optimizing for flow (removing obstacles) requires different interventions than optimizing for pipeline throughput (rebalancing stage times, adding parallelism).

Pipeline is also distinct from Batch Processing, a common confusion point because both involve processing multiple items. Batch processing accumulates a collection of items (a batch) and processes them as a unit through an entire workflow — all items in the batch complete stage 1 before any move to stage 2; all complete stage 2 before any move to stage 3. The batch is the unit of processing; once processing starts, the batch progresses through all stages before another batch begins. A pipeline, by contrast, continuously accepts items and processes them individually, allowing multiple items to occupy different stages simultaneously. In batch processing, once you commit a batch to the workflow, its components are locked together until completion; in a pipeline, items flow through independently, decoupled by buffering between stages. Manufacturing example: batch processing would prepare 100 cars for painting, paint all 100, then move them all to assembly; a pipeline continually feeds cars through paint and assembly such that while one car is in assembly, the next is in paint, and a third is in inspection. Batch processing maximizes per-unit efficiency for large cohorts but introduces wait times (items wait to gather a full batch); pipelining maintains low latency for individual items and maximizes throughput. Batch systems are easier to reason about (all items in cohort move together); pipelines are more complex but more responsive. The choice depends on whether latency (time per item) or throughput (items per unit time) is the priority.

Pipeline is also distinct from Assembly Line, though they are often conflated and the assembly line is a canonical example of pipeline structure. An assembly line is a physical instantiation of a pipeline — a manufacturing system where work items (cars, appliances, products) move through physical stations (welding, painting, assembly, inspection), with workers or machines at each station performing a specific operation. Not all pipelines are assembly lines: a software CI/CD pipeline has no physical assembly line; a data processing pipeline has no workers at stations. Conversely, not all assembly lines are strictly pipelines: an assembly line with flexible routing (items branching to different paths depending on quality or variant) deviates from the pure pipeline model. The assembly line is a useful metaphor for understanding pipelines and a concrete domain where pipeline structure is most visible and optimized, but the pipeline concept is more general — it applies wherever sequential stages with concurrent item processing are designed.

Solution Archetypes¶

Solution archetypes in the catalog that build on this prime — directly (this prime is a source ingredient) or as a related prime.

Built directly on this prime (9)

Bottleneck Identification and Relief: Find the stage, resource, role, queue, or transition that limits whole-system throughput, then relieve, protect, redesign, or prioritize around it.
▸ Mechanisms (11)
- Automation of Bottleneck Stage
- Bottleneck Analysis Workshop
- Bottleneck Buffer
- Bottleneck Priority Rule
- Capacity Expansion
- Input Quality Check
- Process Mining / Trace Analysis — Reconstructs the real process from event traces — discovering the actual control flow, its variants, and where reality deviates from the intended path — that the log reveals but no diagram admits.
- Queue Analysis
- Staffing Relief / Cross-Training
- Theory of Constraints Cycle
- Work-in-Progress Limit
Dependency Ordering: Arrange actions or components according to prerequisite dependencies so later steps do not begin before required conditions exist.
▸ Mechanisms (8)
- Critical Path Method
- Curriculum Prerequisite Map
- Dependency Graph — Draws the system's depends-on relations as nodes and directed edges so concentration, chains, and single points of failure become visible at a glance.
- Deployment Runbook
- Manufacturing Process Plan
- Prerequisite Matrix
- Topological Sort
- Treatment Sequencing Protocol
Funnel Attrition Localization: Represent an ordered process as denominator-preserving stages, measure where the population is lost, and prioritize the stage whose repair most improves final yield.
▸ Mechanisms (11)
- Cohort Transition Table
- Conversion Funnel Dashboard
- Denominator Reconciliation Checklist
- Event Instrumentation Specification
- Event Trace Process Mining
- Funnel Experiment Backlog
- Loss Pareto Review
- Segment Funnel Comparison
- Stage Conversion Anomaly Alert
- Stage Drop-Off Waterfall
- Survivorship Bias Audit
Handoff Standardization: Standardize transitions between stages or actors so flow does not lose context, quality, state, or accountability at handoff points.
▸ Mechanisms (9)
- Case Transfer Dossier
- Chain-of-Custody Form
- Deployment Release Handoff
- Handoff Note Template
- Incident Escalation Note
- Manufacturing Station Handoff
- Shift-Change Briefing
- Structured Handoff Checklist
- Support Ticket Escalation
Head-of-Line Blocking Relief: Prevent one blocked or slow item at the front of a queue from delaying everything behind it.
▸ Mechanisms (8)
- Blocked Item Escalation
- Bypass Queue
- Exception Queue — Pulls the endpoint cases that don't fit the standard flow into a dedicated queue with its own capacity and clock, so the main line keeps moving and the oddballs still get resolved.
- Out-of-Order Processing
- Parallel Lane Activation
- Readiness Scan
- Resequencing Buffer
- Timeout and Escalation
Order-Sensitive Configuration: Control the order of selected elements when sequence changes the meaning, safety, learning, transformation, or function of the whole.
▸ Mechanisms (8)
- Curriculum Sequence Map
- Dependency Graph — Draws the system's depends-on relations as nodes and directed edges so concentration, chains, and single points of failure become visible at a glance.
- Misorder Incident Review
- Ordered Protocol Runbook
- Simulation or Dry Run
- Step-Locking Checklist
- Topological Sorting
- Workflow Orchestrator
Pipeline Staging: Divide a complex flow into ordered stages so each stage can specialize, coordinate handoffs, and preserve throughput, quality, and accountability.
▸ Mechanisms (8)
- Assembly Line Workflow
- CI/CD Pipeline
- Clinical Care Pathway
- Editorial Workflow
- ETL or Data Processing Pipeline
- Legal Procedure Sequence
- Onboarding Workflow
- Research Review Pipeline
Return-Path Design: For every forward path that moves people, work, goods, data, or decisions toward a goal, deliberately design the backward path that lets legitimate reversal, repair, appeal, return, or exit happen without improvisation.
▸ Mechanisms (12)
- Appeal or Review Process
- De-Escalation Pathway
- Dead-Letter Queue and Replay
- Refund or Reversal Protocol
- Resubmission with Preserved State
- Return Authorization Workflow
- Return-Reason Dashboard
- Reverse Logistics Channel
- Rollback Runbook
- Round-Trip Journey Test
- Undo or Cancel Flow
- Unsubscribe or Exit Path
Stage-Gate Progression: Move work, people, decisions, or artifacts through stages only after explicit criteria are met, preventing premature progression and preserving quality, safety, readiness, or legitimacy.
▸ Mechanisms (8)
- Approval Workflow
- Clinical Clearance Protocol
- Compliance Signoff
- Educational Mastery Assessment
- Go / No-Go Review
- Manufacturing Inspection Point
- Quality Gate
- Release Readiness Review

Also a related prime in 14 archetypes

Batch Size Calibration: Set batch size as a controllable design variable, not a habit: make the batch large enough to amortize setup cost but small enough to preserve flow, safety, responsiveness, and timely feedback.
Demand-Triggered Deferred Evaluation: Represent optional or path-dependent work as a suspended unit, realize only the dependency closure demanded now, and make result sharing, side effects, failure timing, cancellation, lifetime, and first-use latency explicit.
Effective-Input Delivery Assurance: Manage what becomes usable at the point of action, not merely what was supplied upstream.
Endpoint Fan-Out Fulfillment: Design the deconsolidation, local staging, routing, service-mode, access, evidence, and recovery layer that turns efficient trunk flow into verified endpoint completion.
Fast–Slow Store Coupling: Keep a volatile fast store and a durable integrated store coupled by governed transfer so the system gets immediate access without losing long-term coherence.
Flow Channelization: Confine diffuse or chaotic flow into defined channels so it can be directed, measured, protected, or governed.
Intermediate-State Throughput Control: Treat a named transient state as a controllable intervention surface: regulate how fast it forms, how long it persists, how its quality changes, and how reliably it converts into the desired next state.
Inversion of Control: Shift initiative or control from the usual actor to another layer, framework, recipient, or environment to reduce coupling, improve fit to context, or coordinate action more cleanly.
Network Flow Optimization: Route flow through a capacity-constrained network to maximize throughput, minimize cost, or avoid bottlenecks.
Push-Pull Decoupling Point Design: Place the buffer at the point where forecastable upstream preparation should stop and demand-specific downstream fulfillment should begin.

▸ Show 4 more

Notes¶

Pipelining is a foundational technique across computer architecture (instruction pipelines), distributed systems (data pipelines), and manufacturing (assembly lines). Hennessy and Patterson's work on computer architecture (2011) formalizes instruction-level pipelining. Modern DevOps embraces CI/CD pipelines as central organizing patterns. Apache Kafka and stream processing frameworks (Flink, Spark Streaming) industrialize data pipelines. The challenge remains balancing latency, throughput, and robustness in complex multi-stage systems.

References¶

[1] Ramamoorthy, C. V., & Gonzalez, M. J. (1966). "Pipeline processing." ACM Computing Surveys, 1(1), 23–38. ↩

[2] Kogge, P. M., & Stone, H. S. (1972). "A parallel algorithm for the efficient solution of a general class of recurrence equations." IEEE Transactions on Computers, 22(8), 786–793. ↩

[3] Newman, S. (2012). Building Microservices: Designing Fine-Grained Systems. O'Reilly Media. ↩

[4] Jain, R. (1989). The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling. Wiley. ↩

[5] Karnin, O., Tsidon, E., & Hannig, F. (2012). "On efficient pipelined parallel processing." In Proceedings of the 26^th ACM International Conference on Supercomputing, 73–82. ↩

[6] Dubois, M., Annavaram, M., & Stenstrom, P. (2003). "Cache protocols: Implementation, invocation, and exploiting concurrency." In Handbook of Computer Architecture. Marcel Dekker. ↩

[7] Hennessy, J. L., & Patterson, D. A. (2011). Computer Architecture: A Quantitative Approach (5^th ed.). Elsevier. ↩

[8] Black, E., Culler, D., & Ousterhout, J. (2000). "Software performance and scalability." In Advances in Computers, Vol. 50. Academic Press. ↩