Imagine a global banking system where you check your account balance in New York and see $1,000, while simultaneously your colleague checks the same account in London and sees $500. This isn't just a bug—it's a fundamental violation of what we intuitively expect from a consistent system.
Consistency is perhaps the most intuitively understood yet technically nuanced property in distributed systems. We expect that when data changes, everyone sees that change. But in a world where data lives on multiple machines across continents, connected by unreliable networks with varying latencies, achieving this seemingly simple guarantee becomes one of the deepest challenges in computer science.
By the end of this page, you will deeply understand what consistency means in the context of the CAP theorem, how it differs from ACID consistency, the various consistency models that exist, and why achieving strong consistency in distributed systems requires fundamental trade-offs that shape every modern distributed system's architecture.
Before diving deep, we must be precise about terminology. The word "consistency" appears in multiple contexts in computer science, and conflating them causes endless confusion.
CAP Consistency (Linearizability):
In the CAP theorem, consistency means linearizability—arguably the strongest consistency guarantee possible. A linearizable system behaves as if there is only a single copy of the data, even though that data may be replicated across many nodes. More formally:
A system is linearizable if every operation appears to take effect instantaneously at some point between its invocation and its response, and all operations appear in a total order that respects the real-time ordering of non-overlapping operations.
This is an extraordinarily strong guarantee. It means that once a write completes successfully, every subsequent read (from any client, on any node) will return that written value or a more recent one. There is a single, global ordering of all operations that all observers agree upon.
ACID Consistency (from database transactions) is entirely different. ACID consistency means that a transaction preserves database invariants—if a constraint says account balances must be non-negative, a consistent transaction won't violate that. This has nothing to do with replication or distributed systems. Never confuse the two; you'll make incorrect architectural decisions if you do.
| Context | What 'Consistency' Means | Primary Concern |
|---|---|---|
| CAP Theorem | Linearizability — all nodes see same data in same order | Replication and distributed agreement |
| ACID Transactions | Database invariants are preserved after transactions | Data integrity and constraint validation |
| Eventual Consistency | All replicas converge to same value... eventually | Convergence under asynchrony |
| Cache Consistency | Cache reflects current source-of-truth | Staleness and invalidation |
Why linearizability is the gold standard:
Linearizability provides the illusion of a single-server system. Clients don't need to reason about replication, network partitions, or eventual convergence. They read, and they get the latest value. They write, and it's immediately visible everywhere. This simplicity is why linearizability is desirable—it matches our mental model of how data should behave.
But this simplicity comes at an enormous cost in distributed environments, as we'll explore throughout this module.
To truly understand consistency, we must examine the formal model. This isn't academic pedantry—the formal definition exposes exactly why achieving consistency is so challenging.
The Execution Model:
In a distributed system, operations don't happen instantaneously. Each operation has:
- An invocation time: when the client issues the operation
- A response time: when the client receives the result
- A linearization point: the single instant, somewhere between invocation and response, at which the operation appears to take effect
For a system to be linearizable, every operation's linearization point must fall between its invocation and response times. Additionally, if operation A completes before operation B begins (in real time), then A's linearization point must precede B's.
Example: A linearizable execution:
```
Time →
Client 1:  [--write(x=1)--]         [--read()→1--]
                   ↑ (write takes effect)
Client 2:  [--read()→0--]      [--read()→1--]
              ↑ (reads old value)  ↑ (reads new value)
```
Here, Client 2's first read returns 0 because its linearization point occurred before Client 1's write took effect. The second read returns 1 because it linearized after the write.
Example: A non-linearizable execution:
```
Time →
Client 1:  [--write(x=1)--]
                          ↑ (write completes)
Client 2:                     [--read()→0--]
                                    ↑ (returns stale value!)
```
This violates linearizability because Client 2's read occurred entirely after Client 1's write completed, yet it returned the old value. In a linearizable system, this cannot happen.
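This check can be mechanized: given each operation's invocation and response times, flag any read that began after a write completed yet returned an older value. Below is a simplified checker for a single register; the history tuple format is made up for illustration, and real checkers (like Jepsen's) handle far more cases.

```python
# Stale-read detector for single-register histories.
# Each operation is (kind, value, invoke_ms, respond_ms) -- an assumed,
# illustrative format. Written values are assumed monotonically increasing,
# so "older" simply means numerically smaller (None means "no value seen").

def find_stale_reads(history):
    """Flag reads that began after a write completed yet returned an older value."""
    violations = []
    writes = [op for op in history if op[0] == "write"]
    for op in history:
        if op[0] != "read":
            continue
        _, value, invoke, _ = op
        for write in writes:
            _, wvalue, _, wrespond = write
            # The write finished strictly before the read began, yet the
            # read returned something older: not linearizable.
            if wrespond < invoke and (value is None or value < wvalue):
                violations.append((op, write))
    return violations

history = [
    ("write", 1, 0, 15),     # Client 1: write(x=1), completes at t=15
    ("read", None, 20, 25),  # Client 2: read() -> nil, begins at t=20
]
print(find_stale_reads(history))  # flags the read against the write
```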
Sequential Consistency is slightly weaker: it requires that all operations appear in some sequential order consistent with each client's program order, but this order need not respect real-time. Linearizability adds the real-time constraint. This subtle difference has profound implications—sequential consistency allows 'stale' reads that linearizability forbids, but sequential consistency is cheaper to implement.
The Composability Property:
One beautiful property of linearizability is that it's composable. If you have two linearizable objects (say, two registers), their combined system is also linearizable. This is not true of weaker consistency models, making linearizability particularly valuable for building layered systems.
The Cost of Verification:
Determining whether an execution history is linearizable is NP-complete in the general case. This has practical implications: testing for linearizability is computationally expensive, and distributed systems testing frameworks (like Jepsen) use sophisticated techniques to find violations.
Understanding why consistency is difficult requires grasping the fundamental nature of distributed systems. Several interconnected challenges make strong consistency expensive or impossible under certain conditions.
Challenge 1: The Speed of Light
Information cannot travel faster than light. New York and London are roughly 5,600 km apart; in optical fiber, where light travels at about two-thirds of its vacuum speed, a one-way trip takes about 28 milliseconds and a round trip at least 56 milliseconds. In practice, round-trip times are 60-80ms. If you have replicas on both continents and require them to agree before acknowledging a write, every write incurs this latency penalty. There's no engineering solution: this is physics.
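The floor is easy to compute. The distance and fiber speed below are rough assumptions, not measurements:

```python
# Back-of-the-envelope latency floor for cross-Atlantic coordination.

GREAT_CIRCLE_NY_LONDON_KM = 5_570               # approximate great-circle distance
C_VACUUM_KM_PER_MS = 299.792                    # speed of light in vacuum
C_FIBER_KM_PER_MS = C_VACUUM_KM_PER_MS * 2 / 3  # light travels ~2/3 c in glass

one_way_fiber_ms = GREAT_CIRCLE_NY_LONDON_KM / C_FIBER_KM_PER_MS
round_trip_fiber_ms = 2 * one_way_fiber_ms

print(f"one-way (fiber):    {one_way_fiber_ms:.0f} ms")    # ~28 ms
print(f"round trip (fiber): {round_trip_fiber_ms:.0f} ms")  # ~56 ms
```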
Challenge 2: Network Asynchrony
Networks don't provide guaranteed delivery times. A message might arrive in 1ms, 100ms, or never. Without bounds on message delays, nodes cannot distinguish between a slow network and a crashed node. This uncertainty is the root of many distributed systems problems.
Challenge 3: Independent Failure Modes
In a single-machine system, the entire system either works or it doesn't. In distributed systems, nodes fail independently. Node A might process a write while Node B never receives it due to network failure. Maintaining consistency across such partial failures is where the real complexity lies.
The Coordination Tax:
Achieving strong consistency requires coordination between nodes. Coordination means nodes must communicate and wait for each other before proceeding. This introduces latency (waiting for the slowest participant), reduces throughput (nodes spend time coordinating instead of processing), and creates availability risks (if the coordinator is unreachable, progress halts).
The coordination tax is inescapable. Every consistency protocol—two-phase commit, Paxos, Raft, consensus—pays this tax in some form. The art of distributed systems design is minimizing this tax while maintaining the consistency guarantees your application requires.
The CAP theorem formalizes this impossibility. It doesn't tell us that consistency is unachievable—it tells us that under network partitions, we must choose between consistency and availability. Understanding this trade-off is the foundation of distributed systems architecture.
Despite the challenges, strong consistency is achievable—with appropriate trade-offs. Several strategies can implement linearizability in distributed systems.
Strategy 1: Single Leader with Synchronous Replication
The simplest approach: all writes go to a single leader node, which synchronously replicates to followers before acknowledging. Reads either go to the leader or to followers after they've confirmed receipt of all writes up to a certain point.
Pros: Simple to understand and implement.
Cons: The leader is a bottleneck; write latency equals the RTT to the slowest follower; leader failure requires leader election.
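In sketch form, synchronous replication means the leader refuses to acknowledge a write until every follower has confirmed it. This is an in-process toy with illustrative class names; real followers acknowledge over the network:

```python
# Toy single-leader store with synchronous replication.

class Follower:
    def __init__(self):
        self.data = {}

    def replicate(self, key, value):
        self.data[key] = value
        return True  # ack; a real follower would ack over the network

class Leader:
    def __init__(self, followers):
        self.data = {}
        self.followers = followers

    def write(self, key, value):
        # Synchronous replication: don't acknowledge the client until
        # every follower has confirmed the write.
        for f in self.followers:
            if not f.replicate(key, value):
                raise RuntimeError("replication failed; write not acknowledged")
        self.data[key] = value

    def read(self, key):
        # In this toy model, reads served by the leader return the
        # latest acknowledged write.
        return self.data.get(key)

followers = [Follower(), Follower()]
leader = Leader(followers)
leader.write("x", 1)
print(leader.read("x"), [f.data["x"] for f in followers])  # 1 [1, 1]
```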
Strategy 2: Quorum-Based Protocols
Writes succeed when acknowledged by a majority (quorum) of nodes. Reads also contact a majority. Because any two majorities overlap, a read quorum will always include at least one node with the latest write.
With N nodes, write quorum W, and read quorum R, linearizability requires: W + R > N
Example: With 5 nodes, W=3, R=3, we have 3+3=6 > 5. Any read contacts 3 nodes, any write contacts 3 nodes, and they must share at least 1 node.
Pros: No single point of failure; W and R can be tuned for different read/write workloads.
Cons: Higher message complexity; requires careful handling of concurrent writes.
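The overlap guarantee is easy to verify by brute force for small clusters: enumerate every possible write quorum and read quorum and check that each pair intersects.

```python
# Quorum overlap check: any read quorum intersects any write quorum
# whenever W + R > N. Brute-force demonstration for small N.

from itertools import combinations

def quorums_always_overlap(n, w, r):
    """True iff every size-w set and every size-r set of n nodes intersect."""
    nodes = range(n)
    return all(set(wq) & set(rq)
               for wq in combinations(nodes, w)
               for rq in combinations(nodes, r))

print(quorums_always_overlap(5, 3, 3))  # True:  3 + 3 > 5
print(quorums_always_overlap(5, 2, 3))  # False: 2 + 3 = 5, quorums can miss
```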
Strategy 3: Consensus Protocols (Paxos, Raft)
Consensus protocols ensure that a group of nodes agrees on a sequence of values. All operations are appended to a replicated log, and nodes apply operations from this log in order. Because all nodes see the same log in the same order, they arrive at the same state.
Pros: Handles leader failure gracefully; forms the foundation of many production systems (etcd, Consul, CockroachDB).
Cons: Complex to implement correctly; high message overhead; latency proportional to distance from the leader.
```
Given:
  N = Total number of replicas
  W = Write quorum (nodes that must acknowledge a write)
  R = Read quorum (nodes that must respond to a read)

Linearizability guarantee: W + R > N

Example configurations for N = 5:
┌────────────────┬───┬───┬─────┬──────────────────────────┐
│ Configuration  │ W │ R │ W+R │ Trade-off                │
├────────────────┼───┼───┼─────┼──────────────────────────┤
│ Balanced       │ 3 │ 3 │  6  │ Balanced latency         │
│ Write-heavy    │ 2 │ 4 │  6  │ Fast writes, slow reads  │
│ Read-optimized │ 4 │ 2 │  6  │ Slow writes, fast reads  │
│ Minimum viable │ 3 │ 3 │  6  │ Can tolerate 2 failures  │
└────────────────┴───┴───┴─────┴──────────────────────────┘

Why it works: any two quorums of size > N/2 must overlap.
That overlap contains the latest write, ensuring reads see it.
```

A common misconception: quorums alone don't guarantee linearizability. You also need proper handling of concurrent writes (usually via version vectors or consensus on write order) and careful read repair. Naive quorum implementations can violate linearizability in subtle ways.
Consistency isn't binary. Between the extremes of "no consistency" and "perfect linearizability" lies a rich spectrum of consistency models, each with different guarantees and costs.
Taxonomy of Consistency Models (Strongest to Weakest):
1. Strict Consistency (Theoretical Only) Every read returns the most recently written value across all nodes. Requires instantaneous global communication—physically impossible.
2. Linearizability The CAP consistency. Operations appear atomic and ordered in real-time. Achievable but expensive.
3. Sequential Consistency All operations appear in some sequential order consistent with program order, but not necessarily real-time order. Slightly weaker than linearizability.
4. Causal Consistency Operations that are causally related (one depends on another) are seen in the same order by all nodes. Concurrent operations may be seen in different orders.
5. Read-Your-Writes Consistency A client always sees its own writes. Other clients may see stale data.
6. Eventual Consistency If no new writes occur, all replicas eventually converge to the same value. No timing guarantees.
7. No Consistency (Just Availability) Each replica operates independently. Replicas may permanently diverge.
| Model | Guarantee | Latency Cost | Availability | Use Case |
|---|---|---|---|---|
| Linearizability | Real-time order, latest value | High | Limited during partitions | Financial transactions, leader election |
| Sequential | Some total order exists | Medium-High | Better than linearizable | Local caches with cache coherence |
| Causal | Causal order preserved | Medium | Good | Collaborative editing, social feeds |
| Read-your-writes | Client sees own writes | Low | Very Good | User sessions, shopping carts |
| Eventual | Eventually converges | Lowest | Maximum | DNS, CDN caches, metrics |
Stronger consistency always costs more. A fundamental principle of distributed systems design is: use the weakest consistency model that your application can tolerate. If eventual consistency works, don't pay for linearizability. If causal consistency suffices, don't implement Paxos.
Understanding Causal Consistency in Depth:
Causal consistency deserves special attention because it's often the sweet spot between usability and performance. Two operations are causally related if:
- They occur in the same process, one after the other (program order)
- One reads a value that the other wrote
- They are linked transitively through a chain of such relationships
Operations that aren't causally related are concurrent and may be seen in any order. This model captures the intuitive notion of "cause and effect" without the overhead of total ordering.
Example: In a social network:
- Alice posts: "We got married!"
- Bob sees the post and comments: "Congratulations!"
These are causally related—Bob's comment depends on seeing Alice's post. Causal consistency ensures no user ever sees Bob's comment without Alice's post. But if Carol simultaneously posts about her vacation, that's concurrent with Alice's post, and different users might see them in different orders. This is usually fine—there's no logical dependency.
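A replica can enforce causal order by buffering a message until all of its declared causal dependencies have been applied. This is a toy sketch; the message ids, dependency lists, and class names are illustrative, not any particular system's protocol:

```python
# Causal delivery sketch: apply a message only once all of its declared
# causal dependencies have been applied; buffer it otherwise.

class CausalReplica:
    def __init__(self):
        self.applied = set()  # ids of messages already applied
        self.feed = []        # visible messages, in delivery order
        self.buffer = []      # messages waiting on dependencies

    def receive(self, msg_id, deps, text):
        self.buffer.append((msg_id, deps, text))
        self._drain()

    def _drain(self):
        # Repeatedly deliver any buffered message whose deps are satisfied.
        progress = True
        while progress:
            progress = False
            for item in list(self.buffer):
                msg_id, deps, text = item
                if all(d in self.applied for d in deps):
                    self.buffer.remove(item)
                    self.applied.add(msg_id)
                    self.feed.append(text)
                    progress = True

replica = CausalReplica()
# Bob's comment arrives first (e.g. over a faster network path):
replica.receive("bob-1", deps=["alice-1"], text="Congratulations!")
replica.receive("alice-1", deps=[], text="We got married!")
print(replica.feed)  # ['We got married!', 'Congratulations!']
```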
Theory provides the foundation, but real-world systems face additional challenges that textbooks often overlook.
Challenge: The Read-After-Write Problem
A user submits a form and is redirected to a page showing their submission. If the read goes to a replica that hasn't received the write yet, the user sees nothing—a common UX disaster.
Solutions:
- Route a user's reads to the leader for a short window after their write
- Sticky sessions: pin each user to the replica that received their write
- Track the write's log position (or timestamp) in the session, and only read from replicas that have caught up to it
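One common remedy is session-scoped read-your-writes routing: the client remembers the log position of its last write and only accepts reads from replicas that have applied at least that position. A minimal sketch, with illustrative class and field names:

```python
# Read-your-writes sketch: route reads to a sufficiently fresh replica,
# falling back to the leader when no replica has caught up.

class Replica:
    def __init__(self):
        self.data = {}
        self.applied_upto = 0  # highest log position applied

class Session:
    def __init__(self, leader, replicas):
        self.leader = leader
        self.replicas = replicas
        self.last_write_pos = 0  # log position of this client's last write

    def write(self, key, value):
        self.leader.applied_upto += 1
        self.leader.data[key] = value
        self.last_write_pos = self.leader.applied_upto

    def read(self, key):
        for r in self.replicas:
            if r.applied_upto >= self.last_write_pos:
                return r.data.get(key)
        # No replica is fresh enough; fall back to the leader.
        return self.leader.data.get(key)

leader, lagging = Replica(), Replica()
session = Session(leader, replicas=[lagging])
session.write("form", "submitted")
print(session.read("form"))  # 'submitted', served by the leader fallback
```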
Challenge: The Dual-Write Problem
You need to update both a database and a cache, or a database and a message queue. How do you keep them consistent?
1. Write to database
2. Write to cache
If step 2 fails, database and cache are inconsistent. If you reverse the order and step 1 fails, you have the same problem. Even with try/catch, the process might crash between steps.
Solution: Don't dual-write. Use a single source of truth and derive the other:
- Change data capture (CDC): write only to the database, then stream its change log to the cache or queue
- Transactional outbox: record the event in the same database transaction as the state change; a separate relay publishes it afterward
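The transactional outbox pattern commits the state change and the "event to publish" atomically, so they can never diverge. A sketch using SQLite as the single source of truth; the table and column names are illustrative:

```python
# Transactional outbox sketch: the write and its event are committed in
# one database transaction; a relay process later forwards the events.

import sqlite3

db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE accounts (id TEXT PRIMARY KEY, balance INTEGER)")
db.execute("CREATE TABLE outbox (seq INTEGER PRIMARY KEY AUTOINCREMENT, event TEXT)")
db.execute("INSERT INTO accounts VALUES ('alice', 100)")

def deposit(amount):
    with db:  # one atomic transaction: both rows are written, or neither
        db.execute("UPDATE accounts SET balance = balance + ? WHERE id = 'alice'",
                   (amount,))
        db.execute("INSERT INTO outbox (event) VALUES (?)",
                   (f"deposited {amount}",))

deposit(50)

# A separate relay would poll the outbox, publish each event to the
# cache/queue, and delete the row once delivery is confirmed.
events = [row[0] for row in db.execute("SELECT event FROM outbox")]
balance = db.execute("SELECT balance FROM accounts").fetchone()[0]
print(balance, events)  # 150 ['deposited 50']
```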
Challenge: The Last-Writer-Wins Problem
In systems with concurrent writes, how do you decide which write "wins"? A common approach is Last-Writer-Wins (LWW) using timestamps. But this has severe issues:
- Clock skew means a write that is "later" by wall clock may actually have happened earlier
- Concurrent writes are silently dropped, and the data loss is invisible
Better alternatives:
- Vector clocks to detect truly concurrent writes instead of discarding them
- CRDTs (conflict-free replicated data types) that merge concurrent updates automatically
- Consensus on write order when the application cannot tolerate conflicts
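Vector clocks make the concurrency visible: comparing two clocks tells you whether one version causally dominates the other or whether they are truly concurrent and need merging. A minimal sketch:

```python
# Vector-clock comparison sketch: detect whether two versions are causally
# ordered or truly concurrent, instead of silently dropping one as LWW would.

def compare(vc_a, vc_b):
    """Return 'a<b', 'b<a', 'equal', or 'concurrent' for two vector clocks,
    represented as dicts mapping node id -> counter (missing means 0)."""
    nodes = set(vc_a) | set(vc_b)
    a_le_b = all(vc_a.get(n, 0) <= vc_b.get(n, 0) for n in nodes)
    b_le_a = all(vc_b.get(n, 0) <= vc_a.get(n, 0) for n in nodes)
    if a_le_b and b_le_a:
        return "equal"
    if a_le_b:
        return "a<b"
    if b_le_a:
        return "b<a"
    return "concurrent"  # neither dominates: keep both, or merge via a CRDT

print(compare({"n1": 1}, {"n1": 2}))                     # a<b (safe to overwrite)
print(compare({"n1": 2, "n2": 0}, {"n1": 1, "n2": 1}))   # concurrent
```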
Challenge: Consistency Across Services
In microservices, data spans multiple databases. Maintaining consistency is even harder:
- Two-phase commit across services is slow and couples their availability
- Sagas trade atomicity for a sequence of local transactions with compensating actions
- Event-driven architectures accept eventual consistency between services
The most dangerous consistency violations occur during partial failures—when some operations succeed and others fail. A system that correctly handles the happy path but breaks during failures will seem fine in development and testing, then lose data in production. Always design for failure, not just success.
How do you know if your system actually provides the consistency guarantees it claims? Measuring and testing consistency requires specialized techniques.
Approach 1: Jepsen Testing
Jepsen is a testing framework that simulates network partitions, node failures, and clock skew while running workloads against distributed systems. It collects operation histories and checks whether they're linearizable.
Jepsen has found bugs in nearly every distributed system it's tested, including etcd, Consul, MongoDB, Cassandra, Elasticsearch, and CockroachDB.
Approach 2: Formal Verification
Tools like TLA+ and Alloy can model distributed protocols and prove properties about them. Amazon uses TLA+ extensively to verify protocols in S3, DynamoDB, and other services.
Approach 3: Lineage and Tracing
Instrument systems to track causal dependencies between operations. If a read returns a value, trace which writes produced it and verify the causal chain is correct.
```
Analysis of operation history:

┌──────────────────────────────────────────────────────────┐
│ Operation Timeline                                        │
├──────────────────────────────────────────────────────────┤
│ T1: [:invoke 1 :write [x 1]]                              │
│ T1: [:ok     1 :write [x 1]]                              │
│ T2: [:invoke 2 :read  [x]]                                │
│ T2: [:ok     2 :read  [x] -> nil]                         │
└──────────────────────────────────────────────────────────┘

VIOLATION DETECTED:
Operation 2 read nil after Operation 1 wrote x=1.
The read began after the write completed. This is NOT linearizable.

Operations:
  Op 1: write(x=1) @ T=0ms,  completed @ T=15ms
  Op 2: read(x)    @ T=16ms, completed @ T=20ms, returned nil

Expected: Op 2 should return 1 (or any value ≥ 1)
Actual:   Op 2 returned nil

Possible causes:
  - Async replication lag
  - Read went to a non-leader replica
  - Write not durably committed before ack
```

Testing can find bugs, but it can't prove their absence. A system that passes Jepsen tests might still have consistency bugs that the tests didn't trigger. For critical systems, combine testing with formal methods and design reviews by distributed systems experts.
Metrics for Consistency Health:
While you can't directly "measure" consistency like latency, you can track proxy metrics:
- Replication lag: how far followers trail the leader, in seconds or log positions
- Stale-read rate: the fraction of reads served by replicas behind the client's last write
- Conflict rate: how often concurrent writes require resolution
- Convergence time: how long replicas take to agree after a burst of writes
Monitoring these metrics helps detect consistency degradation before it causes visible problems.
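One common way to estimate replication lag is a probe: write a timestamp to the leader and record when each replica first serves the new value. A small sketch with made-up probe results:

```python
# Proxy-metric sketch: replication lag per replica from a timestamp probe.

import statistics

def replication_lag_ms(leader_written_at_ms, replica_seen_at_ms):
    """Lag of each replica: when it first served the value minus write time."""
    return [seen - leader_written_at_ms for seen in replica_seen_at_ms]

# Hypothetical probe results (ms timestamps at which each replica caught up):
lags = replication_lag_ms(1_000, [1_012, 1_045, 1_310])
print(f"max lag: {max(lags)} ms, median: {statistics.median(lags)} ms")
# Alerting on max lag catches a straggling replica before users notice.
```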
We've taken a deep dive into consistency—the 'C' in CAP—understanding what it means, why it's hard, and how it's achieved. Let's consolidate the key insights.
What's Next:
Consistency is only one vertex of the CAP triangle. In the next page, we'll explore Availability—the guarantee that every request receives a response. You'll see that availability has its own nuances and that the tension between consistency and availability lies at the heart of distributed systems design.
You now understand consistency in the context of the CAP theorem: what it means formally (linearizability), why it's difficult to achieve (coordination costs), the spectrum of consistency models, and the real-world challenges in maintaining consistency. This foundation is essential for understanding the trade-offs we'll explore in the remaining CAP theorem pages.