In October 1986, the early Internet experienced its first congestion collapse. The network between Lawrence Berkeley National Laboratory and UC Berkeley saw throughput plummet from 32 Kbps to a mere 40 bps—a devastating 99.9% reduction. Packets that should have traversed the network in milliseconds were taking minutes, and most never arrived at all.
This wasn't a hardware failure. Every router, every cable, every interface was functioning perfectly. The network had simply overwhelmed itself. Too many senders were pushing too much data too fast, and the cumulative effect was catastrophic.
This event—and the brilliant work by Van Jacobson and Michael Karels that followed—fundamentally shaped how TCP operates today. Understanding the congestion problem isn't merely academic; it's understanding why the modern Internet works at all.
By the end of this page, you will deeply understand what network congestion is, the mechanisms that cause it, how it differs from mere packet loss, and why addressing congestion is essential for network stability. You'll grasp the feedback dynamics that can spiral a healthy network into collapse, and appreciate why congestion control became one of TCP's most critical functions.
Network congestion occurs when the aggregate demand for network resources—bandwidth, buffer space, processing capacity—exceeds the available supply. It is, fundamentally, a resource contention problem that emerges from the statistical nature of shared network infrastructure.
To understand congestion precisely, we must first understand what it is not:
Congestion exists when the arrival rate of packets at a network element (router, switch, link) exceeds its departure rate for a sustained period, causing queues to build and eventually overflow. The key insight: congestion is about imbalance over time, not instantaneous load.
The mathematical framing:
Consider a router with incoming traffic rate λ (lambda) and service capacity μ (mu). In queueing theory terms, the utilization is ρ = λ/μ. While ρ stays below 1, queues form only briefly during bursts and drain again; once ρ exceeds 1 for a sustained period, the queue grows at roughly λ − μ packets per unit time until the buffer is exhausted.
The critical observation is that routers have finite buffer space. When queues fill, new arriving packets are dropped—this is the router's only option. But here's where things get interesting: how senders react to those drops determines whether the network stabilizes or collapses.
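To make the imbalance concrete, here is a small sketch (the packet rates, tick-based timing, and buffer size are all illustrative, not taken from any real device) of a finite FIFO queue whose arrival rate exceeds its service rate: the queue fills within a couple of dozen intervals, and after that every excess packet is dropped.

```python
# Toy model of one router queue: arrivals (lam) exceed the service rate (mu).
lam = 120           # packets arriving per millisecond
mu = 100            # packets the outgoing link can forward per millisecond
buffer_limit = 500  # maximum packets the queue can hold

queue = 0
dropped_total = 0
for ms in range(1, 51):
    queue += lam                              # arrivals in this interval
    queue -= min(queue, mu)                   # the link drains at most mu packets
    overflow = max(0, queue - buffer_limit)   # whatever does not fit is tail-dropped
    queue -= overflow
    dropped_total += overflow
    if ms % 10 == 0:
        print(f"t={ms:2d} ms  queue={queue:3d}  dropped so far={dropped_total}")
```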
Congestion manifests at specific points in the network where traffic converges and resources are constrained. Understanding these congestion points is essential for appreciating why congestion control is both necessary and challenging.
The Bottleneck Link:
In any end-to-end path, the bottleneck link is the link with the lowest available bandwidth. This is where congestion first appears and persists. Consider a path where every link offers ample capacity except a single intermediate link limited to 100 Mbps.
If the source attempts to send at 500 Mbps, the 100 Mbps link becomes overwhelmed. Packets queue at the router preceding this link, and when the queue fills, packets drop.
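The end-to-end capacity of a path is just the minimum of its link capacities, and any sending rate above that minimum becomes queue growth at the router feeding the bottleneck. A quick sketch, with hypothetical link speeds:

```python
# Hypothetical link capacities along one path, in Mbps
links_mbps = [1000, 10000, 100, 10000, 1000]
send_rate_mbps = 500                    # what the source tries to push

bottleneck = min(links_mbps)
excess = max(0, send_rate_mbps - bottleneck)
print(f"bottleneck capacity: {bottleneck} Mbps")
print(f"the queue feeding the {bottleneck} Mbps link grows by {excess} Mbit "
      f"(about {excess / 8:.0f} MB) every second until it overflows")
```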
| Location | Cause | Characteristics |
|---|---|---|
| Access Network Uplink | Asymmetric bandwidth (faster download, slower upload) | Home/office connections bottleneck on upstream traffic |
| Peering Points | Traffic exchange between ISPs with limited capacity | Evening usage spikes cause widespread slowdowns |
| Core Router Interfaces | Multiple high-speed inputs converging on single output | Microsecond-scale congestion affecting many flows |
| Last-Mile Links | Shared medium (cable, wireless) among many users | Contention-based delays during peak usage |
| Data Center Top-of-Rack (ToR) Switches | Many servers competing for uplink bandwidth | Incast congestion from synchronized responses |
The Many-to-One Problem:
Congestion becomes particularly severe in many-to-one traffic patterns, where multiple senders transmit toward a single receiver. Consider a web request that triggers responses from dozens of microservices, or a distributed storage read that returns data from many storage nodes simultaneously.
In these scenarios, the responses converge on the same last-hop switch port at nearly the same instant. The shared buffer cannot absorb the synchronized burst, many packets are dropped at once, and the affected flows may all stall waiting for retransmission timeouts.
This incast congestion pattern is especially problematic in data centers, where it can cause severe performance degradation or even TCP timeouts even though the network as a whole is lightly loaded.
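A back-of-the-envelope sketch shows the scale mismatch behind incast (all of these figures are hypothetical; real buffer and response sizes vary widely):

```python
servers = 40           # storage or microservice nodes answering the same request
response_kb = 64       # size of each response, in KB
port_buffer_kb = 128   # buffer behind the receiver's switch port

arriving_kb = servers * response_kb
overflow_kb = arriving_kb - port_buffer_kb
print(f"{arriving_kb} KB arrive almost simultaneously at a port with {port_buffer_kb} KB of buffer;")
print(f"roughly {overflow_kb} KB ({overflow_kb / arriving_kb:.0%}) must be dropped or stalled")
```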
Network traffic is inherently bursty. Even if average load is well below capacity, statistical bursts cause momentary congestion. Effective congestion control must handle both sustained overload AND transient bursts—two distinct challenges requiring different mechanisms.
What makes congestion particularly insidious is its tendency to self-reinforce. Without proper control mechanisms, minor congestion escalates into network-wide collapse through a vicious cycle that feeds on itself.
The Deadly Spiral:
Consider what happens when a network becomes congested and TCP senders detect packet loss:
Step 1: Initial Overload
Traffic arriving at a bottleneck exceeds its capacity. The queue fills, delays grow, and the router starts dropping packets.
Step 2: Retransmission Storm
Senders' retransmission timers expire. They resend the packets that were lost, and, because delays have ballooned, they also resend packets that were merely delayed and are still in flight.
Step 3: Amplification
The retransmissions arrive on top of the original overload. More packets are dropped, triggering still more retransmissions; each piece of data may now be crossing the network several times.
Step 4: Collapse
Nearly all capacity is spent carrying duplicate copies of data that has already been sent. Useful throughput falls to a small fraction of what the links can carry, even though every link remains saturated.
The Mathematics of Collapse:
Let's quantify this. Suppose a network has true capacity C and current load L = 1.2C (20% overload). With no congestion control, the excess 0.2C is queued and then dropped every round trip. Senders retransmit everything that was dropped while continuing to offer new data, so the next round's offered load exceeds 1.2C, which causes more drops, which generates more retransmissions. The offered load keeps climbing even though no new demand has appeared.
Within a few round-trip times, the network is carrying mostly retransmissions. Actual useful data—new bytes never-before-transmitted—becomes a tiny fraction of traffic. Goodput (useful throughput) collapses even as throughput (raw bytes moved) remains high.
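A toy round-by-round model makes the divergence visible. It assumes naive senders that retransmit every dropped packet on the next round trip while still offering the same amount of new data, and that drops fall evenly across everything offered; both are simplifications, and real timeout behavior makes the decay even steeper because packets that were merely delayed get retransmitted too.

```python
C = 100          # packets the bottleneck can deliver per RTT
new_data = 120   # new packets offered every RTT (the 1.2C overload from above)
backlog = 0      # previously dropped packets waiting to be retransmitted

for rtt in range(1, 41):
    offered = new_data + backlog               # retransmissions pile on top of new data
    delivered = min(offered, C)                # the link stays fully busy...
    goodput = delivered * new_data / offered   # ...but ever less of it is new data
    backlog = offered - delivered              # every drop is retransmitted next RTT
    if rtt in (1, 5, 10, 20, 40):
        print(f"RTT {rtt:2d}: offered={offered:3d}  delivered={delivered}  goodput={goodput:5.1f}")
```

The delivered column never falls below capacity, yet the goodput column keeps shrinking: that widening gap is congestion collapse.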
The 1986 collapse that Van Jacobson observed showed throughput reduction of 99.9%. The network hardware was fine—plenty of capacity existed. But the traffic patterns made that capacity unusable. Without intervention, the Internet as we know it would be impossible.
For TCP to handle congestion effectively, it must first recognize congestion. This is complicated by the fact that packet loss—TCP's primary signal—can result from multiple causes:
Congestion-Induced Loss: A router's buffer fills and arriving packets are tail-dropped, or an active queue management scheme drops packets deliberately to signal overload before the buffer is completely full.
Non-Congestion Loss: Bit errors corrupt packets on wireless or otherwise noisy links, a link or route fails transiently, or a checksum failure discards a damaged frame. None of these indicate that the network is overloaded.
Why the distinction matters:
If TCP interprets all packet loss as congestion, it will reduce its transmission rate unnecessarily when losses are due to other causes. This is particularly problematic for wireless networks, where bit error rates can be significant but don't indicate network overload.
The Sender's Dilemma:
A TCP sender observing packet loss faces an inference problem it cannot solve directly: it cannot observe what caused the loss. The sender only knows that an acknowledgment it expected never arrived before the retransmission timer expired, or that duplicate ACKs point to a gap in the received sequence. Whether the missing segment was dropped by an overflowing queue, corrupted on a noisy link, or simply delayed is invisible from the endpoint.
Classic TCP makes a conservative assumption: treat all loss as congestion. This is safe in the sense that reducing rate never harms the network—but it causes TCP to underperform on lossy links. Modern variants and extensions (ECN, BBR, loss differentiation algorithms) attempt to distinguish congestion from random loss, but the fundamental challenge remains.
On a wireless link with 1% random packet loss, TCP will repeatedly reduce its congestion window, drastically underutilizing available bandwidth. This is why cellular and WiFi networks implement link-layer retransmission—to hide random losses from TCP and prevent unnecessary congestion responses.
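The size of that penalty can be estimated with the widely cited Mathis et al. approximation for loss-based TCP, roughly MSS/RTT multiplied by sqrt(3/2)/sqrt(p) for loss probability p. It is a steady-state model that ignores timeouts and many other details, but it captures the trend; the segment size, RTT, and loss rate below are illustrative.

```python
from math import sqrt

def loss_based_tcp_bps(mss_bytes, rtt_s, loss_rate):
    """Mathis et al. steady-state approximation, in bits per second."""
    return (mss_bytes * 8 / rtt_s) * sqrt(3 / 2) / sqrt(loss_rate)

# A WiFi-like path: 1460-byte segments, 50 ms RTT, 1% random loss
bps = loss_based_tcp_bps(1460, 0.050, 0.01)
print(f"loss-based TCP tops out near {bps / 1e6:.1f} Mbps, "
      "regardless of how fast the radio link really is")
```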
Congestion manifests in two primary symptoms: increased delay and packet loss. Understanding the relationship between these is crucial for understanding modern congestion control debates.
The Buffer Dimension:
Routers contain buffers (queues) to absorb traffic bursts. Without buffers, any momentary overload would cause immediate drops. But buffers introduce delay—packets waiting in queue aren't moving toward their destination.
Small Buffers: Queueing delay stays low and drops arrive quickly, giving senders a prompt, unambiguous congestion signal, but short bursts are dropped even when the link would have had spare capacity a few milliseconds later.
Large Buffers: Bursts are absorbed and loss becomes rare, but under sustained load the queue grows long, every packet pays a large delay penalty, and the loss signal arrives only after the damage is done.
| Buffer Size | Delay Under Load | Loss Rate | Signal Clarity | Risk |
|---|---|---|---|---|
| Very Small | Minimal | High | Immediate, clear | Premature drops |
| Moderate | Bounded | Moderate | Timely | Balanced trade-off |
| Very Large | Extreme | Low | Severely delayed | Bufferbloat |
| Unbounded (impossible) | Infinite | Zero | Never | Complete collapse |
The Bufferbloat Problem:
In the mid-2000s, memory became cheap enough that network equipment vendors began installing very large buffers. The intention was good: prevent packet loss. The effect was disastrous.
Consider a home router with 3 seconds of buffering at its upstream capacity. A single bulk upload, such as a cloud backup or a large email attachment, keeps that buffer full. Every other packet the household sends, including DNS lookups, VoIP frames, and game updates, must wait behind roughly 3 seconds of queued upload data before it even leaves the house.
The network isn't "dropping" packets—it's delaying them into uselessness. A VoIP call with 3 seconds of delay is unusable. A game with 3 seconds of latency is unplayable. Yet TCP, measuring only loss, sees nothing wrong.
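The arithmetic behind the delay is simple: a full buffer adds buffer size divided by link rate of queueing delay to every packet that passes through it. With hypothetical home-uplink numbers chosen to match the 3-second example:

```python
uplink_mbps = 2.0        # upstream capacity of the access link
buffer_bytes = 750_000   # oversized buffer in the modem/router

drain_rate = uplink_mbps * 1e6 / 8           # bytes per second the uplink can send
queueing_delay = buffer_bytes / drain_rate   # seconds a packet waits behind a full buffer

print(f"a single bulk upload keeps the buffer full and adds {queueing_delay:.1f} s "
      "of latency to every other packet, without dropping any of them")
```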
Modern congestion control algorithms like BBR and COPA don't just react to loss—they actively measure delay and treat increasing delay as a congestion signal. This allows them to keep queues short, minimizing latency while maintaining throughput. The shift from 'loss-based' to 'delay-based' congestion control represents a fundamental evolution in TCP thinking.
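The shared intuition can be sketched without reproducing either algorithm (this is not BBR's or Copa's actual logic, just the underlying idea): the lowest RTT ever observed approximates the queue-free path delay, so any sustained RTT inflation above it means a queue is standing at the bottleneck.

```python
def queue_is_building(recent_rtts, min_rtt, tolerance=1.25):
    """Treat RTT inflation above the path's base RTT as a congestion signal.

    recent_rtts: latest RTT samples, in seconds
    min_rtt:     smallest RTT ever observed (approximates the empty-queue delay)
    tolerance:   how much inflation to accept before reacting (an arbitrary choice here)
    """
    smoothed = sum(recent_rtts) / len(recent_rtts)
    return smoothed > tolerance * min_rtt

# Base RTT of 20 ms, but recent samples near 60 ms: a standing queue has formed.
print(queue_is_building([0.058, 0.061, 0.060], min_rtt=0.020))  # True -> slow down
```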
Network congestion isn't just a capacity problem—it's a resource allocation problem. When demand exceeds supply, how should the scarce resource be divided among competing flows?
The Fairness Question:
Imagine a 100 Mbps link shared by 10 TCP connections. What's the "fair" allocation? An equal 10 Mbps each? Shares proportional to what each application actually needs? More for the customer who pays for a faster plan? Less for the flow that has already transferred gigabytes?
Fairness in networking is both a technical and philosophical challenge.
The RTT Bias Problem:
Classic TCP has a significant fairness flaw: it favors flows with shorter RTT. The congestion window grows by roughly one segment per round-trip time, so a flow with a 20 ms RTT gets fifty increase opportunities per second while a flow with a 200 ms RTT gets only five; after every loss, the short-RTT flow also rebuilds its rate ten times faster and keeps reclaiming the shared bottleneck first.
This RTT unfairness means a user close to a server (low RTT) gets better performance than a user far away (high RTT), even when both are paying for the same bandwidth. Various congestion control modifications attempt to address this bias, with varying success.
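A small synchronized-loss simulation makes the bias visible. The model is deliberately crude: both flows grow by one segment per their own RTT and both halve whenever the shared link overflows, which exaggerates the imbalance relative to real networks but shows its direction clearly.

```python
capacity = 1000.0   # bottleneck capacity, in segments per second
flows = {
    "short RTT (20 ms)": {"rtt": 0.020, "cwnd": 1.0},
    "long RTT (200 ms)": {"rtt": 0.200, "cwnd": 1.0},
}
delivered = {name: 0.0 for name in flows}

tick = 0.001                                    # simulation step, seconds
for _ in range(int(60 / tick)):                 # simulate one minute
    for f in flows.values():
        f["cwnd"] += tick / f["rtt"]            # additive increase: +1 segment per RTT
    rates = {n: f["cwnd"] / f["rtt"] for n, f in flows.items()}
    if sum(rates.values()) > capacity:          # shared overflow: both flows see a loss
        for f in flows.values():
            f["cwnd"] /= 2.0                    # multiplicative decrease
        rates = {n: f["cwnd"] / f["rtt"] for n, f in flows.items()}
    for n, r in rates.items():
        delivered[n] += r * tick

total = sum(delivered.values())
for name, segments in delivered.items():
    print(f"{name}: {segments / total:.0%} of the shared bandwidth")
```

Under this model the short-RTT flow ends up with the overwhelming majority of the link, which is exactly the bias described above.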
No central authority enforces fairness in the Internet. Fairness emerges from the collective behavior of adaptive endpoints. If senders don't implement compatible congestion control, fairness degrades. This is why TCP-friendliness matters: every flow implementing proper congestion control benefits all flows.
The history of TCP congestion control is a fascinating case study in engineering evolution under pressure. Understanding this history illuminates why current mechanisms work the way they do.
The Original TCP (pre-1986):
The original TCP specification (RFC 793, 1981) had no congestion control whatsoever. TCP would send packets as fast as the receiver's advertised window allowed. The implicit assumption was that network capacity would always exceed demand.
This worked fine when the network was small, link speeds were roughly uniform, the number of hosts was tiny, and aggregate demand rarely approached capacity.
But as the Internet grew, these assumptions broke down spectacularly.
| Year | Development | Key Contribution |
|---|---|---|
| 1981 | Original TCP (RFC 793) | No congestion control, reliance on flow control only |
| 1986 | Congestion collapse observed | Van Jacobson documents network-wide failures |
| 1988 | Jacobson algorithms | Slow start, congestion avoidance, fast retransmit introduced |
| 1990 | TCP Reno | Fast recovery added, standard for decades |
| 1994 | TCP Vegas | First delay-based approach (ahead of its time) |
| 1999 | Explicit Congestion Notification (ECN) | Routers can signal congestion without dropping |
| 2004 | TCP BIC/CUBIC | Better utilization and fairness on high bandwidth-delay-product paths |
| 2016 | TCP BBR | Google's model-based approach built on bottleneck bandwidth and RTT estimation |
Van Jacobson's Breakthrough:
In a seminal 1988 paper, Van Jacobson described the algorithms that saved the Internet. His key insights:
Conservation of packets: In equilibrium, a TCP connection should maintain a constant number of packets in flight. New packets enter only when old packets exit (through ACKs).
Equilibrium must be reached, not assumed: TCP cannot know network capacity a priori. It must probe the network by gradually increasing rate until congestion occurs.
Additive Increase, Multiplicative Decrease (AIMD): To be stable and fair, TCP should increase rate slowly but decrease rate rapidly when congestion is detected.
Self-clocking: Use ACK arrivals to pace packet transmission, naturally adapting to available bandwidth.
These principles remain the foundation of TCP congestion control today, 35+ years later.
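Those principles translate almost directly into a window-update rule. Here is a minimal sketch in the Reno style (halving the window on loss rather than Tahoe's reset to one segment), with timers, ACK accounting, and fast retransmit deliberately omitted:

```python
class AIMDWindow:
    """Minimal sketch of Jacobson-style congestion window management."""

    def __init__(self, ssthresh=64.0):
        self.cwnd = 1.0             # congestion window, in segments
        self.ssthresh = ssthresh    # boundary between slow start and congestion avoidance

    def on_ack(self):
        if self.cwnd < self.ssthresh:
            self.cwnd += 1.0                 # slow start: window roughly doubles each RTT
        else:
            self.cwnd += 1.0 / self.cwnd     # additive increase: about +1 segment per RTT

    def on_loss(self):
        self.ssthresh = max(self.cwnd / 2.0, 2.0)
        self.cwnd = self.ssthresh            # multiplicative decrease: halve and keep probing
```

Because new segments are released only as acknowledgments return, the window is also self-clocked to the rate at which the bottleneck is actually delivering packets.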
Jacobson's algorithms were designed for deployment on the existing Internet. He couldn't modify routers or require new protocols—the solution had to work with endpoints alone. This constraint shaped TCP congestion control as a purely end-to-end mechanism, a design philosophy with profound implications.
We've established a comprehensive understanding of why congestion is one of networking's most critical challenges. Let's consolidate the key insights:

- Congestion is resource contention: sustained arrival rates above a bottleneck's capacity fill queues and force drops.
- Without control, retransmissions amplify the overload and goodput collapses even while the links stay fully loaded.
- Packet loss is TCP's primary congestion signal, but it is ambiguous; wireless bit errors and other non-congestion losses look identical to the sender.
- Buffers trade delay for loss, and oversized buffers cause bufferbloat, hiding congestion behind seconds of latency instead of drops.
- Fairness is not enforced by any central authority; it emerges from endpoints running compatible congestion control, and classic TCP is biased toward short-RTT flows.
- Jacobson's 1988 principles (packet conservation, probing for equilibrium, AIMD, ACK self-clocking) remain the foundation of TCP congestion control today.
What's Next:
Understanding the problem is the first step. In the next page, we'll examine network capacity—how to think about the bandwidth and delay characteristics of network paths, and what determines the maximum theoretically achievable throughput for a TCP connection.
You now understand the fundamental nature of network congestion—what causes it, why it's dangerous, and how it nearly broke the early Internet. This foundation prepares you to understand why congestion control mechanisms work the way they do, and how TCP manages to keep the Internet stable despite billions of concurrent connections.