Imagine a water pipeline connecting two cities. If City A pumps water at maximum capacity without regard for City B's storage tanks, the tanks overflow, water is wasted, and the system grinds to a halt while crews clean up the mess. The naive solution—wait for confirmation after each small batch—works but crawls at a glacial pace. What we need is continuous flow with intelligent throttling.
This is precisely the challenge TCP faces. The sender wants to transmit data as fast as possible to maximize throughput. The receiver has finite buffer space and processing capacity. The network path between them has limited bandwidth and introduces delays. How do we achieve maximum throughput without overwhelming any component?
The answer is the TCP Sliding Window—one of the most elegant mechanisms in networking, simultaneously solving flow control, enabling pipelining, and providing the foundation for reliable delivery. Understanding this mechanism is essential for anyone who wants to truly comprehend how TCP achieves its remarkable combination of speed and reliability.
By the end of this page, you will understand the fundamental concepts of the TCP sliding window mechanism: why stop-and-wait is insufficient for modern networks, how the sliding window enables pipelining while preventing buffer overflow, the mathematical relationship between throughput, window size, and round-trip time, and how this mechanism forms the foundation for all of TCP's flow control behavior.
To appreciate the sliding window, we must first understand the protocol it supersedes: Stop-and-Wait. In stop-and-wait, the sender transmits one segment, then waits for an acknowledgment before sending the next. This approach is simple and guarantees the receiver is never overwhelmed—but it's catastrophically inefficient.
Consider a concrete example:
You're transferring data between San Francisco and Tokyo. The round-trip time (RTT) is 100ms, and you're using segments of 1,000 bytes on a 10 Mbps link.
| Parameter | Value | Calculation |
|---|---|---|
| Segment size | 1,000 bytes = 8,000 bits | Given |
| Link bandwidth | 10 Mbps | Given |
| Transmission time | 0.8 ms | 8,000 bits ÷ 10,000,000 bps |
| Round-trip time | 100 ms | Given (propagation delay) |
| Time per segment | 100.8 ms | RTT + transmission time |
| Throughput | ~79,400 bps | 8,000 bits ÷ 0.1008 seconds |
| Link utilization | 0.79% | 79,400 ÷ 10,000,000 |
With stop-and-wait, you're utilizing less than 1% of your available bandwidth! The sender spends 99.2% of its time waiting for acknowledgments rather than transmitting data. You're paying for a 10 Mbps connection but only getting 80 Kbps of actual throughput.
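To make the arithmetic concrete, here is a small Python sketch (illustrative only, using the figures from the example above) that reproduces the stop-and-wait numbers from the table:

```python
# Stop-and-wait throughput for the San Francisco -> Tokyo example.
segment_bits = 1_000 * 8          # 1,000-byte segments
bandwidth_bps = 10_000_000        # 10 Mbps link
rtt_s = 0.100                     # 100 ms round-trip time

transmission_time_s = segment_bits / bandwidth_bps   # 0.8 ms to put the segment on the wire
time_per_segment_s = rtt_s + transmission_time_s     # one segment per (RTT + transmission time)

throughput_bps = segment_bits / time_per_segment_s
utilization = throughput_bps / bandwidth_bps

print(f"Throughput:  {throughput_bps:,.0f} bps")     # ~79,400 bps
print(f"Utilization: {utilization:.2%}")              # ~0.79%
```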
The fundamental problem comes down to the bandwidth-delay product:
The bandwidth-delay product (BDP) represents the amount of data "in flight" that can exist between sender and receiver at any instant:
BDP = Bandwidth × Round-Trip Time
BDP = 10,000,000 bps × 0.1 seconds = 1,000,000 bits = 125,000 bytes
This means the network "pipe" between San Francisco and Tokyo can hold 125,000 bytes at any moment. With stop-and-wait, you're putting only 1,000 bytes in that pipe, then waiting. The pipe is 99.2% empty at all times.
The solution is obvious: keep the pipe full. Instead of waiting for each acknowledgment, send enough data to fill the bandwidth-delay product. This is exactly what the sliding window enables.
The sliding window protocol solves the stop-and-wait inefficiency by allowing the sender to transmit multiple segments before requiring acknowledgment. The "window" represents the range of sequence numbers the sender is permitted to transmit without waiting for ACKs.
Core principles:
The window is a range of permissible sequence numbers — The sender maintains a window of bytes it's allowed to send. This window "slides" forward as acknowledgments arrive.
The receiver advertises its capacity — The receiver tells the sender how much buffer space it has available. This prevents buffer overflow.
Bytes transition through states — Each byte in the sender's buffer is in one of several states: not yet sendable, sendable, sent but unacknowledged, or acknowledged.
The window slides, never jumps — As ACKs arrive, the left edge of the window advances. The right edge advances based on the receiver's advertised capacity.
Think of the sliding window as a permission slip from the receiver. It says: "You may send up to W bytes beyond what I've already acknowledged." As the receiver acknowledges data and frees buffer space, it issues a new permission slip that extends further into the byte stream.
Visualizing the sliding window:
Consider a stream of bytes numbered 0 through infinity. At any moment, the sender's view of this stream is divided into four regions:
| Region | Description | Sender Action |
|---|---|---|
| Acknowledged | Bytes that have been sent AND acknowledged by the receiver | Can be discarded from send buffer; no further action needed |
| Sent, Unacknowledged | Bytes that have been transmitted but ACK not yet received | Must be retained for potential retransmission |
| Sendable | Bytes within the window that haven't been sent yet | May be transmitted immediately |
| Not Yet Sendable | Bytes outside the current window | Must wait for window to slide forward |
```
Byte positions:  0     10    20    30    40    50    60    70    80    90    100
                 |-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|

Current state (LastByteAcked=25, LastByteSent=45, Window=30):

[===ACKED===][=SENT UNACKED=][==SENDABLE==][====NOT YET SENDABLE====]
|<-- 0-25 -->|<-- 26-45 --->|<-- 46-55 -->|<------- 56+ ----------->|
             ^              ^             ^
             |              |             |
      Window starts    LastByteSent   Window ends (25+30=55)

The window covers bytes 26-55 (30 bytes total):
- Bytes 26-45: Already sent, waiting for ACK
- Bytes 46-55: Sendable immediately
- Bytes 56+:   Cannot send until window slides

When receiver ACKs byte 35:
- Window slides: now covers bytes 36-65
- Bytes 26-35: Move to "Acknowledged" region
- Bytes 56-65: Move to "Sendable" region
```

The sliding mechanism:
The beauty of this design is how naturally the window "slides" forward: when an ACK arrives acknowledging new data, the left edge of the window advances to the acknowledgment point and the acknowledged bytes can be discarded from the send buffer; the right edge moves to the acknowledgment point plus the newly advertised window, so bytes that were previously "not yet sendable" become sendable.
This creates a continuous flow: as fast as the receiver acknowledges data, the sender can transmit new data. The window acts as a governor, ensuring the sender never outpaces the receiver's ability to accept data.
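The sketch below (hypothetical variable names, mirroring the diagram above) models the sender's four regions and shows how an arriving ACK slides the window:

```python
def window_regions(last_byte_acked: int, last_byte_sent: int, window: int):
    """Return the sender's view of the byte stream as four regions (inclusive byte ranges)."""
    window_end = last_byte_acked + window
    return {
        "acknowledged":     (0, last_byte_acked),                   # safe to discard
        "sent_unacked":     (last_byte_acked + 1, last_byte_sent),  # kept for retransmission
        "sendable":         (last_byte_sent + 1, window_end),       # may transmit now
        "not_yet_sendable": (window_end + 1, None),                 # must wait for the window to slide
    }

# State from the diagram: LastByteAcked=25, LastByteSent=45, Window=30
print(window_regions(25, 45, 30))   # window covers bytes 26-55

# Receiver ACKs through byte 35: the left edge advances, new bytes become sendable.
print(window_regions(35, 45, 30))   # window now covers bytes 36-65
```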
The sliding window enables pipelining—having multiple segments in flight simultaneously. This is the key to high throughput over high-latency networks.
Revisiting our San Francisco to Tokyo example:
Recall our parameters: 10 Mbps link, 100ms RTT, 1,000-byte segments. With stop-and-wait, we achieved a pitiful 0.79% utilization. Let's see what happens with a sliding window.
| Window Size | Segments in Flight | Effective Throughput | Link Utilization |
|---|---|---|---|
| 1,000 bytes (1 segment) | 1 | 79.4 Kbps | 0.79% |
| 10,000 bytes (10 segments) | 10 | 794 Kbps | 7.9% |
| 50,000 bytes (50 segments) | 50 | 3.97 Mbps | 39.7% |
| 100,000 bytes (100 segments) | 100 | 7.94 Mbps | 79.4% |
| 125,000 bytes (125 segments) | 125 | ~10 Mbps | ~100% |
| 200,000 bytes (200 segments) | 125 (limited by BDP) | ~10 Mbps | ~100% |
To fully utilize the link, the window size must equal or exceed the bandwidth-delay product (BDP). In our example, BDP = 10 Mbps × 100ms = 125,000 bytes. With a window of 125,000 bytes, the sender can keep the entire "pipe" full, achieving near-100% utilization.
The throughput formula:
For a sliding window protocol, throughput is governed by:
Throughput = min(Window Size / RTT, Link Bandwidth)
This reveals a critical insight: throughput is limited by the smaller of two factors, the rate the window permits (Window Size / RTT) and the physical bandwidth of the link.
When Window Size ≥ BDP, throughput becomes bandwidth-limited (the ideal case). When Window Size < BDP, throughput is artificially constrained by the window—we have unused capacity.
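A short sketch applying the formula to the window sizes from the table (all figures assumed from the running example):

```python
def throughput_bps(window_bytes: float, rtt_s: float, bandwidth_bps: float) -> float:
    """Throughput = min(Window Size / RTT, Link Bandwidth)."""
    window_limited = (window_bytes * 8) / rtt_s
    return min(window_limited, bandwidth_bps)

bandwidth = 10_000_000   # 10 Mbps
rtt = 0.100              # 100 ms

# Values come out marginally higher than the table above because this
# formula ignores the per-segment transmission time.
for window in (1_000, 10_000, 50_000, 100_000, 125_000, 200_000):
    tput = throughput_bps(window, rtt, bandwidth)
    print(f"window {window:>7,} bytes -> {tput / 1e6:5.2f} Mbps "
          f"({tput / bandwidth:.1%} utilization)")
```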
Why this matters for modern networks:
High-bandwidth, high-latency networks (common in cloud computing, intercontinental links, and satellite communications) have enormous bandwidth-delay products. A 1 Gbps transatlantic link with 80ms RTT has a BDP of 10 megabytes. If your TCP connection can't advertise a window that large, you'll never come close to 1 Gbps throughput.
Unlike some protocols that count segments, TCP's sliding window counts bytes. This is a crucial detail that affects how everything works.
Why bytes, not segments?
Variable segment sizes: TCP can send segments of different sizes. A byte-oriented window ensures consistent measurement regardless of segmentation.
Partial acknowledgments: The receiver can acknowledge exactly how many bytes it received, allowing for fine-grained flow control.
Buffer management: Receiver buffers are measured in bytes, so a byte-oriented window directly corresponds to buffer capacity.
Application alignment: Applications write and read bytes, not segments. The byte abstraction maintains end-to-end consistency.
TCP sequence numbers identify the position of the first byte in a segment within the byte stream. If a segment has sequence number 1000 and carries 500 bytes, those bytes occupy positions 1000-1499 in the stream. This byte-level addressing enables the byte-oriented sliding window.
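A tiny illustration of this byte-level addressing, using the segment values from the paragraph above (the ACK-number line assumes all earlier data has already arrived in order):

```python
# A segment's sequence number is the stream position of its first byte.
seq = 1000          # sequence number of the segment
length = 500        # payload bytes carried

first_byte = seq                # 1000
last_byte = seq + length - 1    # 1499
next_expected = seq + length    # 1500: the cumulative ACK the receiver sends
                                # once it holds everything up through this segment

print(f"Segment occupies bytes {first_byte}-{last_byte}; ACK number would be {next_expected}")
```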
The TCP header's window field:
The TCP header contains a 16-bit Window field that the receiver uses to advertise how many bytes it can accept. Sixteen bits means the largest window that can be advertised is 65,535 bytes, just under 64 KB.
This 16-bit limitation was a design decision from TCP's 1981 specification (RFC 793). At the time, 64 KB seemed generous. Today, it's a critical bottleneck that required the Window Scale option to overcome (covered in a later page).
| Network Type | Bandwidth | RTT | BDP | 64KB Window Utilization |
|---|---|---|---|---|
| LAN (Ethernet) | 1 Gbps | 0.5 ms | 62.5 KB | ~100% (sufficient) |
| Metro network | 100 Mbps | 10 ms | 125 KB | 52% (limited) |
| Cross-country | 1 Gbps | 50 ms | 6.25 MB | 1% (severely limited) |
| Intercontinental | 10 Gbps | 100 ms | 125 MB | 0.05% (crippled) |
| Satellite link | 100 Mbps | 600 ms | 7.5 MB | 0.9% (crippled) |
Without window scaling, TCP connections over high-bandwidth, high-latency links are severely bottlenecked. A 10 Gbps transcontinental link would be limited to ~5 Mbps of TCP throughput—less than 0.1% utilization. This is why window scaling (RFC 7323) was developed and is now ubiquitous.
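To see why the 16-bit field is so limiting, here is a sketch computing the best TCP can do with an unscaled 65,535-byte window on each path from the table (bandwidths and RTTs as assumed above):

```python
MAX_UNSCALED_WINDOW = 65_535   # bytes: the largest value a 16-bit field can hold

paths = [
    ("LAN (Ethernet)",   1e9,   0.0005),
    ("Metro network",    100e6, 0.010),
    ("Cross-country",    1e9,   0.050),
    ("Intercontinental", 10e9,  0.100),
    ("Satellite link",   100e6, 0.600),
]

for name, bandwidth_bps, rtt_s in paths:
    window_limited = MAX_UNSCALED_WINDOW * 8 / rtt_s     # bps achievable with a 64 KB window
    throughput = min(window_limited, bandwidth_bps)
    print(f"{name:<18} max ~{throughput / 1e6:8.2f} Mbps "
          f"({throughput / bandwidth_bps:.2%} of the link)")
```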
The sliding window mechanism provides the foundation for TCP's receiver-based flow control. This ensures the sender never transmits faster than the receiver can handle, preventing buffer overflow and data loss.
How receiver-based flow control works:
Buffer allocation: The receiver allocates a buffer to hold incoming data before the application reads it.
Window advertisement: In every ACK, the receiver advertises its current available buffer space (the window).
Sender constraint: The sender limits its transmissions to stay within the advertised window.
Dynamic adjustment: As the application reads data, buffer space frees up, and the receiver advertises a larger window.
Continuous cycle: This creates a feedback loop where transmission rate adapts to receiver capacity.
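A minimal sketch of this feedback loop (a hypothetical class, not a real TCP implementation): the receiver buffers incoming data, the application drains it, and the advertised window is simply the free buffer space.

```python
class ReceiverBuffer:
    """Toy model of receiver-side flow control: the advertised window is free buffer space."""

    def __init__(self, capacity: int):
        self.capacity = capacity
        self.buffered = 0          # bytes received but not yet read by the application

    def advertised_window(self) -> int:
        return self.capacity - self.buffered

    def receive(self, nbytes: int) -> None:
        # A well-behaved sender never exceeds the advertised window.
        assert nbytes <= self.advertised_window(), "sender overran the window"
        self.buffered += nbytes

    def app_read(self, nbytes: int) -> None:
        self.buffered -= min(nbytes, self.buffered)   # reading frees space, growing the window


rcv = ReceiverBuffer(capacity=16_000)
rcv.receive(12_000)
print(rcv.advertised_window())   # 4,000: sender must slow down
rcv.app_read(8_000)
print(rcv.advertised_window())   # 12,000: window opens up again
```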
Contrast with congestion control:
It's essential to distinguish flow control from congestion control:
| Aspect | Flow Control | Congestion Control |
|---|---|---|
| Protects | Receiver's buffer | Network capacity |
| Signaled by | Receiver's window advertisement | Packet loss, delay, ECN |
| Mechanism | rwnd (receive window) | cwnd (congestion window) |
| Scope | End-to-end | Sender-centric (path-aware) |
TCP uses both mechanisms simultaneously. The actual transmission window is the minimum of rwnd and cwnd—the sender is constrained by both the receiver's capacity and the network's capacity.
Think of rwnd as the receiver saying "I can handle this much," and cwnd as the network saying "I can carry this much." The sender respects both limits by taking the minimum. This dual governance prevents both receiver overflow and network congestion.
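In sketch form (illustrative values only), the sender's effective limit is simply the minimum of the two windows:

```python
def effective_window(rwnd: int, cwnd: int) -> int:
    """The sender may have at most min(rwnd, cwnd) unacknowledged bytes in flight."""
    return min(rwnd, cwnd)

print(effective_window(rwnd=64_000, cwnd=20_000))   # 20,000: the network is the bottleneck
print(effective_window(rwnd=8_000,  cwnd=50_000))   # 8,000: the receiver is the bottleneck
```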
Let's trace through the precise mechanics of how the sliding window operates in TCP. Understanding these details is crucial for debugging performance issues and implementing TCP stacks.
Key variables at the sender:
| Variable | Meaning | Updated When |
|---|---|---|
| SND.UNA | Send Unacknowledged — oldest byte sent but not yet ACKed | ACK received |
| SND.NXT | Send Next — next byte to be sent | Segment transmitted |
| SND.WND | Send Window — receiver's advertised window | ACK received with window update |
| SND.WL1 | Segment sequence number for last window update | ACK received with window update |
| SND.WL2 | Segment ACK number for last window update | ACK received with window update |
The send window boundaries:
The sender can transmit bytes in the range:
[SND.UNA, SND.UNA + SND.WND)
Bytes from SND.UNA to SND.NXT - 1 have already been sent but are awaiting acknowledgment.
Bytes from SND.NXT to SND.UNA + SND.WND - 1 may be sent immediately.
The usable window calculation:
Usable Window = SND.WND - (SND.NXT - SND.UNA)
= (SND.UNA + SND.WND) - SND.NXT
This represents how many more bytes the sender can transmit right now.
```
Initial State:
    SND.UNA = 1000     (oldest unacknowledged byte)
    SND.NXT = 1000     (next byte to send)
    SND.WND = 4000     (receiver advertised 4000 bytes)

    Usable Window = 4000 - (1000 - 1000) = 4000 bytes

After sending 2000 bytes (bytes 1000-2999):
    SND.UNA = 1000     (still waiting for ACK)
    SND.NXT = 3000     (next byte is 3000)
    SND.WND = 4000     (unchanged)

    Usable Window = 4000 - (3000 - 1000) = 2000 bytes

After receiving ACK 2000, Window 4000:
    SND.UNA = 2000     (bytes 1000-1999 acknowledged)
    SND.NXT = 3000     (unchanged, no new data sent)
    SND.WND = 4000     (receiver still has space)

    Usable Window = 4000 - (3000 - 2000) = 3000 bytes

Window slides: now covers bytes 2000-5999
```

TCP must guard against accepting stale window updates from reordered ACKs. The SND.WL1 and SND.WL2 variables track which segment provided the last window update. A new window value is only accepted if it comes from a segment with a higher sequence number or acknowledgment number than the previous update.
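The same walk-through can be expressed directly in terms of the sender variables (a sketch using the RFC 793 names as plain Python values):

```python
def usable_window(snd_una: int, snd_nxt: int, snd_wnd: int) -> int:
    """Bytes the sender may still transmit right now."""
    return snd_wnd - (snd_nxt - snd_una)

# Initial state
snd_una, snd_nxt, snd_wnd = 1000, 1000, 4000
print(usable_window(snd_una, snd_nxt, snd_wnd))   # 4000

# After sending 2000 bytes (bytes 1000-2999)
snd_nxt = 3000
print(usable_window(snd_una, snd_nxt, snd_wnd))   # 2000

# After receiving ACK 2000 with an advertised window of 4000
snd_una, snd_wnd = 2000, 4000
print(usable_window(snd_una, snd_nxt, snd_wnd))   # 3000; window now covers bytes 2000-5999
```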
Understanding the sliding window has direct practical implications for system administrators, application developers, and network engineers.
Diagnosing throughput problems:
When TCP throughput is lower than expected, the sliding window is often the culprit. Common issues include a receive window smaller than the bandwidth-delay product, window scaling that is disabled or not negotiated, and undersized socket buffers; in each case the connection's throughput is capped at roughly Window / RTT rather than the link bandwidth.

Tuning recommendations:
For high-performance networking, consider these window-related tuning strategies:
Increase socket buffer sizes: OS defaults may be too small. On Linux, adjust net.core.rmem_max, net.core.wmem_max, and the TCP-specific equivalents.
Enable window scaling: Modern systems enable this by default (RFC 7323), but verify it's active for your connections.
Enable TCP buffer auto-tuning: Modern operating systems can automatically adjust buffer sizes based on observed RTT. Ensure auto-tuning is enabled.
Match buffer to BDP: For known high-latency connections (e.g., to specific cloud regions), ensure buffers are at least as large as the expected BDP.
Monitor with packet captures: Tools like Wireshark can display window sizes and identify window-limited connections.
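As one concrete example of matching buffers to the BDP, a Python socket can request larger buffers before connecting. This is a sketch only: the 10 MB figure assumes the 1 Gbps / 80 ms path mentioned earlier, and the operating system may clamp the request to its configured maximums.

```python
import socket

BDP_BYTES = 10 * 1024 * 1024   # ~10 MB: the BDP of a 1 Gbps link with 80 ms RTT

sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)

# Ask the OS for send/receive buffers large enough to cover the BDP.
# Note: setting explicit sizes can disable kernel buffer auto-tuning on some
# systems, so prefer raising system-wide limits when auto-tuning is available.
sock.setsockopt(socket.SOL_SOCKET, socket.SO_SNDBUF, BDP_BYTES)
sock.setsockopt(socket.SOL_SOCKET, socket.SO_RCVBUF, BDP_BYTES)

print(sock.getsockopt(socket.SOL_SOCKET, socket.SO_SNDBUF))   # size actually granted by the OS
```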
The sliding window determines the upper bound of TCP throughput. No amount of bandwidth upgrade helps if the window is too small. When diagnosing performance issues, always check: Is the window large enough to fill the bandwidth-delay product?
We've covered the foundational concepts of TCP's sliding window mechanism. Let's consolidate the key takeaways:

Stop-and-wait wastes bandwidth: Waiting for each ACK leaves the pipe nearly empty on high-latency paths, with utilization below 1% in our example.

Pipelining fills the pipe: The sliding window keeps multiple segments in flight, and utilization approaches 100% once the window size reaches the bandwidth-delay product.

Throughput is bounded: Throughput = min(Window Size / RTT, Link Bandwidth).

The window counts bytes, not segments: The 16-bit header field caps it at 64 KB unless window scaling is negotiated.

Two limits govern the sender: The actual transmission window is min(rwnd, cwnd), respecting both the receiver's buffer and the network's capacity.
What's next:
Now that we understand the sliding window concept, we'll examine its two components in detail: the send window (the sender's perspective on what can be transmitted) and the receive window (the receiver's buffer management and advertisement). These perspectives are complementary but have distinct mechanisms and considerations.
You now understand the fundamental TCP sliding window mechanism—how it enables high throughput through pipelining, how it provides flow control through receiver advertisements, and why window size is critical for performance. Next, we'll explore the send window in depth.