The retransmission timeout (RTO) is TCP's mechanism of last resort—a blunt instrument that ensures eventual recovery when all else fails. But this reliability comes at an enormous performance cost. When RTO fires, TCP doesn't just retransmit a segment; it assumes the network is severely congested and responds with the most conservative measures available: resetting the congestion window to 1 segment and entering slow start.
Fast retransmit exists specifically to avoid RTO whenever possible. By detecting loss through duplicate ACKs and retransmitting immediately, TCP bypasses the lengthy timeout period entirely. The performance difference is not marginal—it can be 10x to 100x faster recovery, translating directly into higher throughput, lower latency, and better user experience.
By the end of this page, you will understand: (1) the quantitative cost of RTO in terms of time and throughput, (2) why RTO values are necessarily conservative, (3) how fast retransmit provides 10-100x faster recovery, (4) the mathematical analysis of timeout avoidance, and (5) real-world impact on applications and user experience.
To appreciate timeout avoidance, we must first understand exactly what happens during an RTO event. The retransmission timeout is not simply a delay—it triggers a cascade of conservative actions that devastate throughput.
The RTO Timeline:
| Action | Value/Effect | Rationale |
|---|---|---|
| ssthresh = max(FlightSize/2, 2*SMSS) | Halved flight size | Record last 'safe' rate before problem |
| cwnd = 1 SMSS (or LW) | One segment only | Most conservative possible—slow start from scratch |
| Retransmit oldest unacked segment | snd_una segment | Attempt to make progress |
| RTO = RTO * 2 (exponential backoff) | 2x, 4x, 8x... up to 64s+ | Prevent overwhelming congested network |
| Reset duplicate ACK counter | dupacks = 0 | Fresh detection state |
| Enter slow start | Exponential growth phase | Probe network capacity cautiously |
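The table's sequence can be expressed compactly in code. Below is a minimal Python sketch of the sender-side response when the timer expires; the `state` dictionary, the `retransmit` helper, and the 64-second cap are illustrative assumptions, not part of any specific TCP stack.

```python
# Minimal sketch of the sender-side actions when the RTO fires,
# mirroring the table above. Names (SMSS, cwnd, ssthresh) follow
# RFC 5681 conventions; this is illustrative, not a real TCP stack.

SMSS = 1460          # sender maximum segment size (bytes)
MAX_RTO = 64.0       # common upper bound on the backed-off RTO (seconds)

def on_rto_fire(state):
    """Apply the conservative RTO response to a sender `state` dict."""
    flight_size = state["snd_nxt"] - state["snd_una"]        # bytes currently in flight
    state["ssthresh"] = max(flight_size // 2, 2 * SMSS)      # record last 'safe' rate
    state["cwnd"] = SMSS                                     # collapse to one segment
    state["rto"] = min(state["rto"] * 2, MAX_RTO)            # exponential backoff
    state["dupacks"] = 0                                     # reset duplicate ACK counter
    state["phase"] = "slow_start"                            # probe capacity from scratch
    retransmit(state["snd_una"])                             # resend oldest unacked segment

def retransmit(seq):
    print(f"retransmitting segment starting at seq {seq}")
```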
The Double Penalty:
RTO imposes a double penalty on throughput: first, the connection sits completely idle for the entire timeout period (at least 1 second); second, the congestion window collapses to a single segment, so even after the retransmission the sender must rebuild its rate from scratch via slow start.
Consider a connection with cwnd = 100 segments (~146 KB in flight at MSS = 1460). After RTO, ssthresh drops to 50 segments, cwnd drops to one segment (1% of its previous value), and no new data flows for the full timeout before recovery even begins.
Total impact: RTO duration + slow start recovery time + opportunity cost of reduced window.
RFC 6298 specifies a minimum RTO of 1 second. Even on low-latency networks where RTT is 10ms, the RTO cannot go below 1 second in compliant implementations. This conservative floor protects the network but makes RTO recovery extremely costly on fast networks.
Why RTO Must Be Conservative:
The RTO is intentionally set higher than strictly necessary. A premature (spurious) timeout is costly in its own right: it retransmits data the network may still deliver and needlessly collapses the congestion window. Because RTT fluctuates with queuing delay, the estimator must leave generous headroom above the smoothed RTT.
The classic RTO formula (RFC 6298) includes 4x the RTT variance:
RTO = SRTT + max(G, K × RTTVAR)
Where K=4, G=clock granularity (typically 1ms today), SRTT=smoothed RTT, RTTVAR=RTT variance.
This formula errs heavily on the side of caution.
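The estimator behind this formula is short enough to write out. The following Python sketch implements the RFC 6298 update rules (α = 1/8, β = 1/4, K = 4) under the assumption of a 1ms clock granularity, and shows how the 1-second floor dominates on a low-latency path:

```python
# RFC 6298 RTO estimator, written out as a small Python class.
# Constants follow the RFC: alpha = 1/8, beta = 1/4, K = 4.
# G is the clock granularity; 1 ms is assumed here.

class RtoEstimator:
    ALPHA, BETA, K, G, MIN_RTO = 1/8, 1/4, 4, 0.001, 1.0  # times in seconds

    def __init__(self):
        self.srtt = None
        self.rttvar = None
        self.rto = 1.0                      # initial RTO per RFC 6298

    def on_rtt_sample(self, r):
        if self.srtt is None:               # first measurement
            self.srtt = r
            self.rttvar = r / 2
        else:                               # subsequent measurements
            self.rttvar = (1 - self.BETA) * self.rttvar + self.BETA * abs(self.srtt - r)
            self.srtt = (1 - self.ALPHA) * self.srtt + self.ALPHA * r
        self.rto = max(self.MIN_RTO, self.srtt + max(self.G, self.K * self.rttvar))
        return self.rto

# Even on a 10 ms RTT path with low variance, the 1-second floor dominates:
est = RtoEstimator()
for sample in [0.010, 0.011, 0.009, 0.010]:
    est.on_rtt_sample(sample)
print(f"SRTT={est.srtt*1000:.1f}ms, RTO={est.rto*1000:.0f}ms")   # RTO stays at 1000 ms
```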
Let's develop a rigorous mathematical framework for understanding the performance cost of RTO versus fast retransmit. This analysis reveals why avoidance is so critical.
Recovery Time Comparison:
RTO Recovery Time = RTO + SlowStartRecovery + ResumeTime
Where:
- RTO: the timeout duration (1 second minimum, often higher)
- SlowStartRecovery: time to rebuild cwnd via slow start
- ResumeTime: time to return to steady-state throughput

Fast Retransmit Recovery Time = 3 × PacketTime + RTT
Where:
- 3 × PacketTime: time for 3 duplicate ACKs to arrive
- RTT: time for the retransmission to be acknowledged

Numerical Example:
Consider a connection with RTT = 50ms, cwnd = 100 segments at the time of the loss (so ssthresh becomes 50), and the minimum RTO of 1000ms.
Scenario A: RTO Recovery
1. RTO fires: 1000ms wait
2. Slow start from 1 segment:
- RTT 1: cwnd = 2
- RTT 2: cwnd = 4
- RTT 3: cwnd = 8
- RTT 4: cwnd = 16
- RTT 5: cwnd = 32
- RTT 6: cwnd = 50 (ssthresh)
- Congestion avoidance begins
- RTT 7+: linear growth to 100
3. Total recovery: 1000ms + 7×50ms = 1350ms minimum
Scenario B: Fast Retransmit Recovery
1. 3 dup ACKs arrive: ~10ms (3 segments at typical BDP)
2. Retransmit immediately
3. ACK for retransmit: 50ms RTT
4. cwnd halved but maintained in fast recovery
5. Total recovery: ~60ms
Speedup: 1350ms / 60ms ≈ 22x faster!
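These two scenarios can be reproduced with a quick back-of-the-envelope calculation. The Python sketch below uses the example's parameters (RTT = 50ms, cwnd = 100, minimum RTO = 1000ms); the helper functions are illustrative approximations, not a model of any particular implementation.

```python
# Back-of-the-envelope comparison of the two recovery paths for the
# example above (RTT = 50 ms, cwnd = 100 segments, ssthresh = 50,
# minimum RTO = 1000 ms). Values are approximations, not measurements.

import math

def rto_recovery_ms(rtt_ms, cwnd, rto_ms=1000):
    ssthresh = cwnd // 2
    # RTTs to grow cwnd from 1 up to ssthresh, plus one RTT of congestion avoidance
    rtts_to_recover = math.ceil(math.log2(ssthresh)) + 1
    return rto_ms + rtts_to_recover * rtt_ms

def fast_retransmit_recovery_ms(rtt_ms, dupack_ms=10):
    # 3 duplicate ACKs arrive, then one RTT for the retransmission's ACK
    return dupack_ms + rtt_ms

rto = rto_recovery_ms(rtt_ms=50, cwnd=100)        # ≈ 1350 ms
fr  = fast_retransmit_recovery_ms(rtt_ms=50)      # ≈ 60 ms
print(f"RTO: {rto} ms, Fast retransmit: {fr} ms, speedup ≈ {rto / fr:.0f}x")
```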
| Network Type | RTT | RTO | Fast Retransmit Recovery | RTO Recovery | Speedup |
|---|---|---|---|---|---|
| LAN | 1ms | 1000ms | ~3ms | ~1040ms | ~350x |
| Regional WAN | 20ms | 1000ms | ~30ms | ~1140ms | ~38x |
| Cross-continent | 100ms | 1000ms | ~120ms | ~1700ms | ~14x |
| Satellite | 600ms | 1800ms | ~700ms | ~5400ms | ~8x |
| Mobile (variable) | 80ms avg | 1200ms | ~100ms | ~1600ms | ~16x |
Ironically, the speedup from fast retransmit is greatest on low-latency networks (LANs), where the fixed 1-second RTO minimum creates the largest relative penalty. On a 1ms RTT LAN, RTO is 1000x the RTT, making timeout avoidance extraordinarily valuable.
Beyond raw recovery time, we must consider the impact on actual throughput. A stalled connection contributes zero bytes during recovery; the throughput impact compounds with loss frequency.
Theoretical Throughput Model:
For a TCP connection with:
- Baseline (loss-free) throughput T₀ (bytes/second)
- Per-segment loss probability p
- Segments sent between losses N = 1/p
- Recovery time per loss event R (seconds)

Effective throughput:
T_effective = T₀ × (transmission_time) / (transmission_time + recovery_time)
= T₀ × (N × segment_time) / (N × segment_time + R)
For small R (fast retransmit): throughput impact is minimal
For large R (RTO): throughput degrades significantly
Numerical Throughput Example:
Connection parameters: 100 Mbps link, MSS = 1460 bytes, RTT = 50ms, loss rate = 0.1% (one lost segment per 1000).
With RTO Recovery:
Segments per loss: 1000
Transmission time for 1000 segments: ~117ms (at 100 Mbps)
RTO recovery time: 1000ms (minimum)
Effective throughput = 100 × 117/(117+1000) = ~10.5 Mbps
Efficiency: 10.5% of link capacity
With Fast Retransmit:
Segments per loss: 1000
Transmission time for 1000 segments: ~117ms
Fast retransmit recovery: 60ms
Effective throughput = 100 × 117/(117+60) = ~66 Mbps
Efficiency: 66% of link capacity
Improvement: 66 Mbps vs 10.5 Mbps = 6.3x higher throughput!
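The arithmetic is easy to verify. The Python sketch below reproduces both effective-throughput figures from the constants used above:

```python
# Reproducing the effective-throughput numbers above. Assumes a 100 Mbps
# link, 1460-byte segments, one loss per 1000 segments, and the recovery
# times from the earlier example (1000 ms for RTO, 60 ms for fast retransmit).

LINK_MBPS = 100
MSS_BYTES = 1460
SEGMENTS_PER_LOSS = 1000

def effective_throughput_mbps(recovery_ms):
    # Time to transmit 1000 segments at line rate, in milliseconds (≈ 117 ms)
    tx_ms = SEGMENTS_PER_LOSS * MSS_BYTES * 8 / (LINK_MBPS * 1e6) * 1000
    return LINK_MBPS * tx_ms / (tx_ms + recovery_ms)

print(f"RTO recovery:             {effective_throughput_mbps(1000):.1f} Mbps")  # ≈ 10.5 Mbps
print(f"Fast retransmit recovery: {effective_throughput_mbps(60):.1f} Mbps")    # ≈ 66 Mbps
```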
The Exponential Backoff Multiplier:
The situation worsens dramatically when multiple consecutive losses occur. RTO uses exponential backoff:
Two consecutive RTOs mean 3 seconds of idle time (1s + 2s). Three consecutive RTOs mean 7 seconds. This exponential penalty can cause perceived "connection freezes" that last several seconds—devastating for interactive applications.
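A short calculation makes the compounding obvious. In the sketch below, the 64-second cap is a common implementation choice rather than a protocol requirement:

```python
# Cumulative idle time after n consecutive RTOs with exponential backoff,
# starting from a 1-second base RTO.

def idle_after_consecutive_rtos(n, base_s=1.0, cap_s=64.0):
    return sum(min(base_s * 2**i, cap_s) for i in range(n))

for n in range(1, 6):
    print(f"{n} consecutive RTOs -> {idle_after_consecutive_rtos(n):.0f} s idle")
# 1 -> 1 s, 2 -> 3 s, 3 -> 7 s, 4 -> 15 s, 5 -> 31 s
```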
Studies show that even a 1% loss rate can reduce TCP throughput by 90%+ when RTO is the primary recovery mechanism. Fast retransmit transforms this from a catastrophic scenario to manageable performance degradation.
Fast retransmit provides what we call the "fast path" for loss recovery. Understanding why this path is so much faster requires examining the fundamental differences in the two recovery mechanisms.
Signal Detection Speed:
The Information Advantage:
Duplicate ACKs carry information that pure timeout cannot provide:
RTO provides none of this information. When RTO fires, TCP knows only that something went wrong—it could be:
This uncertainty forces the most conservative possible response.
Window Preservation:
One of fast recovery's key optimizations (introduced in TCP Reno) is window preservation:
RTO Path:
cwnd before loss: 100 segments
cwnd after RTO: 1 segment
Bandwidth utilization: 1% of previous
Fast Retransmit Path:
cwnd before loss: 100 segments
ssthresh after fast retransmit: 50 segments
cwnd during fast recovery: 50 + dup_acks × 1
Bandwidth utilization: ~50% of previous
The ability to maintain approximately half the previous window during fast recovery (rather than dropping to 1) is tremendously valuable. It means the pipe stays mostly full while recovery proceeds.
Network engineers often think in terms of 'keeping the pipe full.' Fast retransmit allows TCP to keep sending new data during recovery (window inflates with each dup ACK), while RTO empties the pipe completely. This is the fundamental difference in throughput efficiency.
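Expressed in simplified Reno-style rules (RFC 5681), the difference between the two paths comes down to a couple of assignments. The following sketch works in whole segments rather than bytes and is illustrative only:

```python
# Contrast of the two paths' effect on cwnd, following the RFC 5681 rules
# in simplified, segment-based form.

def after_rto(cwnd):
    ssthresh = max(cwnd // 2, 2)
    return 1, ssthresh                       # cwnd collapses to a single segment

def after_fast_retransmit(cwnd, extra_dupacks=0):
    ssthresh = max(cwnd // 2, 2)
    cwnd = ssthresh + 3 + extra_dupacks      # inflate by one segment per dup ACK
    return cwnd, ssthresh

print(after_rto(100))                # (1, 50)  -> the pipe drains completely
print(after_fast_retransmit(100))    # (53, 50) -> roughly half the window stays usable
```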
Despite its power, fast retransmit has fundamental limitations. Certain scenarios inevitably lead to timeout regardless of fast retransmit capability. Understanding these scenarios is essential for both implementation and debugging.
Scenario 1: Tail Loss (Insufficient Duplicate ACKs)
When the last few segments of a transmission are lost, there are no subsequent segments to generate duplicate ACKs.
Timeline: Tail Loss Without Fast Retransmit

Time 0ms: Sender transmits segments 1-10
Time 5ms: Segments 1-8 arrive (segments 9, 10 lost)
Time 20ms: ACK for segment 8 arrives (ACK 11680)
[No more segments arriving → No duplicate ACKs possible]
Time 20-1000ms: Sender waits... application is idle or waiting for window
Time 1000ms: RTO fires
Time 1020ms: Segment 9 retransmitted
Time 1040ms: Segment 10 retransmitted (or sender waits for ACK 9)
Time 1060ms+: Recovery continues via slow start

Total stall: 1000ms (RTO duration)
Fast retransmit: IMPOSSIBLE - no dup ACKs generated

Scenario 2: Small Congestion Window
With cwnd ≤ 3 segments, even loss of the first segment cannot generate 3 duplicate ACKs.
| cwnd (segments) | Segments After Loss | Max Dup ACKs | Fast Retransmit? |
|---|---|---|---|
| 1 | 0 | 0 | ❌ Never |
| 2 | 1 | 1 | ❌ Never |
| 3 | 2 | 2 | ❌ Never |
| 4 | 3 | 3 | ✅ Only if 1st segment lost |
| 5 | 4 | 4 | ✅ If 1st or 2nd segment lost |
| 10 | 9 | 9 | ✅ Most losses recoverable |
| 100 | 99 | 99 | ✅ Virtually all losses |
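The table reduces to a single condition: fast retransmit is possible only when at least three segments follow the lost one and their ACKs get back. A small sketch, assuming the first in-flight segment is the one lost and no ACKs are dropped:

```python
# Why small windows can't trigger fast retransmit: with the first in-flight
# segment lost, only the segments sent after it can generate duplicate ACKs.

DUPACK_THRESHOLD = 3

def max_dupacks_if_first_lost(cwnd_segments):
    return cwnd_segments - 1          # every later segment produces one dup ACK

for cwnd in [1, 2, 3, 4, 10, 100]:
    dupacks = max_dupacks_if_first_lost(cwnd)
    ok = dupacks >= DUPACK_THRESHOLD
    print(f"cwnd={cwnd:>3}: max dup ACKs={dupacks:>3}, fast retransmit possible: {ok}")
```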
Scenario 3: Multiple Consecutive Losses
When many consecutive segments are lost, there may be no "bookending" segments to trigger duplicate ACKs.
Sent: [1][2][3][4][5][6][7][8][9][10]
Lost: [1][2][3][4][5][6][7][8] ← 8 consecutive losses
Arrived at receiver: [9][10]
Receiver never got segment 1 (rcv_nxt = 1)
Segments 9 and 10 arrive out of order and are buffered; each triggers a duplicate ACK for segment 1
Only 2 duplicate ACKs result → below the threshold of 3 → RTO required
Scenario 4: ACK Loss
If the duplicate ACKs themselves are lost, the sender never receives the signal.
Scenario 5: Network Partition
If the network path is completely broken (no packets getting through), neither data nor ACKs are flowing. RTO is the only detection mechanism.
Modern TCP includes mechanisms to address these limitations: Tail Loss Probe (TLP) sends a probe segment to elicit ACKs for tail loss scenarios. Early Retransmit allows fast retransmit with fewer than 3 dup ACKs when the flight size is small. RACK uses timing rather than counting to detect loss.
The difference between RTO and fast retransmit recovery propagates up to applications in ways that directly affect user experience. Different application types have different sensitivities to these recovery mechanisms.
Web browsing (HTTP/HTTPS), video streaming, online gaming, and VoIP/video conferencing each tolerate recovery delays very differently, as summarized in the table below.
| Application | Acceptable Delay | Fast Retransmit Impact | RTO Impact |
|---|---|---|---|
| Web browsing | 100-500ms | Imperceptible | Noticeable stall |
| Video streaming | 1-5s (buffered) | None (buffer absorbs) | May rebuffer |
| Online gaming | < 100ms | Minor latency spike | Game-breaking freeze |
| VoIP | < 150ms | Brief audio gap | Conversation break |
| File transfer | Throughput matters | Minimal throughput loss | Up to 90%+ throughput loss |
| SSH/interactive | < 200ms | Sluggish response | Session appears frozen |
Research consistently shows that perceived performance degrades sharply at ~1 second delays. This aligns exactly with the minimum RTO. Fast retransmit keeps recovery under this threshold for most scenarios; RTO exceeds it by definition.
In production systems, monitoring the effectiveness of timeout avoidance is crucial for understanding TCP performance. Several metrics and tools can illuminate whether fast retransmit is working as expected.
Key Metrics:
# Linux: View TCP retransmission statistics

# Kernel-wide aggregates
$ cat /proc/net/netstat | grep -i retrans
TcpExt: ... TCPFastRetrans 12847 ... TCPSlowStartRetrans 234 ...
# FastRetrans >> SlowStartRetrans = good
# SlowStartRetrans high = too many RTOs

# Detailed breakdown
$ nstat -sz | grep -E "Tcp.*(Fast|Slow|Timeout|Retrans)"
TcpRetransSegs 45892          # Total retransmitted segments
TcpFastRetrans 44123          # Via fast retransmit
TcpSlowStartRetrans 1769      # Post-RTO (during slow start)
TcpTimeouts 1823              # Number of RTOs

# Calculate effectiveness
# Fast Retransmit Ratio = FastRetrans / RetransSegs
# 44123 / 45892 = 96.1% ← Excellent

# Per-connection stats via ss
$ ss -ti dst 10.0.0.1 | grep -E "(retrans|rto)"
    rto:204 ... retrans:0/2 reordering:5
# retrans: X/Y = X outstanding retrans, Y total for connection
# rto: current RTO in ms

# Real-time monitoring with tcpstat
$ tcpstat -o "Time:%S,Retrans:%r,TimeoutRto:%e" -i eth0

Diagnostic Approach:
High RTO count: Check for tail loss (TLP may help), small windows (check cwnd, rwnd), or path issues (packet captures)
Low FastRetrans despite loss: Verify SACK negotiation, check for receiver issues, analyze duplicate ACK patterns
Spurious RTOs: RTO may be too aggressive; check RTT variance, consider increasing RTO min (non-standard)
FastRetrans not helping throughput: Verify fast recovery is working (not just fast retransmit); check for SACK reneging
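If you want the effectiveness calculation automated, a small script can derive the fast-retransmit ratio from the kernel counters. The sketch below assumes iproute2's nstat output format and the counter names shown above; names vary between kernel versions, so treat it as a starting point rather than a drop-in tool.

```python
# Hypothetical helper that computes the fast-retransmit ratio from nstat
# counters. Assumes "<CounterName> <value> <rate>" lines from `nstat -az`;
# counter names may differ on your kernel, so verify before relying on it.

import subprocess

def fast_retransmit_ratio():
    out = subprocess.run(["nstat", "-az"], capture_output=True, text=True).stdout
    counters = {}
    for line in out.splitlines():
        parts = line.split()
        if len(parts) >= 2 and parts[1].isdigit():
            counters[parts[0]] = int(parts[1])
    fast = counters.get("TcpExtTCPFastRetrans", 0)    # assumed counter name
    total = counters.get("TcpRetransSegs", 0)         # assumed counter name
    return fast / total if total else 1.0

if __name__ == "__main__":
    print(f"Fast retransmit ratio: {fast_retransmit_ratio():.1%}")
```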
Wireshark's Statistics → TCP Stream Graphs → Time-Sequence (Stevens) provides visual insight into retransmissions. Fast retransmits appear as brief backward movements followed by immediate progress; RTOs appear as long flat periods before retransmission.
Timeout avoidance is not merely an optimization—it is essential for TCP's viability on modern networks. Without fast retransmit, the combination of conservative RTO values and multiplicative backoff would render TCP unusable for most applications on lossy networks.
What's Next:
We've established that fast retransmit provides enormous performance benefits by avoiding timeout. The next page quantifies these benefits more precisely, exploring the performance improvement metrics and comparing TCP's behavior with and without fast retransmit across various network conditions.
You now understand the critical importance of timeout avoidance in TCP—the mathematical basis for its performance impact, the scenarios where it succeeds and fails, and how to measure its effectiveness in production systems. This knowledge is fundamental to understanding modern TCP performance.