Dynamic Timeout - Learning Module

Loading content...

0/228

RTO Calculation: The Complete RFC 6298 Algorithm

The Moment of Truth

Every time TCP sends a segment, it faces a critical decision: How long should I wait for acknowledgment before assuming the segment was lost? Set the timer too short, and you'll waste bandwidth retransmitting segments that were actually on their way. Set it too long, and you'll sit idle while valuable transmission time ticks away.

This decision—the Retransmission Timeout (RTO)—is the culmination of everything we've studied: RTT sampling, Jacobson's variance-based estimation, and Karn's algorithm for handling ambiguity. RFC 6298, "Computing TCP's Retransmission Timer," codifies the complete algorithm that modern TCP implementations follow.

In this page, we'll dissect RFC 6298 step by step, understanding not just what the algorithm specifies, but why each element exists.

What You Will Learn

By the end of this page, you will understand the complete RTO calculation algorithm as specified in RFC 6298, all the constants and bounds involved, the clock granularity consideration, how to initialize the algorithm, detailed update procedures, and how RTO fits into TCP's larger retransmission framework.

RFC 6298 Overview: The Standard Specification

RFC 6298, published in 2011, obsoletes the earlier RFC 2988 and provides the current standard for TCP RTO computation. It consolidates decades of research and operational experience into a precise specification.

Key Components

The RFC defines:

State Variables: SRTT (Smoothed RTT) and RTTVAR (RTT Variance)
Constants: Smoothing factors α = 1/8, β = 1/4, and variance multiplier K = 4
Bounds: Minimum RTO (at least 1 second), granularity adjustments
Initialization: First measurement handling
Update Rules: How to incorporate new RTT samples
Timer Management: How to set and manage the retransmission timer

The Core Equation

The fundamental RTO calculation is:

RTO = SRTT + max(G, K × RTTVAR)

Where:

SRTT = Smoothed Round-Trip Time
RTTVAR = Round-Trip Time Variance
G = Clock granularity (minimum timer precision)
K = 4 (variance multiplier)

RFC 6298 Constants and Their Purposes
Constant	Value	Purpose	Implementation Note
α (SRTT smoothing)	1/8	Weight for new RTT sample in mean estimation	Implemented as >> 3
β (RTTVAR smoothing)	1/4	Weight for new deviation in variance estimation	Implemented as >> 2
K (variance multiplier)	4	Safety margin multiplier for variance term	Implemented as << 2
Minimum RTO	≥ 1 second	Lower bound on RTO to prevent spurious retrans	Some systems use 200ms
G (granularity)	System-dependent	Clock timer resolution	Often negligible on modern systems

RFC 6298 vs. RFC 2988

RFC 6298 made one significant change from RFC 2988: it changed the minimum RTO recommendation from 1 second (MUST) to a more flexible requirement. However, it still recommends 1 second as a safe minimum. Some modern implementations (especially in data centers) use lower minimums like 200ms, which can improve latency but requires careful consideration of delayed ACK timers.

Initialization: Before Any Measurements

When a TCP connection is first established, there's no RTT history to work with. RFC 6298 specifies a two-phase initialization:

Phase 1: Before First RTT Measurement

Until the first RTT sample is obtained (typically from the SYN-ACK during connection establishment):

RTO = 1 second

This is a conservative initial value. It's long enough to accommodate most networks while short enough not to stall connection setup excessively.

Rationale: Setting RTO too low initially could cause spurious retransmissions of SYN packets, potentially preventing connection establishment on high-latency links. Setting it too high delays connection setup if the first SYN is lost.

Phase 2: After First RTT Measurement

When the first RTT measurement R is made:

SRTT = R

RTTVAR = R / 2

RTO = SRTT + max(G, K × RTTVAR) = R + max(G, 4 × R/2) = R + max(G, 2R)

Assuming G < 2R (which is almost always true):

RTO = R + 2R = 3R

The initial RTO is thus 3× the first RTT measurement.

rto_initialization.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
class TCPRTO:
    """TCP RTO Calculator following RFC 6298."""
    
    # Constants from RFC 6298
    ALPHA = 1/8           # SRTT smoothing factor
    BETA = 1/4            # RTTVAR smoothing factor
    K = 4                 # Variance multiplier
    MIN_RTO = 1000        # Minimum RTO in milliseconds (1 second)
    INITIAL_RTO = 1000    # Initial RTO before any measurements
    MAX_RTO = 60000       # Maximum RTO in milliseconds (60 seconds)
    CLOCK_GRANULARITY = 1 # G: Assume 1ms granularity on modern systems
    
    def __init__(self):
        # State: None indicates no measurements yet
        self.srtt = None
        self.rttvar = None
        self.rto = self.INITIAL_RTO  # Start with 1 second
        self._first_measurement = True
    
    def on_first_measurement(self, R: float):
        """
        Handle the first RTT measurement.
        
        Per RFC 6298 Section 2.2:
        - SRTT <- R
        - RTTVAR <- R/2
        - RTO <- SRTT + max(G, K*RTTVAR)
        """
        self.srtt = R
        self.rttvar = R / 2
        
        # Calculate RTO with granularity consideration
        variance_term = max(self.CLOCK_GRANULARITY, self.K * self.rttvar)
        self.rto = self.srtt + variance_term
        
        # Apply minimum bound
        self.rto = max(self.rto, self.MIN_RTO)
        
        self._first_measurement = False
        
        print(f"First measurement: R={R}ms")
        print(f"  SRTT={self.srtt}ms, RTTVAR={self.rttvar}ms")
        print(f"  RTO={self.rto}ms (= {R} + 4×{R/2} = 3×{R}ms)")
 
 
# Example: Connection to a server with 100ms RTT
rto_calc = TCPRTO()
print(f"Initial RTO (no measurements): {rto_calc.rto}ms")
 
# First RTT measurement from SYN-ACK
rto_calc.on_first_measurement(100)
# Output: RTO = 300ms (3 × first measurement)

Why RTTVAR = R/2?

Setting the initial variance to half the first measurement is a heuristic that errs on the side of caution:

If the network is stable, RTTVAR will quickly decrease as subsequent measurements confirm the initial estimate.
If the network is variable, the 2R safety margin provides protection while we gather more data.
The resulting 3R initial RTO is conservative but not excessively so.

This heuristic has proven robust across decades of Internet operation.

Connection Establishment

The first RTT measurement typically comes from the three-way handshake: the time between sending SYN and receiving SYN-ACK. This provides an RTT estimate before any application data is transmitted, allowing data segments to use a properly calibrated RTO from the start.

Subsequent Measurements: The Steady-State Algorithm

After the first measurement, each new RTT sample R' updates the estimator using Jacobson's algorithm. RFC 6298 Section 2.3 specifies:

Step 1: Compute Error

Err = R' - SRTT

The error is the difference between the new sample and the current estimate.

Step 2: Update RTTVAR

RTTVAR = (1 - β) × RTTVAR + β × |Err|

RTTVAR = (1 - 1/4) × RTTVAR + (1/4) × |Err|

RTTVAR = 3/4 × RTTVAR + 1/4 × |Err|

Important: RTTVAR is updated before SRTT. This ensures we use the old SRTT value (not yet updated) when calculating the error magnitude.

Step 3: Update SRTT

SRTT = (1 - α) × SRTT + α × R'

SRTT = (1 - 1/8) × SRTT + (1/8) × R'

SRTT = 7/8 × SRTT + 1/8 × R'

Step 4: Compute RTO

RTO = SRTT + max(G, K × RTTVAR)

RTO = SRTT + max(G, 4 × RTTVAR)

Step 5: Apply Bounds

RTO = max(RTO, MinRTO)

RTO = min(RTO, MaxRTO) (optional but common)

rto_update.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
def on_subsequent_measurement(self, R_prime: float):
    """
    Handle subsequent RTT measurements.
    
    Per RFC 6298 Section 2.3, the update order matters:
    1. Update RTTVAR using OLD SRTT
    2. Update SRTT
    3. Recompute RTO
    """
    # Step 1: Compute error using OLD SRTT
    err = R_prime - self.srtt
    
    # Step 2: Update RTTVAR (using OLD SRTT for error calculation)
    # RTTVAR = (1 - β) * RTTVAR + β * |err|
    # With β = 1/4: RTTVAR = 3/4 * RTTVAR + 1/4 * |err|
    abs_err = abs(err)
    self.rttvar = (1 - self.BETA) * self.rttvar + self.BETA * abs_err
    
    # Alternative formulation:
    # self.rttvar = self.rttvar + self.BETA * (abs_err - self.rttvar)
    
    # Step 3: Update SRTT
    # SRTT = (1 - α) * SRTT + α * R'
    # With α = 1/8: SRTT = 7/8 * SRTT + 1/8 * R'
    self.srtt = (1 - self.ALPHA) * self.srtt + self.ALPHA * R_prime
    
    # Alternative formulation:
    # self.srtt = self.srtt + self.ALPHA * err  # Uses err from step 1
    
    # Step 4: Compute RTO
    variance_term = max(self.CLOCK_GRANULARITY, self.K * self.rttvar)
    self.rto = self.srtt + variance_term
    
    # Step 5: Apply bounds
    self.rto = max(self.rto, self.MIN_RTO)
    self.rto = min(self.rto, self.MAX_RTO)
    
    return self.rto
 
 
# Example trace:
rto_calc = TCPRTO()
rto_calc.on_first_measurement(100)  # First: RTO = 300ms
 
measurements = [105, 95, 102, 98, 100]  # Stable network
for r in measurements:
    rto = rto_calc.on_subsequent_measurement(r)
    print(f"R={r}ms -> SRTT={rto_calc.srtt:.1f}, RTTVAR={rto_calc.rttvar:.1f}, RTO={rto:.1f}")
    
# Output shows RTTVAR decreasing as network proves stable

Why Update RTTVAR Before SRTT?

This ordering is critical and often implemented incorrectly. Consider what happens if we update SRTT first:

SRTT_new = f(SRTT_old, R')
Err = R' - SRTT_new ← Uses NEW SRTT
RTTVAR = f(RTTVAR, Err)

The error calculated with the new SRTT is artificially reduced because SRTT has already moved toward R'. This dampens RTTVAR incorrectly.

By updating RTTVAR first, we measure the deviation from what we expected (old SRTT), not from an already-adjusted value.

Implementation Pitfall

Updating SRTT before RTTVAR is a common implementation bug. It appears to work in testing because the error is subtle—RTTVAR decreases slightly faster than it should. The problem manifests as overly aggressive RTO in variable networks, leading to occasional spurious retransmissions.

Clock Granularity Considerations

RFC 6298 includes the granularity term G in the RTO calculation:

RTO = SRTT + max(G, K × RTTVAR)

This term ensures that RTO never depends solely on a variance estimate that might be smaller than the clock resolution.

History: Why Granularity Matters

In the early days of TCP, system clocks had coarse granularity—often 500ms or even 1 second ticks. This created several problems:

Measurement quantization: RTT could only be measured in multiples of G
Artificially small RTTVAR: If all RTT samples happen to round to the same value, RTTVAR approaches zero
Minimum resolution: The timer couldn't fire more precisely than G

Including max(G, 4×RTTVAR) ensures RTO is at least one clock tick more than SRTT, even if RTTVAR is tiny.

Clock Granularity Over TCP History
Era	Typical Granularity	Impact on RTO
1980s Unix	500ms - 1s	Significant: Many RTTs fall within one tick
1990s Systems	10ms - 100ms	Moderate: Important for LAN connections
2000s Systems	1ms - 10ms	Minor: Mostly affects high-speed LANs
Modern Systems	1μs - 1ms	Negligible: G term rarely dominates

Modern Systems

On modern systems with microsecond or better clock resolution, the G term is effectively negligible. However, it remains in the specification for:

Backward compatibility: Older embedded systems may have coarser clocks
Correctness under edge cases: Even modern systems might have timer precision issues under heavy load
Specification completeness: The algorithm should work correctly regardless of clock characteristics

Practical Implementation

granularity_handling.c
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
/* RFC 6298 RTO calculation with granularity */
 
/* Clock granularity in same units as SRTT/RTTVAR (e.g., milliseconds) */
#define CLOCK_G  1  /* 1ms granularity on modern systems */
 
/* Calculate RTO from SRTT and RTTVAR */
unsigned int calculate_rto(unsigned int srtt, unsigned int rttvar) {
    unsigned int variance_term;
    unsigned int rto;
    
    /* RFC 6298: RTO = SRTT + max(G, K*RTTVAR) */
    variance_term = 4 * rttvar;  /* K = 4 */
    
    if (variance_term < CLOCK_G) {
        variance_term = CLOCK_G;  /* max(G, K*RTTVAR) */
    }
    
    rto = srtt + variance_term;
    
    /* Apply minimum bound from RFC 6298 */
    if (rto < MIN_RTO) {
        rto = MIN_RTO;
    }
    
    /* Apply maximum bound (implementation-specific) */
    if (rto > MAX_RTO) {
        rto = MAX_RTO;
    }
    
    return rto;
}
 
/* 
 * Note: In practice, the MIN_RTO bound (typically 1 second)
 * is usually larger than any value that would result from
 * the G term dominating, so G is often irrelevant.
 */

The Minimum RTO Dominates

In most practical scenarios, the minimum RTO bound (1 second per RFC 6298) is much larger than the granularity term. The granularity consideration is mainly relevant for high-speed, low-latency networks where the calculated RTO might otherwise be in the single-digit milliseconds—which itself is below the minimum RTO. The G term thus rarely affects actual RTO values in conforming implementations.

RTO Bounds and Their Rationale

RFC 6298 specifies bounds on the RTO value, and these bounds have important rationale:

Minimum RTO (Lower Bound)

RFC 6298 states:

Whenever RTO is computed, if it is less than 1 second then the RTO SHOULD be rounded up to 1 second.

The RFC acknowledges this may be relaxed in controlled environments but maintains 1 second as the default recommendation.

Why Minimum RTO = 1 Second

•Delayed ACK interaction: Receivers may delay ACKs up to 500ms (per RFC 1122). An RTO below ~500ms risks spurious timeouts.
•Clock synchronization: Different systems may have slightly misaligned clocks; too-tight timing risks false positives.
•Transient delays: Brief network hiccups (route changes, queue bursts) can temporarily delay packets; a margin prevents overreaction.
•Congestion safety: Very aggressive retransmission can amplify congestion; the minimum provides a circuit breaker.
•Historical practice: The 1-second minimum has proven safe across decades of Internet operation.

Maximum RTO (Upper Bound)

RFC 6298 does not mandate a maximum RTO, but implementations typically enforce one (often 60-120 seconds). The rationale:

Connection liveness: At some point, we must give up or at least attempt recovery
User experience: A connection that appears frozen for minutes is effectively dead to the user
Resource management: Connections with huge RTOs consume memory and state without progress

Data Center Considerations

In controlled environments (data centers, private networks), operators sometimes relax the minimum RTO to values like 200ms or even lower. This is acceptable when:

Delayed ACKs are disabled or set to very short timers
RTT is well-known and stable (same rack, same building)
The network is overprovisioned and queuing rare
Spurious retransmission cost (extra bandwidth) is acceptable
Lower latency for loss recovery outweighs the risk

RTO Bounds in Different Environments
Environment	Typical MinRTO	Typical MaxRTO	Notes
Public Internet	1 second	60 seconds	RFC 6298 recommendation
Enterprise LAN	200ms - 1s	30-60 seconds	Often configurable
Data Center	20ms - 200ms	10-30 seconds	Optimized for low latency
High-Frequency Trading	<1ms	100ms - 1s	Extreme tuning, specialized stacks

Relaxing Minimum RTO is Risky

Lowering minimum RTO below 1 second in uncontrolled environments is dangerous. A network hiccup or delayed ACK can trigger spurious retransmissions, potentially leading to: (1) wasted bandwidth, (2) unnecessary congestion response, (3) degraded throughput due to cwnd reduction. Only relax MinRTO when you fully control both endpoints and the network path.

Timer Management: Setting and Resetting RTO

The RTO calculation tells us what the timeout should be, but RFC 6298 also specifies when to set and reset the timer:

Rule 5.1: When to Start the Timer

When a segment containing data is sent (including a retransmission), if the timer is not running, start it running so that it will expire after RTO seconds.

The timer starts when the first unacknowledged segment is sent. It doesn't restart for every segment—only if it's not already running.

Rule 5.2: When to Stop the Timer

When all outstanding data has been acknowledged, turn off the retransmission timer.

When the receiver has acknowledged everything, there's nothing to time out on.

Rule 5.3: When to Restart the Timer

When an ACK is received that acknowledges new data, restart the retransmission timer so that it will expire after RTO seconds.

Each ACK that makes progress restarts the timer with the current RTO value. This ensures the timer reflects the most recent segment, not an old one.

timer_management.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
class TCPRetransmissionTimer:
    """Manages the retransmission timer per RFC 6298 Section 5."""
    
    def __init__(self):
        self.timer_running = False
        self.timer_expiry = None
        self.rto_calculator = TCPRTO()
        
        # Track unacknowledged data
        self.snd_una = 0      # Oldest unacknowledged byte
        self.snd_nxt = 0      # Next byte to send
    
    def send_data(self, segment):
        """Called when data is transmitted."""
        segment_end = segment.sequence + len(segment.data)
        self.snd_nxt = max(self.snd_nxt, segment_end)
        
        # Rule 5.1: Start timer if not already running
        if not self.timer_running and self.snd_una < self.snd_nxt:
            self._start_timer()
    
    def receive_ack(self, ack_num, segment):
        """Called when ACK is received."""
        
        # Check if ACK acknowledges new data
        if ack_num > self.snd_una:
            # New data acknowledged
            self.snd_una = ack_num
            
            # Update RTO estimate if applicable (Karn's algorithm handled elsewhere)
            if not segment.was_retransmitted:
                sample_rtt = self._get_rtt_for_segment(segment)
                self.rto_calculator.on_measurement(sample_rtt)
            
            # Rule 5.2: All data acknowledged?
            if self.snd_una >= self.snd_nxt:
                self._stop_timer()
            
            # Rule 5.3: Still have outstanding data? Restart timer
            elif self.snd_una < self.snd_nxt:
                self._restart_timer()
    
    def handle_timeout(self):
        """Called when the retransmission timer expires."""
        # This is a genuine timeout (not a spurious one)
        
        # RFC 6298 Section 5.5: Backoff RTO
        self.rto_calculator.backoff()
        
        # RFC 6298 Section 5.4: Retransmit earliest unacknowledged segment
        self._retransmit_segment(self.snd_una)
        
        # RFC 6298 Section 5.6: Restart timer with backed-off RTO
        self._start_timer()
    
    def _start_timer(self):
        """Start the retransmission timer."""
        self.timer_running = True
        rto = self.rto_calculator.get_rto()
        self.timer_expiry = current_time() + rto
        print(f"Timer started: expires in {rto}ms")
    
    def _stop_timer(self):
        """Stop the retransmission timer."""
        self.timer_running = False
        self.timer_expiry = None
        print("Timer stopped: all data acknowledged")
    
    def _restart_timer(self):
        """Restart the timer with current RTO."""
        rto = self.rto_calculator.get_rto()
        self.timer_expiry = current_time() + rto
        print(f"Timer restarted: expires in {rto}ms")

Why Restart on Progress?

Restarting the timer when new data is acknowledged prevents a subtle problem:

Sender has 10 segments in flight
Timer is set for the oldest (segment 1)
ACK arrives for segments 1-9
Only segment 10 remains unacknowledged
If timer wasn't restarted, it would expire based on segment 1's send time, not segment 10's
This is too aggressive—segment 10 deserves its full RTO

By restarting on each progressive ACK, we ensure the timer tracks the actual oldest unacknowledged data.

Single Timer per Connection

RFC 6298 recommends a single retransmission timer per connection, not per-segment timers. This simplifies implementation and aligns with the cumulative ACK nature of TCP. The timer always tracks the oldest unacknowledged segment; when it's acknowledged, the timer restarts for the next oldest.

Complete RFC 6298 Implementation

Let's bring everything together into a complete, RFC 6298-compliant implementation:

rfc6298_complete.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
"""
Complete RFC 6298 RTO Implementation
 
This implementation includes:
- Jacobson's algorithm for SRTT/RTTVAR estimation
- Karn's algorithm for retransmission handling
- All bounds and constraints from RFC 6298
- Timer management per RFC 6298 Section 5
"""
 
from dataclasses import dataclass
from typing import Optional
import time
 
@dataclass
class RTOState:
    """Complete RTO calculator state."""
    srtt: Optional[float] = None      # Smoothed RTT (ms)
    rttvar: Optional[float] = None    # RTT Variance (ms)
    rto: float = 1000                 # Retransmission timeout (ms)
    
    # Constants (RFC 6298)
    ALPHA: float = 1/8
    BETA: float = 1/4
    K: float = 4
    G: float = 1                      # Clock granularity (ms)
    MIN_RTO: float = 1000             # 1 second
    MAX_RTO: float = 60000            # 60 seconds
    
    def on_first_rtt(self, R: float) -> float:
        """Handle first RTT measurement. Section 2.2."""
        self.srtt = R
        self.rttvar = R / 2
        self._update_rto()
        return self.rto
    
    def on_rtt_measurement(self, R: float) -> float:
        """Handle subsequent RTT measurement. Section 2.3."""
        if self.srtt is None:
            return self.on_first_rtt(R)
        
        # IMPORTANT: Calculate err using OLD SRTT
        err = R - self.srtt
        
        # Update RTTVAR first (uses old SRTT)
        self.rttvar = (1 - self.BETA) * self.rttvar + self.BETA * abs(err)
        
        # Update SRTT
        self.srtt = (1 - self.ALPHA) * self.srtt + self.ALPHA * R
        
        self._update_rto()
        return self.rto
    
    def on_timeout(self) -> float:
        """Handle retransmission timeout. Section 5.5."""
        # Exponential backoff
        self.rto = min(self.rto * 2, self.MAX_RTO)
        return self.rto
    
    def _update_rto(self):
        """Compute RTO from current SRTT and RTTVAR."""
        # RTO = SRTT + max(G, K * RTTVAR)
        variance_term = max(self.G, self.K * self.rttvar)
        self.rto = self.srtt + variance_term
        
        # Apply bounds
        self.rto = max(self.rto, self.MIN_RTO)
        self.rto = min(self.rto, self.MAX_RTO)
 
 
class TCPConnectionRTO:
    """
    Complete TCP connection RTO management.
    Includes segment tracking and Karn's algorithm.
    """
    
    def __init__(self):
        self.state = RTOState()
        
        # Segment tracking: seq_num -> (send_time, was_retransmitted)
        self.pending = {}
        
        # Timer state
        self.timer_expiry: Optional[float] = None
    
    def segment_sent(self, seq_num: int, is_retransmit: bool = False):
        """Record segment transmission."""
        now = time.time() * 1000  # Current time in ms
        
        if seq_num in self.pending:
            # Mark as retransmitted (for Karn's algorithm)
            self.pending[seq_num] = (self.pending[seq_num][0], True)
        else:
            self.pending[seq_num] = (now, is_retransmit)
        
        # Start/restart timer
        if self.timer_expiry is None:
            self.timer_expiry = now + self.state.rto
    
    def ack_received(self, ack_num: int):
        """Process acknowledgment."""
        now = time.time() * 1000
        
        # Find acknowledged segments
        acked_seqs = [s for s in self.pending.keys() if s < ack_num]
        
        for seq in acked_seqs:
            send_time, was_retransmitted = self.pending.pop(seq)
            
            # Karn's Rule 1: Only use clean samples
            if not was_retransmitted:
                sample_rtt = now - send_time
                self.state.on_rtt_measurement(sample_rtt)
        
        # Timer management
        if not self.pending:
            # All acknowledged: stop timer
            self.timer_expiry = None
        else:
            # Restart timer for remaining data
            self.timer_expiry = now + self.state.rto
    
    def check_timeout(self) -> bool:
        """Check if timeout has occurred."""
        if self.timer_expiry is None:
            return False
        
        now = time.time() * 1000
        if now >= self.timer_expiry:
            # Timeout occurred
            # Karn's Rule 2: Back off
            self.state.on_timeout()
            return True
        
        return False
    
    def get_current_rto(self) -> float:
        return self.state.rto
 
 
# === Demonstration ===
if __name__ == "__main__":
    conn = TCPConnectionRTO()
    
    print("=== RFC 6298 RTO Calculation Demo ===\n")
    
    # Simulate connection establishment
    print("Connection established, first RTT measurement: 100ms")
    conn.state.on_first_rtt(100)
    print(f"  SRTT={conn.state.srtt}ms, RTTVAR={conn.state.rttvar}ms, RTO={conn.state.rto}ms")
    print(f"  (Initial RTO = 3 × first RTT = 300ms, bounded to min 1000ms)\n")
    
    # Simulate some data transfer
    measurements = [95, 105, 98, 102, 100, 97, 103]
    print("Subsequent RTT measurements:")
    for i, rtt in enumerate(measurements, 2):
        conn.state.on_rtt_measurement(rtt)
        print(f"  Sample {i}: R={rtt}ms -> SRTT={conn.state.srtt:.1f}, RTTVAR={conn.state.rttvar:.1f}, RTO={conn.state.rto:.1f}")
    
    print("\n(Note: RTO stays at minimum 1000ms even though calculated value is lower)")

Production Considerations

Real TCP implementations add additional complexity: handling out-of-order segments, SACK-based retransmission, multiple segment tracking, and integration with congestion control. This implementation captures the core RFC 6298 algorithm; production stacks build significant infrastructure around it.

Summary: The Complete RTO Algorithm

We've now covered the complete RTO calculation as specified in RFC 6298. Let's consolidate the key takeaways:

Key Takeaways

•RFC 6298 is the standard — This RFC codifies Jacobson's algorithm, Karn's algorithm, and all the bounds and practices for RTO computation.
•RTO = SRTT + max(G, 4 × RTTVAR) — The fundamental formula combines mean RTT with a variance-based safety margin.
•Initialization matters — Before first measurement: RTO = 1s. After first measurement R: SRTT = R, RTTVAR = R/2, RTO = 3R (bounded).
•Update order matters — RTTVAR must be updated before SRTT to use the correct error value.
•Minimum RTO provides safety — The 1-second minimum prevents spurious retransmissions in most Internet scenarios.
•Timer management follows specific rules — Start on first unACKed send, restart on progress, stop when all acknowledged.
•Backoff on timeout — When timeout occurs, RTO doubles. This continues until clean samples arrive.

What's next:

We've seen how RTO is calculated and how timeouts trigger backoff. But what happens after a timeout? The next page explores Exponential Backoff in depth—the mechanism that progressively increases RTO after repeated timeouts, preventing network overload during congestion events.

Page Complete

You now understand the complete RFC 6298 RTO calculation algorithm—initialization, updates, bounds, timer management, and implementation details. This is the culmination of RTT estimation, Jacobson's algorithm, and Karn's algorithm into a practical, standardized procedure. Next, we'll dive deep into exponential backoff behavior.

RTO Calculation: The Complete RFC 6298 Algorithm

The Moment of Truth

In this page, we'll dissect RFC 6298 step by step, understanding not just what the algorithm specifies, but why each element exists.

What You Will Learn

RFC 6298 Overview: The Standard Specification

Key Components

The RFC defines:

State Variables: SRTT (Smoothed RTT) and RTTVAR (RTT Variance)
Constants: Smoothing factors α = 1/8, β = 1/4, and variance multiplier K = 4
Bounds: Minimum RTO (at least 1 second), granularity adjustments
Initialization: First measurement handling
Update Rules: How to incorporate new RTT samples
Timer Management: How to set and manage the retransmission timer

The Core Equation

The fundamental RTO calculation is:

RTO = SRTT + max(G, K × RTTVAR)

Where:

SRTT = Smoothed Round-Trip Time
RTTVAR = Round-Trip Time Variance
G = Clock granularity (minimum timer precision)
K = 4 (variance multiplier)

RFC 6298 Constants and Their Purposes
Constant	Value	Purpose	Implementation Note
α (SRTT smoothing)	1/8	Weight for new RTT sample in mean estimation	Implemented as >> 3
β (RTTVAR smoothing)	1/4	Weight for new deviation in variance estimation	Implemented as >> 2
K (variance multiplier)	4	Safety margin multiplier for variance term	Implemented as << 2
Minimum RTO	≥ 1 second	Lower bound on RTO to prevent spurious retrans	Some systems use 200ms
G (granularity)	System-dependent	Clock timer resolution	Often negligible on modern systems

RFC 6298 vs. RFC 2988

Initialization: Before Any Measurements

When a TCP connection is first established, there's no RTT history to work with. RFC 6298 specifies a two-phase initialization:

Phase 1: Before First RTT Measurement

Until the first RTT sample is obtained (typically from the SYN-ACK during connection establishment):

RTO = 1 second

This is a conservative initial value. It's long enough to accommodate most networks while short enough not to stall connection setup excessively.

Phase 2: After First RTT Measurement

When the first RTT measurement R is made:

SRTT = R

RTTVAR = R / 2

RTO = SRTT + max(G, K × RTTVAR) = R + max(G, 4 × R/2) = R + max(G, 2R)

Assuming G < 2R (which is almost always true):

RTO = R + 2R = 3R

The initial RTO is thus 3× the first RTT measurement.

rto_initialization.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
class TCPRTO:
    """TCP RTO Calculator following RFC 6298."""
    
    # Constants from RFC 6298
    ALPHA = 1/8           # SRTT smoothing factor
    BETA = 1/4            # RTTVAR smoothing factor
    K = 4                 # Variance multiplier
    MIN_RTO = 1000        # Minimum RTO in milliseconds (1 second)
    INITIAL_RTO = 1000    # Initial RTO before any measurements
    MAX_RTO = 60000       # Maximum RTO in milliseconds (60 seconds)
    CLOCK_GRANULARITY = 1 # G: Assume 1ms granularity on modern systems
    
    def __init__(self):
        # State: None indicates no measurements yet
        self.srtt = None
        self.rttvar = None
        self.rto = self.INITIAL_RTO  # Start with 1 second
        self._first_measurement = True
    
    def on_first_measurement(self, R: float):
        """
        Handle the first RTT measurement.
        
        Per RFC 6298 Section 2.2:
        - SRTT <- R
        - RTTVAR <- R/2
        - RTO <- SRTT + max(G, K*RTTVAR)
        """
        self.srtt = R
        self.rttvar = R / 2
        
        # Calculate RTO with granularity consideration
        variance_term = max(self.CLOCK_GRANULARITY, self.K * self.rttvar)
        self.rto = self.srtt + variance_term
        
        # Apply minimum bound
        self.rto = max(self.rto, self.MIN_RTO)
        
        self._first_measurement = False
        
        print(f"First measurement: R={R}ms")
        print(f"  SRTT={self.srtt}ms, RTTVAR={self.rttvar}ms")
        print(f"  RTO={self.rto}ms (= {R} + 4×{R/2} = 3×{R}ms)")
 
 
# Example: Connection to a server with 100ms RTT
rto_calc = TCPRTO()
print(f"Initial RTO (no measurements): {rto_calc.rto}ms")
 
# First RTT measurement from SYN-ACK
rto_calc.on_first_measurement(100)
# Output: RTO = 300ms (3 × first measurement)

Why RTTVAR = R/2?

Setting the initial variance to half the first measurement is a heuristic that errs on the side of caution:

If the network is stable, RTTVAR will quickly decrease as subsequent measurements confirm the initial estimate.
If the network is variable, the 2R safety margin provides protection while we gather more data.
The resulting 3R initial RTO is conservative but not excessively so.

This heuristic has proven robust across decades of Internet operation.

Connection Establishment

Subsequent Measurements: The Steady-State Algorithm

After the first measurement, each new RTT sample R' updates the estimator using Jacobson's algorithm. RFC 6298 Section 2.3 specifies:

Step 1: Compute Error

Err = R' - SRTT

The error is the difference between the new sample and the current estimate.

Step 2: Update RTTVAR

RTTVAR = (1 - β) × RTTVAR + β × |Err|

RTTVAR = (1 - 1/4) × RTTVAR + (1/4) × |Err|

RTTVAR = 3/4 × RTTVAR + 1/4 × |Err|

Important: RTTVAR is updated before SRTT. This ensures we use the old SRTT value (not yet updated) when calculating the error magnitude.

Step 3: Update SRTT

SRTT = (1 - α) × SRTT + α × R'

SRTT = (1 - 1/8) × SRTT + (1/8) × R'

SRTT = 7/8 × SRTT + 1/8 × R'

Step 4: Compute RTO

RTO = SRTT + max(G, K × RTTVAR)

RTO = SRTT + max(G, 4 × RTTVAR)

Step 5: Apply Bounds

RTO = max(RTO, MinRTO)

RTO = min(RTO, MaxRTO) (optional but common)

rto_update.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
def on_subsequent_measurement(self, R_prime: float):
    """
    Handle subsequent RTT measurements.
    
    Per RFC 6298 Section 2.3, the update order matters:
    1. Update RTTVAR using OLD SRTT
    2. Update SRTT
    3. Recompute RTO
    """
    # Step 1: Compute error using OLD SRTT
    err = R_prime - self.srtt
    
    # Step 2: Update RTTVAR (using OLD SRTT for error calculation)
    # RTTVAR = (1 - β) * RTTVAR + β * |err|
    # With β = 1/4: RTTVAR = 3/4 * RTTVAR + 1/4 * |err|
    abs_err = abs(err)
    self.rttvar = (1 - self.BETA) * self.rttvar + self.BETA * abs_err
    
    # Alternative formulation:
    # self.rttvar = self.rttvar + self.BETA * (abs_err - self.rttvar)
    
    # Step 3: Update SRTT
    # SRTT = (1 - α) * SRTT + α * R'
    # With α = 1/8: SRTT = 7/8 * SRTT + 1/8 * R'
    self.srtt = (1 - self.ALPHA) * self.srtt + self.ALPHA * R_prime
    
    # Alternative formulation:
    # self.srtt = self.srtt + self.ALPHA * err  # Uses err from step 1
    
    # Step 4: Compute RTO
    variance_term = max(self.CLOCK_GRANULARITY, self.K * self.rttvar)
    self.rto = self.srtt + variance_term
    
    # Step 5: Apply bounds
    self.rto = max(self.rto, self.MIN_RTO)
    self.rto = min(self.rto, self.MAX_RTO)
    
    return self.rto
 
 
# Example trace:
rto_calc = TCPRTO()
rto_calc.on_first_measurement(100)  # First: RTO = 300ms
 
measurements = [105, 95, 102, 98, 100]  # Stable network
for r in measurements:
    rto = rto_calc.on_subsequent_measurement(r)
    print(f"R={r}ms -> SRTT={rto_calc.srtt:.1f}, RTTVAR={rto_calc.rttvar:.1f}, RTO={rto:.1f}")
    
# Output shows RTTVAR decreasing as network proves stable

Why Update RTTVAR Before SRTT?

This ordering is critical and often implemented incorrectly. Consider what happens if we update SRTT first:

SRTT_new = f(SRTT_old, R')
Err = R' - SRTT_new ← Uses NEW SRTT
RTTVAR = f(RTTVAR, Err)

The error calculated with the new SRTT is artificially reduced because SRTT has already moved toward R'. This dampens RTTVAR incorrectly.

By updating RTTVAR first, we measure the deviation from what we expected (old SRTT), not from an already-adjusted value.

Implementation Pitfall

Clock Granularity Considerations

RFC 6298 includes the granularity term G in the RTO calculation:

RTO = SRTT + max(G, K × RTTVAR)

This term ensures that RTO never depends solely on a variance estimate that might be smaller than the clock resolution.

History: Why Granularity Matters

In the early days of TCP, system clocks had coarse granularity—often 500ms or even 1 second ticks. This created several problems:

Measurement quantization: RTT could only be measured in multiples of G
Artificially small RTTVAR: If all RTT samples happen to round to the same value, RTTVAR approaches zero
Minimum resolution: The timer couldn't fire more precisely than G

Including max(G, 4×RTTVAR) ensures RTO is at least one clock tick more than SRTT, even if RTTVAR is tiny.

Clock Granularity Over TCP History
Era	Typical Granularity	Impact on RTO
1980s Unix	500ms - 1s	Significant: Many RTTs fall within one tick
1990s Systems	10ms - 100ms	Moderate: Important for LAN connections
2000s Systems	1ms - 10ms	Minor: Mostly affects high-speed LANs
Modern Systems	1μs - 1ms	Negligible: G term rarely dominates

Modern Systems

On modern systems with microsecond or better clock resolution, the G term is effectively negligible. However, it remains in the specification for:

Backward compatibility: Older embedded systems may have coarser clocks
Correctness under edge cases: Even modern systems might have timer precision issues under heavy load
Specification completeness: The algorithm should work correctly regardless of clock characteristics

Practical Implementation

granularity_handling.c
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
/* RFC 6298 RTO calculation with granularity */
 
/* Clock granularity in same units as SRTT/RTTVAR (e.g., milliseconds) */
#define CLOCK_G  1  /* 1ms granularity on modern systems */
 
/* Calculate RTO from SRTT and RTTVAR */
unsigned int calculate_rto(unsigned int srtt, unsigned int rttvar) {
    unsigned int variance_term;
    unsigned int rto;
    
    /* RFC 6298: RTO = SRTT + max(G, K*RTTVAR) */
    variance_term = 4 * rttvar;  /* K = 4 */
    
    if (variance_term < CLOCK_G) {
        variance_term = CLOCK_G;  /* max(G, K*RTTVAR) */
    }
    
    rto = srtt + variance_term;
    
    /* Apply minimum bound from RFC 6298 */
    if (rto < MIN_RTO) {
        rto = MIN_RTO;
    }
    
    /* Apply maximum bound (implementation-specific) */
    if (rto > MAX_RTO) {
        rto = MAX_RTO;
    }
    
    return rto;
}
 
/* 
 * Note: In practice, the MIN_RTO bound (typically 1 second)
 * is usually larger than any value that would result from
 * the G term dominating, so G is often irrelevant.
 */

The Minimum RTO Dominates

RTO Bounds and Their Rationale

RFC 6298 specifies bounds on the RTO value, and these bounds have important rationale:

Minimum RTO (Lower Bound)

RFC 6298 states:

Whenever RTO is computed, if it is less than 1 second then the RTO SHOULD be rounded up to 1 second.

The RFC acknowledges this may be relaxed in controlled environments but maintains 1 second as the default recommendation.

Why Minimum RTO = 1 Second

•Delayed ACK interaction: Receivers may delay ACKs up to 500ms (per RFC 1122). An RTO below ~500ms risks spurious timeouts.
•Clock synchronization: Different systems may have slightly misaligned clocks; too-tight timing risks false positives.
•Transient delays: Brief network hiccups (route changes, queue bursts) can temporarily delay packets; a margin prevents overreaction.
•Congestion safety: Very aggressive retransmission can amplify congestion; the minimum provides a circuit breaker.
•Historical practice: The 1-second minimum has proven safe across decades of Internet operation.

Maximum RTO (Upper Bound)

RFC 6298 does not mandate a maximum RTO, but implementations typically enforce one (often 60-120 seconds). The rationale:

Connection liveness: At some point, we must give up or at least attempt recovery
User experience: A connection that appears frozen for minutes is effectively dead to the user
Resource management: Connections with huge RTOs consume memory and state without progress

Data Center Considerations

In controlled environments (data centers, private networks), operators sometimes relax the minimum RTO to values like 200ms or even lower. This is acceptable when:

Delayed ACKs are disabled or set to very short timers
RTT is well-known and stable (same rack, same building)
The network is overprovisioned and queuing rare
Spurious retransmission cost (extra bandwidth) is acceptable
Lower latency for loss recovery outweighs the risk

RTO Bounds in Different Environments
Environment	Typical MinRTO	Typical MaxRTO	Notes
Public Internet	1 second	60 seconds	RFC 6298 recommendation
Enterprise LAN	200ms - 1s	30-60 seconds	Often configurable
Data Center	20ms - 200ms	10-30 seconds	Optimized for low latency
High-Frequency Trading	<1ms	100ms - 1s	Extreme tuning, specialized stacks

Relaxing Minimum RTO is Risky

Timer Management: Setting and Resetting RTO

The RTO calculation tells us what the timeout should be, but RFC 6298 also specifies when to set and reset the timer:

Rule 5.1: When to Start the Timer

When a segment containing data is sent (including a retransmission), if the timer is not running, start it running so that it will expire after RTO seconds.

The timer starts when the first unacknowledged segment is sent. It doesn't restart for every segment—only if it's not already running.

Rule 5.2: When to Stop the Timer

When all outstanding data has been acknowledged, turn off the retransmission timer.

When the receiver has acknowledged everything, there's nothing to time out on.

Rule 5.3: When to Restart the Timer

When an ACK is received that acknowledges new data, restart the retransmission timer so that it will expire after RTO seconds.

Each ACK that makes progress restarts the timer with the current RTO value. This ensures the timer reflects the most recent segment, not an old one.

timer_management.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
class TCPRetransmissionTimer:
    """Manages the retransmission timer per RFC 6298 Section 5."""
    
    def __init__(self):
        self.timer_running = False
        self.timer_expiry = None
        self.rto_calculator = TCPRTO()
        
        # Track unacknowledged data
        self.snd_una = 0      # Oldest unacknowledged byte
        self.snd_nxt = 0      # Next byte to send
    
    def send_data(self, segment):
        """Called when data is transmitted."""
        segment_end = segment.sequence + len(segment.data)
        self.snd_nxt = max(self.snd_nxt, segment_end)
        
        # Rule 5.1: Start timer if not already running
        if not self.timer_running and self.snd_una < self.snd_nxt:
            self._start_timer()
    
    def receive_ack(self, ack_num, segment):
        """Called when ACK is received."""
        
        # Check if ACK acknowledges new data
        if ack_num > self.snd_una:
            # New data acknowledged
            self.snd_una = ack_num
            
            # Update RTO estimate if applicable (Karn's algorithm handled elsewhere)
            if not segment.was_retransmitted:
                sample_rtt = self._get_rtt_for_segment(segment)
                self.rto_calculator.on_measurement(sample_rtt)
            
            # Rule 5.2: All data acknowledged?
            if self.snd_una >= self.snd_nxt:
                self._stop_timer()
            
            # Rule 5.3: Still have outstanding data? Restart timer
            elif self.snd_una < self.snd_nxt:
                self._restart_timer()
    
    def handle_timeout(self):
        """Called when the retransmission timer expires."""
        # This is a genuine timeout (not a spurious one)
        
        # RFC 6298 Section 5.5: Backoff RTO
        self.rto_calculator.backoff()
        
        # RFC 6298 Section 5.4: Retransmit earliest unacknowledged segment
        self._retransmit_segment(self.snd_una)
        
        # RFC 6298 Section 5.6: Restart timer with backed-off RTO
        self._start_timer()
    
    def _start_timer(self):
        """Start the retransmission timer."""
        self.timer_running = True
        rto = self.rto_calculator.get_rto()
        self.timer_expiry = current_time() + rto
        print(f"Timer started: expires in {rto}ms")
    
    def _stop_timer(self):
        """Stop the retransmission timer."""
        self.timer_running = False
        self.timer_expiry = None
        print("Timer stopped: all data acknowledged")
    
    def _restart_timer(self):
        """Restart the timer with current RTO."""
        rto = self.rto_calculator.get_rto()
        self.timer_expiry = current_time() + rto
        print(f"Timer restarted: expires in {rto}ms")

Why Restart on Progress?

Restarting the timer when new data is acknowledged prevents a subtle problem:

Sender has 10 segments in flight
Timer is set for the oldest (segment 1)
ACK arrives for segments 1-9
Only segment 10 remains unacknowledged
If timer wasn't restarted, it would expire based on segment 1's send time, not segment 10's
This is too aggressive—segment 10 deserves its full RTO

By restarting on each progressive ACK, we ensure the timer tracks the actual oldest unacknowledged data.

Single Timer per Connection

Complete RFC 6298 Implementation

Let's bring everything together into a complete, RFC 6298-compliant implementation:

rfc6298_complete.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
"""
Complete RFC 6298 RTO Implementation
 
This implementation includes:
- Jacobson's algorithm for SRTT/RTTVAR estimation
- Karn's algorithm for retransmission handling
- All bounds and constraints from RFC 6298
- Timer management per RFC 6298 Section 5
"""
 
from dataclasses import dataclass
from typing import Optional
import time
 
@dataclass
class RTOState:
    """Complete RTO calculator state."""
    srtt: Optional[float] = None      # Smoothed RTT (ms)
    rttvar: Optional[float] = None    # RTT Variance (ms)
    rto: float = 1000                 # Retransmission timeout (ms)
    
    # Constants (RFC 6298)
    ALPHA: float = 1/8
    BETA: float = 1/4
    K: float = 4
    G: float = 1                      # Clock granularity (ms)
    MIN_RTO: float = 1000             # 1 second
    MAX_RTO: float = 60000            # 60 seconds
    
    def on_first_rtt(self, R: float) -> float:
        """Handle first RTT measurement. Section 2.2."""
        self.srtt = R
        self.rttvar = R / 2
        self._update_rto()
        return self.rto
    
    def on_rtt_measurement(self, R: float) -> float:
        """Handle subsequent RTT measurement. Section 2.3."""
        if self.srtt is None:
            return self.on_first_rtt(R)
        
        # IMPORTANT: Calculate err using OLD SRTT
        err = R - self.srtt
        
        # Update RTTVAR first (uses old SRTT)
        self.rttvar = (1 - self.BETA) * self.rttvar + self.BETA * abs(err)
        
        # Update SRTT
        self.srtt = (1 - self.ALPHA) * self.srtt + self.ALPHA * R
        
        self._update_rto()
        return self.rto
    
    def on_timeout(self) -> float:
        """Handle retransmission timeout. Section 5.5."""
        # Exponential backoff
        self.rto = min(self.rto * 2, self.MAX_RTO)
        return self.rto
    
    def _update_rto(self):
        """Compute RTO from current SRTT and RTTVAR."""
        # RTO = SRTT + max(G, K * RTTVAR)
        variance_term = max(self.G, self.K * self.rttvar)
        self.rto = self.srtt + variance_term
        
        # Apply bounds
        self.rto = max(self.rto, self.MIN_RTO)
        self.rto = min(self.rto, self.MAX_RTO)
 
 
class TCPConnectionRTO:
    """
    Complete TCP connection RTO management.
    Includes segment tracking and Karn's algorithm.
    """
    
    def __init__(self):
        self.state = RTOState()
        
        # Segment tracking: seq_num -> (send_time, was_retransmitted)
        self.pending = {}
        
        # Timer state
        self.timer_expiry: Optional[float] = None
    
    def segment_sent(self, seq_num: int, is_retransmit: bool = False):
        """Record segment transmission."""
        now = time.time() * 1000  # Current time in ms
        
        if seq_num in self.pending:
            # Mark as retransmitted (for Karn's algorithm)
            self.pending[seq_num] = (self.pending[seq_num][0], True)
        else:
            self.pending[seq_num] = (now, is_retransmit)
        
        # Start/restart timer
        if self.timer_expiry is None:
            self.timer_expiry = now + self.state.rto
    
    def ack_received(self, ack_num: int):
        """Process acknowledgment."""
        now = time.time() * 1000
        
        # Find acknowledged segments
        acked_seqs = [s for s in self.pending.keys() if s < ack_num]
        
        for seq in acked_seqs:
            send_time, was_retransmitted = self.pending.pop(seq)
            
            # Karn's Rule 1: Only use clean samples
            if not was_retransmitted:
                sample_rtt = now - send_time
                self.state.on_rtt_measurement(sample_rtt)
        
        # Timer management
        if not self.pending:
            # All acknowledged: stop timer
            self.timer_expiry = None
        else:
            # Restart timer for remaining data
            self.timer_expiry = now + self.state.rto
    
    def check_timeout(self) -> bool:
        """Check if timeout has occurred."""
        if self.timer_expiry is None:
            return False
        
        now = time.time() * 1000
        if now >= self.timer_expiry:
            # Timeout occurred
            # Karn's Rule 2: Back off
            self.state.on_timeout()
            return True
        
        return False
    
    def get_current_rto(self) -> float:
        return self.state.rto
 
 
# === Demonstration ===
if __name__ == "__main__":
    conn = TCPConnectionRTO()
    
    print("=== RFC 6298 RTO Calculation Demo ===\n")
    
    # Simulate connection establishment
    print("Connection established, first RTT measurement: 100ms")
    conn.state.on_first_rtt(100)
    print(f"  SRTT={conn.state.srtt}ms, RTTVAR={conn.state.rttvar}ms, RTO={conn.state.rto}ms")
    print(f"  (Initial RTO = 3 × first RTT = 300ms, bounded to min 1000ms)\n")
    
    # Simulate some data transfer
    measurements = [95, 105, 98, 102, 100, 97, 103]
    print("Subsequent RTT measurements:")
    for i, rtt in enumerate(measurements, 2):
        conn.state.on_rtt_measurement(rtt)
        print(f"  Sample {i}: R={rtt}ms -> SRTT={conn.state.srtt:.1f}, RTTVAR={conn.state.rttvar:.1f}, RTO={conn.state.rto:.1f}")
    
    print("\n(Note: RTO stays at minimum 1000ms even though calculated value is lower)")

Production Considerations

Summary: The Complete RTO Algorithm

We've now covered the complete RTO calculation as specified in RFC 6298. Let's consolidate the key takeaways:

Key Takeaways

•RFC 6298 is the standard — This RFC codifies Jacobson's algorithm, Karn's algorithm, and all the bounds and practices for RTO computation.
•RTO = SRTT + max(G, 4 × RTTVAR) — The fundamental formula combines mean RTT with a variance-based safety margin.
•Initialization matters — Before first measurement: RTO = 1s. After first measurement R: SRTT = R, RTTVAR = R/2, RTO = 3R (bounded).
•Update order matters — RTTVAR must be updated before SRTT to use the correct error value.
•Minimum RTO provides safety — The 1-second minimum prevents spurious retransmissions in most Internet scenarios.
•Timer management follows specific rules — Start on first unACKed send, restart on progress, stop when all acknowledged.
•Backoff on timeout — When timeout occurs, RTO doubles. This continues until clean samples arrive.

What's next:

Page Complete