The fundamental distinction between link-state and distance-vector routing protocols isn't merely academic—it represents two philosophically different approaches to solving the routing problem. Distance-vector protocols operate on trust: each router believes what its neighbors tell it about the network and makes decisions based on this second-hand information. Link-state protocols operate on direct knowledge: every router independently constructs a complete map of the network and calculates its own optimal paths.
This difference has profound implications for convergence speed, loop prevention, troubleshooting, and scalability. OSPF's effectiveness as an enterprise routing protocol stems directly from its link-state foundation. Understanding how link-state databases are built, maintained, and used for path computation is essential for mastering OSPF's behavior.
By the end of this page, you will understand the complete link-state operational model: how routers discover and advertise topology information, how the Link-State Database (LSDB) is constructed and synchronized, how Dijkstra's Shortest Path First algorithm computes optimal routes, and how reliable flooding ensures all routers maintain identical network views. You'll grasp the mathematical elegance that makes OSPF provably loop-free.
Link-state routing protocols implement a deceptively simple concept: every router should have complete knowledge of the network topology. With this complete knowledge, each router can independently compute optimal paths to every destination. The elegance lies in how this shared knowledge is achieved and maintained.
The Five Stages of Link-State Operation:
Every link-state protocol, including OSPF, follows this operational sequence:
Stage 1: Neighbor Discovery. Routers identify their directly connected neighbors through the Hello protocol. This establishes the 'local survey'—each router knows who its immediate neighbors are.
Stage 2: Link-State Database Construction. Each router creates Link-State Advertisements (LSAs) describing its local topology—its own interfaces, connected networks, and neighbors. This is the router's contribution to the collective map.
Stage 3: Reliable Flooding. Routers distribute their LSAs to all other routers in the domain. This flooding is reliable—every LSA is acknowledged, and retransmissions occur until acknowledgment is received.
Stage 4: Database Synchronization. Once flooding is complete, every router has received every other router's LSAs. The collection of all LSAs forms the Link-State Database (LSDB)—a complete topological map of the network.
Stage 5: Shortest Path Computation. Using the LSDB as input, each router runs the Dijkstra SPF algorithm to compute the shortest path tree rooted at itself. This tree determines the next hop for every destination.
Imagine a kingdom where each city has a cartographer. Each cartographer surveys only their own city—its roads and connections to neighboring cities. They share these surveys with messengers who distribute copies to every other city. Once complete, every cartographer has everyone's surveys and can construct an identical kingdom map. From this map, each can independently calculate the best path to any destination. Link-state routing works exactly this way.
The Link-State Database (LSDB) is the heart of OSPF operation—a structured collection of all LSAs describing the network topology. Every router in an OSPF area maintains an identical LSDB, and this synchronization is what enables loop-free, consistent routing.
LSDB Structure:
The LSDB is organized as a collection of LSAs, each identified by a unique tuple: the LS Type, the Link State ID, and the Advertising Router.
This three-part key ensures every LSA is uniquely identifiable across the network.
Critical LSDB Properties:
- Identical on every router within an area; this synchronization is what makes consistent, loop-free routing possible.
- Built from first-hand information: each LSA is originated by the router that directly owns the links it describes.
- Purely topological: the LSDB stores the map, not the routes; routes are derived from it by the SPF calculation.
LSA Lifecycle Management:
Every LSA has associated metadata controlling its lifecycle:
| Field | Size | Purpose | Key Values |
|---|---|---|---|
| LS Age | 16 bits | Time since LSA originated | 0-3600 seconds (MaxAge) |
| LS Sequence Number | 32 bits | Version number for updates | 0x80000001 to 0x7FFFFFFF |
| LS Checksum | 16 bits | Integrity verification | Fletcher checksum of LSA body |
| Length | 16 bits | Total LSA size in bytes | Varies by LSA type |
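The LS Checksum field uses a Fletcher checksum over the LSA body. As a rough illustration of how a Fletcher checksum accumulates two running sums, here is a minimal sketch; it deliberately omits the checksum-field-position adjustment that the real ISO 8473 computation used by OSPF performs.

```python
def fletcher16(data: bytes) -> int:
    """Plain Fletcher-16 over a byte string (simplified; the real OSPF
    LSA checksum also accounts for the position of the checksum field)."""
    c0 = c1 = 0
    for byte in data:
        c0 = (c0 + byte) % 255
        c1 = (c1 + c0) % 255
    return (c1 << 8) | c0

# Example: checksum of an arbitrary byte string
print(hex(fletcher16(b"example LSA body")))
```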
The LSA Refresh Mechanism:
LSAs are not permanent—they age and must be refreshed to remain valid:
- LS Age counts up from 0 at origination toward MaxAge (3600 seconds).
- Every 30 minutes (LSRefreshTime, 1800 seconds), the originating router refloods the LSA with an incremented sequence number, resetting its age throughout the network.
- If an LSA reaches MaxAge without a refresh, it is reflooded at MaxAge to flush it and is then removed from every LSDB.
This aging mechanism ensures that if a router fails without explicitly withdrawing its LSAs, the stale entries will eventually age out (within 1 hour maximum).
The sequence number space is designed to prevent wrap-around issues. Starting at 0x80000001 (using signed comparison), the sequence number increments until reaching 0x7FFFFFFF. Before wrap-around can occur, the router must flush the old LSA (set to MaxAge) and wait for it to be removed from all databases before restarting at 0x80000001. This ensures newer LSAs are always distinguishable from older ones.
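The identification and versioning rules above can be captured in a small sketch. This is a conceptual illustration only: the names (LSAHeader, is_newer, needs_refresh) are hypothetical, and newness is decided here by sequence number alone, whereas RFC 2328 also uses checksum and age as tie-breakers.

```python
from dataclasses import dataclass

# Constants from the LSA metadata table above
MAX_AGE = 3600          # seconds; an LSA at MaxAge is flushed
LS_REFRESH_TIME = 1800  # seconds; originator refloods well before MaxAge
INITIAL_SEQUENCE = 0x80000001
MAX_SEQUENCE = 0x7FFFFFFF

def signed32(value):
    """Interpret a 32-bit value as signed, as OSPF sequence comparison does."""
    return value - 0x100000000 if value >= 0x80000000 else value

@dataclass
class LSAHeader:
    ls_type: int             # e.g. 1 = Router LSA
    link_state_id: str       # identifies the object the LSA describes
    advertising_router: str  # Router ID of the originator
    ls_age: int              # seconds since origination
    ls_sequence: int         # version number of this LSA instance

    def key(self):
        # The three-part key that uniquely identifies an LSA in the LSDB
        return (self.ls_type, self.link_state_id, self.advertising_router)

def is_newer(a, b):
    """Simplified newness test: higher (signed) sequence number wins.
    The full RFC 2328 rules also compare checksum and age as tie-breakers."""
    return signed32(a.ls_sequence) > signed32(b.ls_sequence)

def needs_refresh(lsa):
    # The originator refloods the LSA with an incremented sequence number
    # every LS_REFRESH_TIME so it never ages out at MAX_AGE.
    return lsa.ls_age >= LS_REFRESH_TIME
```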
Reliable flooding is the mechanism that distributes LSAs to all routers, ensuring database synchronization. Unlike simple broadcast, OSPF's flooding is:
- Reliable: every LSA must be acknowledged, and unacknowledged LSAs are retransmitted.
- Selective: a router floods an LSA onward only when it is newer than the copy already in its LSDB, so flooding terminates naturally.
- Scoped: each LSA type has a defined flooding scope, confining it to an area or extending it to the whole routing domain.
The Flooding Algorithm:
When a router receives an LSA (in a Link-State Update packet), it executes the following decision process (a code sketch follows this list):
1. Validate the LSA's checksum and type; discard it if invalid.
2. Look up the LSA in the local LSDB by its (LS Type, Link State ID, Advertising Router) key.
3. If no copy exists, or the received LSA is newer, install it, flood it out all other interfaces in scope, and acknowledge it.
4. If the local copy is newer, send that newer copy back to the neighbor.
5. If the two copies are the same instance, acknowledge the duplicate without reflooding.
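A minimal sketch of that decision logic, reusing the simplified is_newer comparison from the earlier LSA-header sketch. The neighbor objects with send_update/send_ack methods and the lsdb dictionary are illustrative stand-ins, not a real OSPF API, and checks such as MinLSArrival and MaxAge handling are omitted.

```python
def on_receive_lsa(lsdb, received, from_neighbor, other_neighbors):
    """Simplified flooding decision for one LSA received in a Link-State Update.

    lsdb: dict mapping (ls_type, link_state_id, advertising_router) -> stored header
    """
    key = received.key()
    stored = lsdb.get(key)

    if stored is None or is_newer(received, stored):
        # New or newer: install, flood onward, and acknowledge
        lsdb[key] = received
        for neighbor in other_neighbors:        # never flooded back out the
            neighbor.send_update(received)      # interface it arrived on
        from_neighbor.send_ack(received)
    elif is_newer(stored, received):
        # Our copy is newer: send it back so the neighbor catches up
        from_neighbor.send_update(stored)
    else:
        # Duplicate of what we already have: acknowledge, do not reflood
        from_neighbor.send_ack(received)
```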
Flooding Scope and Area Boundaries:
Not all LSAs flood everywhere. OSPF defines flooding scope based on LSA type:
- Area scope: Router (Type 1) and Network (Type 2) LSAs, along with the Summary LSAs (Types 3 and 4) that ABRs originate, are flooded only within a single area.
- AS scope: External LSAs (Type 5) are flooded throughout the entire OSPF domain, except into stub areas.
- NSSA scope: Type 7 LSAs are confined to their not-so-stubby area.
This controlled flooding is what enables OSPF's hierarchical design—different areas have different LSDBs (for area-scoped LSAs) but share the same external route information.
On broadcast networks with a Designated Router, flooding is optimized. Routers send LSAs only to the DR (using 224.0.0.6), and the DR floods them back out to all routers (using 224.0.0.5). This reduces duplicate LSAs and flooding overhead. The DR acts as a flooding relay for the segment.
Retransmission and Reliability:
OSPF ensures every LSA is received by every neighbor through retransmission (see the sketch after this list):
- Each LSA flooded to a neighbor is placed on that neighbor's link-state retransmission list.
- The LSA stays on the list until the neighbor acknowledges it with a Link-State Acknowledgment (or implicitly, by echoing the same LSA back).
- Unacknowledged LSAs are retransmitted every RxmtInterval (5 seconds by default) until acknowledged or until the adjacency is torn down.
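A minimal bookkeeping sketch for one neighbor's retransmission list; the class and method names are illustrative, and only the 5-second RxmtInterval default comes from the protocol.

```python
import time

class RetransmissionList:
    """Tracks LSAs flooded to one neighbor but not yet acknowledged."""

    def __init__(self, rxmt_interval=5.0):  # RxmtInterval default: 5 seconds
        self.rxmt_interval = rxmt_interval
        self.pending = {}   # LSA key -> (lsa, time of last transmission)

    def flooded(self, lsa):
        # Called when an LSA is sent to this neighbor in a Link-State Update
        self.pending[lsa.key()] = (lsa, time.monotonic())

    def acknowledged(self, lsa_key):
        # Called when a Link-State Acknowledgment covers this LSA
        self.pending.pop(lsa_key, None)

    def due_for_retransmission(self):
        # LSAs whose last transmission is older than RxmtInterval
        now = time.monotonic()
        return [lsa for lsa, sent_at in self.pending.values()
                if now - sent_at >= self.rxmt_interval]
```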
This reliable flooding mechanism is what makes OSPF's database synchronization robust even in unstable network conditions.
At the core of OSPF's path computation lies Dijkstra's Shortest Path First (SPF) algorithm—an algorithm developed by Edsger Dijkstra in 1956 and published in 1959. This algorithm efficiently finds the shortest path from a single source node to all other nodes in a weighted graph.
Why Dijkstra's Algorithm:
OSPF requires an algorithm that:
- Computes shortest paths from a single source (the computing router) to every destination
- Produces provably loop-free results
- Is deterministic, so routers with identical LSDBs reach identical conclusions
- Scales efficiently to topologies with hundreds or thousands of routers
Dijkstra's algorithm satisfies all these requirements with time complexity of O((V + E) log V) using a priority queue, where V is the number of vertices (routers) and E is the number of edges (links).
The Algorithm Conceptually:
Dijkstra's algorithm works by progressively building a Shortest Path Tree (SPT) rooted at the computing router:
1. Initialize: set the root's distance to 0 and every other node's distance to infinity.
2. Select the unvisited node with the smallest tentative distance.
3. Add that node to the SPT (mark it visited); its distance is now final.
4. Relax its neighbors: if reaching a neighbor through this node is cheaper than its current tentative distance, update the distance and record the predecessor.
5. Repeat from step 2 until every reachable node is in the SPT.
When complete, the SPT contains the shortest path from the root to every other node.
```python
import heapq


def dijkstra_spf(graph, source):
    """
    Dijkstra's SPF Algorithm - Conceptual Implementation

    graph: Dictionary where graph[node] = [(neighbor, cost), ...]
    source: The root node (computing router)

    Returns: Dictionary of {node: (distance, predecessor)}
    """
    # Initialize distances and predecessors
    distance = {node: float('infinity') for node in graph}
    predecessor = {node: None for node in graph}
    distance[source] = 0

    # Priority queue: (distance, node)
    # Using a min-heap to always process the closest unvisited node
    priority_queue = [(0, source)]
    visited = set()

    while priority_queue:
        # Step 2: Select node with smallest distance
        current_distance, current_node = heapq.heappop(priority_queue)

        # Skip if already processed (the queue may hold duplicate entries)
        if current_node in visited:
            continue

        # Step 3: Mark as visited (added to SPT)
        visited.add(current_node)

        # Step 4: Relax all neighbors
        for neighbor, edge_cost in graph[current_node]:
            if neighbor in visited:
                continue

            new_distance = current_distance + edge_cost

            # If this path is shorter, update
            if new_distance < distance[neighbor]:
                distance[neighbor] = new_distance
                predecessor[neighbor] = current_node
                heapq.heappush(priority_queue, (new_distance, neighbor))

    return {node: (distance[node], predecessor[node]) for node in graph}


# Example OSPF-like topology
# Router IDs: R1, R2, R3, R4, R5
topology = {
    'R1': [('R2', 10), ('R3', 5)],
    'R2': [('R1', 10), ('R3', 2), ('R4', 1)],
    'R3': [('R1', 5), ('R2', 2), ('R5', 4)],
    'R4': [('R2', 1), ('R5', 3)],
    'R5': [('R3', 4), ('R4', 3)]
}

# Compute SPF from R1's perspective
result = dijkstra_spf(topology, 'R1')
print("Shortest paths from R1:")
for router, (dist, pred) in result.items():
    print(f"  {router}: cost={dist}, via={pred}")
```

Visual Walkthrough:
Let's trace through the algorithm on a simple topology to see how OSPF computes paths:
| Iteration | Process | Distance to R2 | Distance to R3 | Distance to R4 | Distance to R5 | SPT Nodes |
|---|---|---|---|---|---|---|
| 0 (Init) | Start at R1 | ∞ | ∞ | ∞ | ∞ | {R1} |
| 1 | Relax R1's neighbors | 10 (via R1) | 5 (via R1) | ∞ | ∞ | {R1} |
| 2 | Process R3 (closest) | 7 (via R3)★ | 5 | ∞ | 9 (via R3) | {R1,R3} |
| 3 | Process R2 | 7 | 5 | 8 (via R2) | 9 | {R1,R3,R2} |
| 4 | Process R4 | 7 | 5 | 8 | 9 (11 via R4, keep 9) | {R1,R3,R2,R4} |
| 5 | Process R5 | 7 | 5 | 8 | 9 | {R1,R3,R2,R4,R5} |
Key Observations:
- The direct R1→R2 link (cost 10) is never used; the indirect path R1→R3→R2 (cost 7) wins, marked ★ in the table where it replaces the earlier tentative distance.
- R4 is reached through R2 (R1→R3→R2→R4, total cost 8), and R5 through R3 (cost 9); the later 11-cost candidate via R4 is discarded.
- Once a node joins the SPT, its distance is final and never changes in later iterations.
Since every router has an identical LSDB and runs the same algorithm, every router computes consistent paths. If R1 forwards traffic for destination X via R2, and R2 forwards via R3, then along the path R1→R2→R3→...→X the remaining cost to X strictly decreases at every hop. No router ever sends a packet back toward a point with a higher remaining cost, so the path can never revisit a node. This mathematical guarantee is why link-state protocols cannot form persistent routing loops (assuming synchronized databases).
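As a small follow-up to the dijkstra_spf sketch, the helper below (illustrative, not part of OSPF itself) turns the returned distance/predecessor map into an explicit hop-by-hop path; the second element of each path is the next hop the computing router would install.

```python
def extract_path(result, source, destination):
    """Rebuild the shortest path source -> destination from the
    {node: (distance, predecessor)} mapping returned by dijkstra_spf."""
    distance, _ = result[destination]
    if distance == float('infinity'):
        return None  # destination unreachable

    path = [destination]
    while path[-1] != source:
        _, predecessor = result[path[-1]]
        path.append(predecessor)
    path.reverse()
    return path

# Using the R1-rooted result from the earlier example:
#   extract_path(result, 'R1', 'R4') -> ['R1', 'R3', 'R2', 'R4']
# The second element of each path is the next hop from R1's perspective.
```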
Running Dijkstra's algorithm is computationally expensive—especially in large networks with thousands of LSAs. OSPF implementations don't run SPF immediately on every topology change; instead, they use SPF scheduling and throttling to balance responsiveness with CPU protection.
The Problem:
In an unstable network with flapping interfaces, each state change generates new LSAs, and naive implementations would run SPF continuously—potentially consuming all CPU resources and preventing the router from forwarding traffic.
The Solution: SPF Throttling
Modern OSPF implementations use exponential backoff for SPF scheduling:
| Parameter | Purpose | Default Value |
|---|---|---|
| spf-start | Initial wait before first SPF after trigger | 5000 ms (5 seconds) |
| spf-hold | Minimum interval between consecutive SPF runs | 10000 ms (10 seconds) |
| spf-max-wait | Maximum interval between SPF runs during instability | 10000 ms (10 seconds) |
How Throttling Works:
1. The first topology change after a quiet period schedules SPF to run after spf-start milliseconds.
2. Once that SPF completes, the router waits at least spf-hold before running another SPF.
3. If changes keep arriving, the wait doubles after each run (exponential backoff) until it reaches spf-max-wait.
4. When the network quiets down, the wait falls back to the initial spf-hold value.

This approach ensures fast convergence for isolated events while capping SPF frequency, and therefore CPU load, during sustained instability.
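A conceptual sketch of that exponential backoff follows; the class and method names are hypothetical, the defaults mirror the table above (in seconds), and real implementations differ in exactly when the hold interval resets after the network stabilizes.

```python
class SPFThrottle:
    """Exponential backoff scheduling for SPF runs (conceptual sketch)."""

    def __init__(self, spf_start=5.0, spf_hold=10.0, spf_max_wait=10.0):
        self.spf_start = spf_start        # delay before the first SPF
        self.spf_hold = spf_hold          # initial gap between SPF runs
        self.spf_max_wait = spf_max_wait  # cap on the gap during instability
        self.current_hold = spf_hold

    def delay_for_first_change(self):
        # An isolated topology change: run SPF after spf-start
        return self.spf_start

    def delay_after_spf_run(self):
        # Continued changes: wait the current hold, then double it
        # (exponential backoff) up to spf-max-wait
        delay = self.current_hold
        self.current_hold = min(self.current_hold * 2, self.spf_max_wait)
        return delay

    def network_stable(self):
        # After a quiet period, fall back to the initial hold value
        self.current_hold = self.spf_hold
```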
Aggressive SPF timers (sub-second values) provide faster convergence but risk CPU exhaustion during instability. Conservative timers protect CPU but delay convergence. The right balance depends on your network's stability, hardware capabilities, and application requirements. High-frequency trading networks might use 1ms initial delay; large enterprise networks might use 5+ seconds.
Partial SPF and Incremental SPF:
Some OSPF implementations optimize further:
Partial SPF (pSPF): When only leaf routes change (e.g., a stub network goes down), a full SPF isn't needed—just update the affected routes. This is much faster than recomputing the entire tree.
Incremental SPF (iSPF): Some implementations can incrementally update the SPT when topology changes, rather than rebuilding from scratch. This is an implementation optimization, not part of the OSPF specification.
These optimizations significantly reduce CPU impact in production networks where most changes are minor.
When two OSPF routers form an adjacency, they must synchronize their LSDBs before the adjacency is considered operational. This synchronization uses a structured exchange process involving Database Description (DBD), Link-State Request (LSR), and Link-State Update (LSU) packets.
The Synchronization Process:
1. ExStart: the two routers negotiate a master/slave relationship and an initial DBD sequence number.
2. Exchange: each router sends DBD packets summarizing the headers of every LSA in its LSDB.
3. Loading: each router compares those summaries with its own LSDB and sends Link-State Requests (LSRs) for LSAs it is missing or that are newer; the neighbor replies with Link-State Updates (LSUs) carrying the full LSAs.
4. Full: both databases are synchronized and the adjacency is fully operational.
DBD Packet Flags:
Database Description packets include control flags coordinating the exchange: the I (Init) bit marks the first DBD of the exchange, the M (More) bit indicates that additional DBDs follow, and the MS (Master/Slave) bit identifies which router controls the DBD sequence numbers.
These flags enable reliable, sequenced exchange of potentially large LSDB summaries.
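The heart of the Exchange/Loading comparison (deciding which LSAs to request after seeing a neighbor's DBD summaries) can be sketched as follows; the names are illustrative and the newness test is the simplified sequence-number comparison used in the earlier sketches.

```python
def build_request_list(local_lsdb, neighbor_summaries):
    """Given our LSDB and the LSA headers a neighbor advertised in its
    DBD packets, return the keys of LSAs we must request with LSRs.

    local_lsdb: dict of (ls_type, link_state_id, adv_router) -> local header
    neighbor_summaries: iterable of LSA headers from the neighbor's DBDs
    """
    request = []
    for header in neighbor_summaries:
        ours = local_lsdb.get(header.key())
        # Request anything we don't have, or anything the neighbor has newer
        if ours is None or header.ls_sequence > ours.ls_sequence:
            request.append(header.key())
    return request
```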
DBD packets can be large when summarizing extensive LSDBs. If interfaces have mismatched MTUs, DBD packets may be fragmented and potentially dropped by firewalls or switches that don't handle fragments well. OSPF advertises the interface MTU in DBD packets and (by default on many platforms) won't form an adjacency if the MTUs differ. Neighbors stuck in the ExStart or Exchange state often indicate an MTU mismatch. Solutions include configuring matching MTUs or disabling the MTU check (not recommended).
Having explored link-state mechanics in depth, let's explicitly contrast them with distance-vector approaches to crystallize why OSPF behaves as it does.
The Information Model:
Distance-vector routers exchange computed routes: each router advertises its best distances to destinations and trusts what its neighbors report. Link-state routers exchange raw topology facts (LSAs), and each router computes its own routes from the resulting complete map.
Convergence Behavior:
The information model difference directly impacts convergence:
Link-State Convergence:
1. A router detects a link change and originates an updated LSA.
2. The LSA floods to all routers in the area, largely in parallel.
3. Each router updates its LSDB and, after the SPF delay, reruns Dijkstra.
4. New forwarding entries are installed; convergence typically completes within seconds or less.
Distance-Vector Convergence:
1. A router detects a change and updates its own routing table.
2. It advertises the new distances to its neighbors, on triggered or periodic updates.
3. Each neighbor recomputes its table and then advertises to its own neighbors, so the change propagates hop by hop.
4. Pathological cases (count-to-infinity) can take minutes to resolve.
Link-state's parallel flooding and computation is inherently faster than distance-vector's sequential propagation.
| Characteristic | Link-State (OSPF) | Distance-Vector (RIP) |
|---|---|---|
| Knowledge Scope | Complete topology | Neighbors' distances only |
| Algorithm | Dijkstra (centralized per router) | Bellman-Ford (distributed) |
| Routing Loops | Impossible with synchronized DB | Possible; mitigated with split horizon |
| Convergence Time | Seconds (sub-second possible) | Minutes (count-to-infinity) |
| Memory Requirements | High (stores entire LSDB) | Low (stores routing table only) |
| CPU Requirements | High (SPF computation) | Low (simple distance comparison) |
| Bandwidth Usage | Efficient (incremental updates) | Inefficient (periodic full updates) |
| Troubleshooting | Easier (can examine complete topology) | Harder (limited visibility) |
| Scalability | Excellent (with hierarchy) | Poor (hop count limits) |
Cisco's EIGRP uses distance-vector's information model but adds link-state-like features: triggered updates, neighbor relationships, and the DUAL algorithm for loop-free convergence. It achieves fast convergence without full topology knowledge. However, it still lacks the troubleshooting visibility advantage of true link-state protocols—you can't examine the complete topology from a single router.
We've completed a deep exploration of OSPF's link-state foundation—the mechanisms that make it a reliable, fast-converging, and provably correct routing protocol. Let's consolidate the essential knowledge:
- Link-state routers build a complete topology map (the LSDB) instead of trusting neighbors' computed routes.
- Reliable, acknowledged, scoped flooding keeps every router's LSDB identical within an area.
- Each router independently runs Dijkstra's SPF on the LSDB to build a shortest path tree rooted at itself; identical inputs plus identical computation yield loop-free, consistent forwarding.
- LSAs are versioned by sequence number, aged toward MaxAge (3600 seconds), and refreshed every 30 minutes.
- SPF throttling, partial SPF, and incremental SPF keep computation affordable during instability.
- An adjacency becomes fully operational only after complete LSDB synchronization through the DBD/LSR/LSU exchange.
What's Next:
With a solid understanding of link-state fundamentals, we'll explore OSPF's hierarchical area design—the mechanism that enables OSPF to scale to networks with thousands of routers. You'll learn how areas partition the LSDB, how Area 0 serves as the backbone, and how ABRs summarize between areas to control routing table growth.
You now understand the link-state operational model that powers OSPF: how LSDBs are built, synchronized, and used for SPF calculation. This knowledge is essential for understanding why OSPF behaves as it does and for effectively troubleshooting OSPF deployments. Next, we explore the power of hierarchical area design.