Imagine you're building a navigation system for a city with hundreds of intersections. A user might ask for directions between any two locations—not just from a single starting point, but from any point A to any point B in the entire network. They might also want to compare multiple routes, find detour options, or understand which parts of the city are closest to each other.
You could run Dijkstra's algorithm from each starting point, but as the city grows, this approach becomes increasingly expensive. What if there were an algorithm specifically designed to answer every possible shortest path query in one unified computation?
This is precisely what the All-Pairs Shortest Path (APSP) problem addresses, and the Floyd-Warshall algorithm provides an elegant, powerful solution through dynamic programming.
By the end of this page, you will understand exactly what the all-pairs shortest path problem is, why it differs fundamentally from single-source approaches, the scenarios that demand complete distance information, and the mathematical formulation that makes Floyd-Warshall so elegant. You'll see why this algorithm is a cornerstone of graph theory.
Before diving into Floyd-Warshall, we must precisely understand what we're solving. The All-Pairs Shortest Path (APSP) problem asks:
Given a weighted directed graph G = (V, E), compute the shortest path distance d(u, v) for every pair of vertices u, v ∈ V.
This is a fundamentally different question than what Dijkstra's or Bellman-Ford answer. Those algorithms solve the Single-Source Shortest Path (SSSP) problem—finding shortest paths from one source to all other vertices. APSP requires us to answer the question for every possible source.
| Aspect | SSSP (Dijkstra/Bellman-Ford) | APSP (Floyd-Warshall) |
|---|---|---|
| Question Answered | Shortest paths from ONE source to all vertices | Shortest paths between EVERY pair of vertices |
| Output Size | O(V) distances | O(V²) distances (a matrix) |
| Input Focus | Optimized for a single starting point | Treats all vertices equally as potential sources |
| Typical Use Case | GPS from current location | Precomputed distance tables, network analysis |
| Repeated Queries | Must re-run for each new source | All answers precomputed |
The Output: A Distance Matrix
The result of solving APSP is a distance matrix D where D[i][j] represents the shortest path distance from vertex i to vertex j. For a graph with V vertices, this is a V × V matrix containing V² entries.
Consider a small example with 4 cities: A, B, C, D. The distance matrix might look like:
| From \ To | A | B | C | D |
|---|---|---|---|---|
| A | 0 | 3 | 5 | 4 |
| B | ∞ | 0 | 2 | 7 |
| C | ∞ | 1 | 0 | 8 |
| D | ∞ | 5 | 4 | 0 |

(An ∞ entry means no path exists from the row vertex to the column vertex.)
Each cell answers a different query: "What is the shortest distance from row vertex to column vertex?" Having this entire matrix precomputed means any shortest path query can be answered in O(1) time—just look up the value in the matrix.
The key insight is that computing the entire matrix ONCE allows answering unlimited queries INSTANTLY. For applications with many distance queries on a relatively static graph, the upfront cost of APSP pays for itself many times over. This is the classic trade-off: more computation upfront for faster responses later.
You might wonder: "Why not just run Dijkstra's algorithm V times, once from each vertex?" This is a valid approach called running SSSP from all sources. However, there are specific scenarios where a dedicated APSP algorithm like Floyd-Warshall is more appropriate or even necessary; the applications below illustrate why.
Real-World Applications
Let's examine concrete scenarios where APSP is essential:
1. Facility Location Problems
Choosing where to build warehouses, hospitals, or fire stations requires knowing distances from every potential location to every customer or emergency site. You need the complete distance matrix to optimize placement.
2. Graph Diameter and Radius
The diameter of a graph is the longest shortest path between any two vertices. The radius is the minimum eccentricity, where a vertex's eccentricity is its longest shortest path to any other vertex. Computing these requires knowing all pairwise distances—you can't find the maximum without examining all values.
3. Betweenness Centrality
This important network metric measures how often a vertex lies on shortest paths between other vertices. Calculating it requires counting shortest paths between all pairs—a classic APSP application.
4. Traffic and Logistics Optimization
Shipping companies need to know distances between all warehouses, all distribution centers, and all delivery zones. Precomputing the complete matrix allows real-time route optimization.
5. Social Network Analysis
Measuring "degrees of separation" between all users, identifying influential nodes, or detecting communities often requires complete distance information.
Ask yourself: Do I need distances from one source, or from many/all sources? If the answer is 'all' or 'most,' and especially if the graph has negative edges or is dense, Floyd-Warshall becomes a strong candidate.
The beauty of Floyd-Warshall lies in its elegant mathematical formulation. To understand the algorithm, we must first understand the intermediate vertex concept.
Definition: Intermediate Vertices
Consider a path from vertex i to vertex j. The intermediate vertices of this path are all vertices on the path except i and j themselves. For example, in the path i → a → b → c → j, the intermediate vertices are {a, b, c}.
Floyd-Warshall builds shortest paths by gradually considering more vertices as potential intermediate vertices.
Let d(k)[i][j] denote the length of the shortest path from i to j that uses only vertices {1, 2, ..., k} as intermediate vertices. Floyd-Warshall computes d(0) (direct edges only), then d(1), then d(2), ..., up to d(V), which gives the final answer with all vertices available as intermediates.
The Recurrence Relation
The algorithm is based on a simple but powerful observation. When we allow vertex k to be used as an intermediate vertex, the shortest path from i to j either:
Does NOT use k — The path remains the same as before: d(k)[i][j] = d(k-1)[i][j]
DOES use k — The path goes through k, so it consists of a path from i to k, then from k to j: d(k)[i][j] = d(k-1)[i][k] + d(k-1)[k][j]
The shortest path is the minimum of these two options:
d(k)[i][j] = min( d(k-1)[i][j], d(k-1)[i][k] + d(k-1)[k][j] )
This is the heart of Floyd-Warshall—a classic dynamic programming recurrence.
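The recurrence translates almost line for line into code. Here is a minimal Python sketch that computes each layer d(k) explicitly (the function name is ours; a space-optimized in-place version is possible, but this form mirrors the recurrence directly):

```python
INF = float("inf")

def floyd_warshall_layered(d0):
    """Compute d(1), d(2), ..., d(V) layer by layer, mirroring the recurrence.
    d0 is the base case: d0[i][j] = weight of edge i -> j, 0 on the diagonal,
    INF where no edge exists. Each layer allows one more intermediate vertex."""
    n = len(d0)
    d = [row[:] for row in d0]  # d(0): direct edges only
    for k in range(n):          # allow vertex k as an intermediate
        d = [
            [min(d[i][j], d[i][k] + d[k][j]) for j in range(n)]
            for i in range(n)
        ]                       # d(k) replaces d(k-1)
    return d

# Tiny example: going through vertex 1 beats the direct edge 0 -> 2.
d0 = [[0, 1, 10],
      [INF, 0, 2],
      [INF, INF, 0]]
print(floyd_warshall_layered(d0)[0][2])  # 0 -> 1 -> 2 costs 3: prints 3
```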
FLOYD-WARSHALL RECURRENCE

Base Case (k = 0):
    d(0)[i][j] = weight(i, j)   if an edge i → j exists
               = 0              if i = j
               = ∞              otherwise

Recursive Case:
    d(k)[i][j] = min( d(k-1)[i][j],                  ← don't use k
                      d(k-1)[i][k] + d(k-1)[k][j] )  ← use k

Final Answer:
    shortest path from i to j = d(V)[i][j]
    (all vertices available as intermediates)

Why This Works: The Optimal Substructure
This recurrence works because shortest paths exhibit optimal substructure—a subpath of a shortest path is itself a shortest path.
If the shortest path from i to j goes through k, then the portion from i to k must itself be a shortest path from i to k, and the portion from k to j must itself be a shortest path from k to j. If either portion weren't optimal, we could replace it with a shorter path, contradicting that the original path was shortest. This property is what makes dynamic programming applicable.
Order of Computation
The recurrence requires d(k-1) values to compute d(k) values. This means we must process intermediate vertices in order: first consider vertex 1, then vertices {1, 2}, then {1, 2, 3}, and so on. Each phase builds on the previous phase's results.
To truly internalize Floyd-Warshall, let's trace through a concrete example. Consider a small directed graph with 4 vertices and the following edge weights:
Graph Structure (4 vertices, directed):

    1 → 2 (weight 2)
    1 → 4 (weight 5)
    1 → 3 (weight 8)
    2 → 3 (weight 1)
    2 → 4 (weight 3)
    4 → 3 (weight 1)

We want shortest paths between ALL pairs.

Step 0: Initial Distance Matrix (k = 0)
We start with direct edges only—no intermediate vertices allowed. If there's no direct edge, the distance is ∞.
| From \ To | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| 1 | 0 | 2 | 8 | 5 |
| 2 | ∞ | 0 | 1 | 3 |
| 3 | ∞ | ∞ | 0 | ∞ |
| 4 | ∞ | ∞ | 1 | 0 |
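The base-case matrix above can be built mechanically from the edge list. A small sketch (vertices 1..4 are stored at 0-based indices):

```python
INF = float("inf")

# Edges of the example graph as (from, to, weight) triples
edges = [(1, 2, 2), (1, 4, 5), (1, 3, 8), (2, 3, 1), (2, 4, 3), (4, 3, 1)]
n = 4

# Start with 0 on the diagonal and INF everywhere else, then fill in edges
d0 = [[0 if i == j else INF for j in range(n)] for i in range(n)]
for u, v, w in edges:
    d0[u - 1][v - 1] = w

print(d0[0])  # row for vertex 1: prints [0, 2, 8, 5]
```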
Step 1: Allow vertex 1 as intermediate (k = 1)
For each pair (i, j), we check: Is going through vertex 1 shorter than the current path?
d(1)[i][j] = min(d(0)[i][j], d(0)[i][1] + d(0)[1][j])
Since no vertex has an edge TO vertex 1, no path can pass through it, and using vertex 1 as an intermediate shortens nothing. The matrix stays the same.
Step 2: Allow vertices {1, 2} as intermediates (k = 2)
Now vertex 2 can be an intermediate. Checking each pair, the only improvement is d[1][3] = min(8, d[1][2] + d[2][3]) = min(8, 2 + 1) = 3. (The detour 1 → 2 → 4 costs 2 + 3 = 5, which only ties the direct edge.)
| From \ To | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| 1 | 0 | 2 | 3 (via 2) | 5 |
| 2 | ∞ | 0 | 1 | 3 |
| 3 | ∞ | ∞ | 0 | ∞ |
| 4 | ∞ | ∞ | 1 | 0 |
Step 3: Allow vertices {1, 2, 3} as intermediates (k = 3)
Vertex 3 becomes available. But vertex 3 has no outgoing edges, so no path can pass through it. No changes in this phase.
Step 4: Allow all vertices {1, 2, 3, 4} as intermediates (k = 4)
Vertex 4 becomes available. The candidate detours 1 → 4 → 3 (5 + 1 = 6) and 2 → 4 → 3 (3 + 1 = 4) are both worse than the current values (3 and 1). No changes. We're done!
| From \ To | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| 1 | 0 | 2 | 3 | 5 |
| 2 | ∞ | 0 | 1 | 3 |
| 3 | ∞ | ∞ | 0 | ∞ |
| 4 | ∞ | ∞ | 1 | 0 |
Every entry now represents the shortest path between that pair of vertices, using any intermediate vertices as needed. The key insight: we only improved d[1][3] from 8 to 3 by discovering the path 1 → 2 → 3 in phase k=2.
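The whole trace can be checked in a few lines of Python. This compact sketch uses the standard triple-loop form of the algorithm on the example graph (vertices 1..4 stored at 0-based indices):

```python
INF = float("inf")

def floyd_warshall(d0):
    """Return the all-pairs shortest-path matrix for direct-edge matrix d0."""
    n = len(d0)
    dist = [row[:] for row in d0]
    for k in range(n):              # phase k: vertex k becomes available
        for i in range(n):
            for j in range(n):
                if dist[i][k] + dist[k][j] < dist[i][j]:
                    dist[i][j] = dist[i][k] + dist[k][j]
    return dist

# Direct-edge matrix from Step 0 of the trace
d0 = [
    [0,   2,   8,   5],
    [INF, 0,   1,   3],
    [INF, INF, 0,   INF],
    [INF, INF, 1,   0],
]
result = floyd_warshall(d0)
print(result[0])  # row for vertex 1: prints [0, 2, 3, 5] (d[1][3] improved from 8 to 3)
```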
Floyd-Warshall isn't the only way to solve APSP. Let's compare the main approaches to understand when Floyd-Warshall is the right choice:
| Approach | Time Complexity | Space | Negative Edges | Best For |
|---|---|---|---|---|
| Dijkstra × V (binary heap) | O(V(V + E) log V) | O(V) | ❌ No | Sparse graphs, non-negative edges |
| Dijkstra × V (Fibonacci heap) | O(V² log V + VE) | O(V) | ❌ No | Sparse graphs, optimal SSSP |
| Bellman-Ford × V | O(V²E) | O(V) | ✅ Yes | Negative edges, sparse graphs |
| Floyd-Warshall | O(V³) | O(V²) | ✅ Yes | Dense graphs, simplicity, negative edges |
| Johnson's Algorithm | O(V² log V + VE) | O(V²) | ✅ Yes | Sparse graphs with negative edges |
Breaking Down the Comparison:
For sparse graphs (E ≈ V or E ≈ V log V): running Dijkstra from every vertex (or Johnson's algorithm when negative edges are present) beats Floyd-Warshall, since V·E is far smaller than V³.

For dense graphs (E ≈ V²): repeated Dijkstra degrades to roughly O(V³ log V) and Bellman-Ford × V to O(V⁴), so Floyd-Warshall's O(V³) with its small constant factor wins.

For moderate density: the asymptotic gap narrows, and implementation simplicity, constant factors, and the need to handle negative edges usually decide the choice.
Floyd-Warshall is perhaps the simplest algorithm among these options—just three nested loops with a one-line update. For graphs up to a few hundred vertices, this simplicity often makes it the pragmatic choice regardless of theoretical complexity differences. No priority queues, no complex data structures, just a matrix.
The V × V distance matrix that Floyd-Warshall produces is more than just an answer table—it's a complete metric space representation of the graph. This matrix has remarkable properties that enable powerful analyses:
Using the Distance Matrix
Once computed, the distance matrix enables O(1) answers to many questions:
FINDING GRAPH METRICS FROM DISTANCE MATRIX D:
1. Shortest path i to j: D[i][j] // Direct lookup
2. Eccentricity of i: max{ D[i][j] : j ∈ V } // Max of row i
3. Radius of graph: min{ eccentricity(i) } // Min eccentricity
4. Diameter of graph: max{ eccentricity(i) } // Max eccentricity
5. Center of graph: { i : eccentricity(i) = radius }
6. Reachability i → j: D[i][j] < ∞ // Finite means reachable
The distance matrix transforms graph analysis from repeated algorithm runs into simple matrix operations.
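These metric computations are one-liners once D is in hand. A sketch (the function name is ours; it assumes a strongly connected graph, i.e., all entries of D are finite, since otherwise every eccentricity would be infinite):

```python
def graph_metrics(D):
    """Derive eccentricities, radius, diameter, and center from a
    precomputed all-pairs distance matrix D (all entries finite)."""
    ecc = [max(row) for row in D]  # eccentricity of i = max of row i
    return {
        "eccentricity": ecc,
        "radius": min(ecc),
        "diameter": max(ecc),
        "center": [i for i, e in enumerate(ecc) if e == min(ecc)],
    }

# Distance matrix of a directed 3-cycle: every vertex has eccentricity 2
D = [[0, 1, 2],
     [2, 0, 1],
     [1, 2, 0]]
print(graph_metrics(D))
```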
By replacing min with logical OR and + with logical AND, Floyd-Warshall computes the transitive closure—a boolean matrix where T[i][j] = true if and only if vertex j is reachable from vertex i. Same algorithm, different semiring!
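The semiring swap is a two-character change to the loop body. A sketch of transitive closure over booleans (min becomes OR, + becomes AND):

```python
def transitive_closure(adj):
    """Floyd-Warshall over the boolean semiring.
    adj[i][j] is True iff there is an edge i -> j; every vertex
    is considered reachable from itself."""
    n = len(adj)
    reach = [[adj[i][j] or i == j for j in range(n)] for i in range(n)]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                # OR replaces min, AND replaces +
                reach[i][j] = reach[i][j] or (reach[i][k] and reach[k][j])
    return reach

# Chain 0 -> 1 -> 2: vertex 0 reaches 2 only through 1
adj = [[False, True, False],
       [False, False, True],
       [False, False, False]]
print(transitive_closure(adj)[0][2])  # prints True
```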
We've established a thorough understanding of what the All-Pairs Shortest Path problem is and why it matters. The key insights: APSP asks for shortest distances between every pair of vertices; its output is a V × V distance matrix that answers any query in O(1); and Floyd-Warshall solves it in O(V³) via a dynamic programming recurrence over allowed intermediate vertices.
What's Next:
Now that we understand what the all-pairs problem is and why Floyd-Warshall is a compelling solution, we'll dive into how the algorithm works in detail. The next page explores the dynamic programming approach—how we transform the recurrence into an efficient iterative algorithm and why we can optimize space by updating the matrix in-place.
You now understand the All-Pairs Shortest Path problem, its applications, and the mathematical foundation of Floyd-Warshall. Next, we'll see the dynamic programming machinery that makes this algorithm work.