In 1956, Lester Ford Jr. and Delbert Fulkerson published a paper that would become one of the most influential works in combinatorial optimization. Their Ford-Fulkerson method provided not just an algorithm, but a way of thinking about flow problems that unified theory and practice.
The genius of Ford-Fulkerson lies in its elegant simplicity: repeatedly find a path with available capacity from source to sink, and push flow along it. The magic is in the details—specifically, in the concept of residual graphs that allow us to correct suboptimal choices made earlier.
By the end of this page, you'll understand the complete Ford-Fulkerson method: how it uses residual graphs and augmenting paths, why it's correct, when it terminates, its time complexity characteristics, and practical implementation considerations. You'll also learn about the Edmonds-Karp optimization that guarantees polynomial runtime.
The Ford-Fulkerson method is beautifully simple at its core:
Ford-Fulkerson Method:
1. Initialize flow f(u,v) = 0 for all edges
2. While there exists an augmenting path p from s to t in residual graph G_f:
a. Find the bottleneck capacity: c_f(p) = min{c_f(u,v) : (u,v) ∈ p}
b. Augment flow: For each edge (u,v) in p:
- If (u,v) is a forward edge: f(u,v) += c_f(p)
- If (u,v) is a backward edge: f(v,u) -= c_f(p)
3. Return the maximum flow f
Key Insight: The method doesn't specify how to find augmenting paths—it's a method, not an algorithm. Different path-finding strategies yield different algorithms with different performance characteristics.
| Term | Definition | Intuition |
|---|---|---|
| Augmenting path | Path from s to t in residual graph G_f | A route where we can push more flow |
| Bottleneck | Minimum residual capacity along the path | The limiting constraint for this path |
| Forward edge | Original edge with remaining capacity | We add flow in the intended direction |
| Backward edge | Represents flow that can be canceled | We 'undo' previous flow to reroute it |
| Residual capacity | c_f(u,v) = c(u,v) - f(u,v) or f(v,u) | What we can still push or cancel |
Ford-Fulkerson is a 'method' because it leaves the path-finding strategy unspecified. Using DFS to find paths is one option (the original Ford-Fulkerson). Using BFS guarantees polynomial time (Edmonds-Karp). Choosing the augmenting path with the largest bottleneck (the maximum-capacity path) gives yet another variant. The framework is constant; the path strategy varies.
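To make that concrete, here is a minimal sketch of the original DFS-based variant. The dict-of-dicts graph encoding and the names `ford_fulkerson_dfs` and `find_path` are illustrative choices, not a fixed API; swapping the depth-first search for a breadth-first one is all it takes to turn this into Edmonds-Karp.

```python
def ford_fulkerson_dfs(capacity, source, sink):
    """Ford-Fulkerson with DFS path-finding (illustrative sketch).

    capacity: dict of dicts, capacity[u][v] = capacity of edge u -> v.
    """
    # Residual graph: start with the original capacities and add zero-capacity
    # reverse entries so flow can later be canceled along backward edges.
    residual = {u: dict(nbrs) for u, nbrs in capacity.items()}
    for u in list(capacity):
        for v in capacity[u]:
            residual.setdefault(v, {}).setdefault(u, 0)

    def find_path(u, visited):
        """DFS for any path to the sink with positive residual capacity; returns edge list or None."""
        if u == sink:
            return []
        visited.add(u)
        for v, cap in residual[u].items():
            if cap > 0 and v not in visited:
                rest = find_path(v, visited)
                if rest is not None:
                    return [(u, v)] + rest
        return None

    max_flow = 0
    while True:
        path = find_path(source, set())
        if path is None:
            break                                  # no augmenting path left
        bottleneck = min(residual[u][v] for u, v in path)
        for u, v in path:
            residual[u][v] -= bottleneck           # consume forward residual capacity
            residual[v][u] += bottleneck           # create backward residual capacity
        max_flow += bottleneck
    return max_flow
```

The augmentation loop never changes between variants; only `find_path` does. The Edmonds-Karp implementation later on this page does the same job with a BFS.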
Let's trace through Ford-Fulkerson on a concrete example. Consider this network:
[A]
↗ ↘
10 10
↗ ↘
[S] 10 [T]
↘ ↗↘ ↗
10 10
↘ ↗
[B]
With edges: S→A (10), S→B (10), A→B (10), A→T (10), B→T (10).
Initial State: All flows are 0.
Iteration 1: Find the augmenting path S → A → T. Bottleneck = min(10, 10) = 10. Push 10 units along the path.
After Iteration 1: f(S,A) = 10, f(A,T) = 10; total flow = 10.
Residual graph:
[A]
↗ ↘
0 0 (forward edges saturated)
↙ ↙ (backward edges: 10 each)
[S] 10 [T]
↘ ↗↘ ↗
10 10 (still at full capacity)
↘ ↗
[B]
Iteration 2: Find the augmenting path S → B → T. Bottleneck = min(10, 10) = 10. Push 10 units along the path.
After Iteration 2: f(S,B) = 10, f(B,T) = 10; total flow = 20.
No more augmenting paths exist from S to T. All edges leaving S are saturated.
Result: Maximum flow = 20
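As a quick sanity check, the DFS sketch from earlier reproduces this value on one possible encoding of the network (the dictionary below is an assumption about how you would write the diagram down, not part of the original example):

```python
capacity = {
    'S': {'A': 10, 'B': 10},
    'A': {'B': 10, 'T': 10},
    'B': {'T': 10},
}
print(ford_fulkerson_dfs(capacity, 'S', 'T'))  # 20
```

Depending on the order in which the DFS explores neighbors, it may pick S→A→B→T first and then need the backward edge B→A in its second augmentation, which is exactly the situation discussed next.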
In this example, we didn't need backward edges. But consider if we had found S→A→B→T first (using the cross-edge A→B). That path would saturate S→A and B→T while leaving A→T completely unused, and no forward-only path from S to T would remain, leaving us stuck at a flow of 10. The backward edge B→A in the residual graph (capacity 10, because f(A,B) = 10) allows a subsequent path S→B→A→T to 'reroute' that flow directly to T and still reach the maximum of 20.
The backward edge concept can seem counterintuitive. Let's see it in action on a small network after an unlucky first path choice.
The Problem Network:
[A]
↗ ↘
3 2
↗ ↘
[S] 1 [T]
↘ ↓↑ ↗
3 2
↘ ↗
[B]
Edges: S→A (3), S→B (3), A→B (1), A→T (2), B→T (2)
Bad Path Choice (Iteration 1): S → A → B → T. Bottleneck = min(3, 1, 2) = 1, so we push 1 unit.
Current Flow State: f(S,A) = 1, f(A,B) = 1, f(B,T) = 1; every other edge carries 0; total flow = 1.
Without backward edges, that 1 unit routed through A→B would be locked in: no later augmentation could ever move it off that edge.
The maximum flow here is 4 (the two edges into T have total capacity 2 + 2 = 4), and backward edges are what guarantee that an unlucky early commitment like this can always be corrected on the way there.
Iteration 2 (with backward edges): S → B → A → T
Wait—there's no forward edge B→A! But in the residual graph, there's a backward edge B→A with capacity 1 (because f(A,B) = 1).
The backward edge let us 'cancel' the A→B flow and reroute it directly A→T. Simultaneously, we sent new flow S→B. The net effect: flow goes S→A→T AND S→B→T, achieving higher total flow than if we were stuck with our initial A→B commitment.
Completing the Example:
After iteration 2 the flow is: f(S,A) = 1, f(S,B) = 1, f(A,T) = 1, f(B,T) = 1, f(A,B) = 0; total flow = 2.
Residual capacities: S→A: 2, S→B: 2, A→B: 1, A→T: 1, B→T: 1 (plus a backward edge for every edge currently carrying flow).
Iteration 3: S → A → T. Bottleneck = min(2, 1) = 1 → total flow 3.
Iteration 4: S → B → T. Bottleneck = min(2, 1) = 1 → total flow 4.
Now A→T and B→T are both saturated, so no augmenting path from S to T remains.
Final Maximum Flow: 4, which matches the total capacity of the edges into T (2 + 2 = 4). The key point: backward edges let the algorithm undo the unlucky A→B routing instead of staying permanently committed to it.
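The same sanity check with the DFS sketch from earlier (again, the dictionary is just one encoding of the edge list above) confirms the value:

```python
capacity = {
    'S': {'A': 3, 'B': 3},
    'A': {'B': 1, 'T': 2},
    'B': {'T': 2},
}
print(ford_fulkerson_dfs(capacity, 'S', 'T'))  # 4
```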
The correctness of Ford-Fulkerson rests on a beautiful theorem connecting maximum flows to minimum cuts. We'll explore this deeply on the next page, but here's the essence.
Key Theorem (Max-Flow Min-Cut):
The maximum value of a flow from s to t equals the minimum capacity of any s-t cut.
What's an s-t Cut?
A cut (S, T) partitions the vertices into two sets: S, which contains the source s, and T, which contains the sink t.
The capacity of the cut is the sum of capacities of edges from S to T: $$c(S, T) = \sum_{u \in S, v \in T} c(u, v)$$
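As a worked example with the five-edge network from the backward-edge section above, two cuts are easy to read off:

$$c(\{S\}, \{A, B, T\}) = c(S, A) + c(S, B) = 3 + 3 = 6$$

$$c(\{S, A, B\}, \{T\}) = c(A, T) + c(B, T) = 2 + 2 = 4$$

Checking the remaining two cuts ({S, A} has capacity 3 + 1 + 2 = 6 and {S, B} has capacity 3 + 2 = 5) shows that 4 is the minimum, matching the maximum flow of 4 found earlier.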
Why Max-Flow = Min-Cut Proves Correctness:
When Ford-Fulkerson terminates (no augmenting path exists), let S be the set of vertices still reachable from s in the residual graph and T be everything else. Every original edge from S to T must be saturated (otherwise its endpoint in T would be reachable), and every edge from T to S must carry zero flow (otherwise its backward residual edge would make its tail reachable). Therefore the value of the current flow equals the capacity of the cut (S, T).
The genius: we don't prove we found the best flow directly. We prove that when no augmenting path exists, we've hit the theoretical ceiling (the minimum cut). No flow can exceed the min cut, and we've achieved it.
Proof Sketch of Flow Conservation During Augmentation:
We need to verify that after each augmentation, the resulting f is still a valid flow:
Capacity constraints: We only push flow equal to the bottleneck, which by definition is ≤ the residual capacity of each edge. So we never exceed capacity.
Conservation: Any intermediate vertex v on the augmenting path is entered by exactly one residual edge and left by exactly one. Whether each of those residual edges is a forward edge (flow added on the original edge) or a backward edge (flow canceled on the reverse original edge), the net change to v's total inflow equals the net change to its total outflow, so conservation is preserved.
For vertices not on the path: No flow changes, conservation trivially maintained.
Thus, each augmentation produces a valid flow with higher value by exactly the bottleneck amount.
The Termination Question:
Does Ford-Fulkerson always terminate? The answer depends on the capacities:
Integer Capacities: Yes, terminates. Each augmentation increases flow by at least 1. Maximum flow is bounded by sum of capacities out of s, which is finite. Maximum iterations = O(max_flow).
Rational Capacities: Yes, terminates. Scale every capacity by the least common denominator to reduce to the integer case (a small worked example follows this list).
Irrational Capacities: May NOT terminate! Ford and Fulkerson constructed pathological examples where the method oscillates forever with flow converging but never reaching the maximum.
Fortunately, the non-termination case is artificial. All real-world capacities are rational.
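A tiny worked example of that reduction (the capacities here are made up for illustration): a network whose capacities are 1/2 and 3/4 has least common denominator 4, so

$$\left\{\tfrac{1}{2},\ \tfrac{3}{4}\right\} \times 4 = \{2,\ 3\}.$$

Scaling every capacity by the same positive constant scales the maximum flow by that constant, so Ford-Fulkerson terminates on the integer network and dividing its answer by 4 recovers the maximum flow of the original.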
| Factor | Analysis | Implication |
|---|---|---|
| Path finding | O(E) with DFS/BFS | Each iteration is efficient |
| Number of iterations | O(max_flow) worst case | Can be huge if max flow is large |
| Total complexity | O(E × max_flow) | Pseudo-polynomial (depends on capacity magnitudes) |
| Practical issue | max_flow could be exponential in network size | Bad for large capacity values |
O(E × max_flow) is pseudo-polynomial—it depends on the numeric value of capacities, not just the network size. A network with 10 vertices and edges of capacity 1,000,000,000 could require a billion iterations! This motivated the search for truly polynomial algorithms.
A Worst-Case Example:
[A]
↗ ↘
1000 1000
↗ 1 ↘
[S] ↗ ↘ [T]
↘ 1 ↗
1000 1000
↘ ↗
[B]
With the A→B edge having capacity 1, suppose we unluckily alternate between the paths S→A→B→T and S→B→A→T (the second one using the backward edge B→A). Every augmenting path then crosses the capacity-1 edge, so every bottleneck is 1.
This could take 2000 iterations instead of the optimal 2 (S→A→T and S→B→T).
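The sketch below simulates that unlucky alternation directly. The path choices are forced by hand, so this is a demonstration of the arithmetic rather than a real Ford-Fulkerson run, and the edge encoding is an assumption:

```python
# Capacities from the diagram above; the paths below are chosen adversarially so that
# every augmentation crosses the capacity-1 A<->B edge and therefore has bottleneck 1.
cap = {('S', 'A'): 1000, ('S', 'B'): 1000, ('A', 'B'): 1, ('A', 'T'): 1000, ('B', 'T'): 1000}
flow = {e: 0 for e in cap}

def residual(u, v):
    """Forward residual if (u, v) is an original edge, otherwise cancelable flow on (v, u)."""
    return cap[(u, v)] - flow[(u, v)] if (u, v) in cap else flow[(v, u)]

def augment(path):
    """Push the bottleneck amount along a list of vertices, updating forward/backward flow."""
    edges = list(zip(path, path[1:]))
    bottleneck = min(residual(u, v) for u, v in edges)
    for u, v in edges:
        if (u, v) in cap:
            flow[(u, v)] += bottleneck   # forward edge: add flow
        else:
            flow[(v, u)] -= bottleneck   # backward edge: cancel flow
    return bottleneck

total, iterations = 0, 0
while True:
    # Unlucky alternation: every second augmentation undoes the A->B flow via the backward edge.
    path = ['S', 'A', 'B', 'T'] if iterations % 2 == 0 else ['S', 'B', 'A', 'T']
    if min(residual(u, v) for u, v in zip(path, path[1:])) == 0:
        break
    total += augment(path)
    iterations += 1

print(total, iterations)  # 2000 2000: two thousand augmentations of one unit each
```

A sensible choice of paths (S→A→T, then S→B→T) reaches the same flow of 2000 in just two augmentations, which is exactly the gap Edmonds-Karp closes.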
The fix: choose augmenting paths wisely, which leads us to Edmonds-Karp.
The Edmonds-Karp algorithm is Ford-Fulkerson with one crucial specification: always use BFS to find augmenting paths. This simple choice transforms pseudo-polynomial into truly polynomial.
Key Insight:
BFS finds the shortest augmenting path (in terms of number of edges). This has a remarkable property:
Theorem (Edmonds-Karp Bound): The total number of augmentations is at most O(V × E).
Why? Two key observations:
1. After each augmentation, at least one edge on the path becomes "critical": the bottleneck edge is either saturated in the forward direction or emptied in the backward direction, and it disappears from the residual graph.
2. An edge can become critical at most O(V) times. Because BFS always uses shortest augmenting paths, shortest-path distances from s in the residual graph never decrease; each time an edge becomes critical on a path of length d, the next time it becomes critical (if ever) the path length is ≥ d + 2.
Since path lengths are bounded by V, each of the E edges is critical at most O(V) times.
Total augmentations ≤ O(VE).
Edmonds-Karp runs in O(VE²) time. Each augmentation takes O(E) for BFS, and there are O(VE) augmentations. This is polynomial in the network size, independent of capacity values!
```python
from collections import deque

def edmonds_karp(graph, source, sink):
    """
    Edmonds-Karp algorithm for maximum flow.
    Uses BFS to find shortest augmenting paths.

    Args:
        graph: Dict[node, Dict[neighbor, capacity]]
        source: Source vertex
        sink: Sink vertex

    Returns:
        Tuple of (max_flow_value, flow_dict)
    """
    # Initialize flow to zero
    flow = {u: {v: 0 for v in graph.get(u, {})} for u in graph}

    def bfs_find_path():
        """Find augmenting path using BFS, return path and bottleneck."""
        parent = {source: None}
        visited = {source}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in get_residual_neighbors(u):
                if v not in visited:
                    visited.add(v)
                    parent[v] = u
                    if v == sink:
                        # Reconstruct path and find bottleneck
                        path = []
                        curr = sink
                        while curr != source:
                            prev = parent[curr]
                            path.append((prev, curr))
                            curr = prev
                        path.reverse()
                        # Find bottleneck
                        bottleneck = float('inf')
                        for u, v in path:
                            bottleneck = min(bottleneck, residual_capacity(u, v))
                        return path, bottleneck
                    queue.append(v)
        return None, 0  # No augmenting path found

    def get_residual_neighbors(u):
        """Get neighbors with positive residual capacity."""
        neighbors = []
        # Forward edges
        for v, cap in graph.get(u, {}).items():
            if cap - flow.get(u, {}).get(v, 0) > 0:
                neighbors.append(v)
        # Backward edges
        for v in graph:
            if u in graph.get(v, {}) and flow.get(v, {}).get(u, 0) > 0:
                neighbors.append(v)
        return neighbors

    def residual_capacity(u, v):
        """Get residual capacity from u to v."""
        forward = graph.get(u, {}).get(v, 0) - flow.get(u, {}).get(v, 0)
        backward = flow.get(v, {}).get(u, 0)
        return max(forward, backward)

    max_flow = 0
    while True:
        path, bottleneck = bfs_find_path()
        if path is None:
            break
        # Augment flow along path
        for u, v in path:
            if v in graph.get(u, {}):  # Forward edge
                if u not in flow:
                    flow[u] = {}
                flow[u][v] = flow.get(u, {}).get(v, 0) + bottleneck
            else:  # Backward edge
                flow[v][u] = flow.get(v, {}).get(u, 0) - bottleneck
        max_flow += bottleneck

    return max_flow, flow

# Example usage
graph = {
    'S': {'A': 10, 'B': 10},
    'A': {'B': 2, 'T': 10},
    'B': {'T': 10},
    'T': {}
}
max_flow, flow = edmonds_karp(graph, 'S', 'T')
print(f"Maximum flow: {max_flow}")  # Output: 20
```

Moving from pseudocode to working implementation requires attention to several practical details.
Instead of explicitly maintaining a separate residual graph, store capacity and flow together. The residual capacity of (u,v) is cap[u][v] - flow[u][v], and the backward residual is flow[u][v]. This halves memory usage and simplifies updates.
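In code, the implicit representation amounts to two lookups per edge; a minimal sketch, assuming `cap` and `flow` are dict-of-dicts indexed the same way:

```python
def residual_capacities(cap, flow, u, v):
    """Residual capacities of edge u -> v, computed on the fly from capacity and flow."""
    forward = cap[u][v] - flow[u][v]   # how much more can still be pushed along u -> v
    backward = flow[u][v]              # how much u -> v flow the residual edge v -> u can cancel
    return forward, backward

# An edge with capacity 10 currently carrying 7 units:
print(residual_capacities({'u': {'v': 10}}, {'u': {'v': 7}}, 'u', 'v'))  # (3, 7)
```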
Performance Comparison:
| Implementation Choice | Time Impact | Space Impact | Code Complexity |
|---|---|---|---|
| Adjacency matrix | O(V²) BFS | O(V²) | Simple |
| Adjacency list | O(V+E) BFS | O(V+E) | Moderate |
| Explicit residual | Faster queries | 2× edge storage | More complex |
| Implicit residual | Slower queries | 1× edge storage | Simpler |
For competitive programming, adjacency matrix with implicit residual is often preferred for simplicity. For production code with large graphs, adjacency list with optimized data structures wins.
```python
from collections import deque

class MaxFlow:
    """
    Optimized max flow implementation using adjacency list
    with implicit residual graph computation.
    """

    def __init__(self, n):
        """Initialize with n vertices (0-indexed)."""
        self.n = n
        self.adj = [[] for _ in range(n)]  # adj[u] = list of [v, cap, rev_idx]

    def add_edge(self, u, v, cap):
        """
        Add directed edge u -> v with capacity cap.
        Automatically adds reverse edge with capacity 0.
        """
        # Forward edge: [to, capacity, index of reverse edge]
        # Reverse edge: [to, capacity, index of forward edge]
        self.adj[u].append([v, cap, len(self.adj[v])])
        self.adj[v].append([u, 0, len(self.adj[u]) - 1])

    def bfs(self, source, sink):
        """Find shortest augmenting path, return parent array or None."""
        parent = [-1] * self.n  # parent[v] = (prev_node, edge_index)
        parent[source] = (source, -1)
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for i, (v, cap, _) in enumerate(self.adj[u]):
                if cap > 0 and parent[v] == -1:
                    parent[v] = (u, i)
                    if v == sink:
                        return parent
                    queue.append(v)
        return None

    def max_flow(self, source, sink):
        """Compute maximum flow from source to sink."""
        total_flow = 0
        while True:
            parent = self.bfs(source, sink)
            if parent is None:
                break
            # Find bottleneck
            bottleneck = float('inf')
            v = sink
            while v != source:
                u, i = parent[v]
                bottleneck = min(bottleneck, self.adj[u][i][1])
                v = u
            # Augment flow
            v = sink
            while v != source:
                u, i = parent[v]
                self.adj[u][i][1] -= bottleneck  # Decrease forward capacity
                rev_i = self.adj[u][i][2]
                self.adj[v][rev_i][1] += bottleneck  # Increase backward capacity
                v = u
            total_flow += bottleneck
        return total_flow

# Example: Maximum matching as max flow
# 3 workers, 3 jobs, with compatibility edges
mf = MaxFlow(8)  # Super-source=0, workers=1,2,3, jobs=4,5,6, super-sink=7

# Super-source to workers (capacity 1 each)
mf.add_edge(0, 1, 1)
mf.add_edge(0, 2, 1)
mf.add_edge(0, 3, 1)

# Worker-job compatibility
mf.add_edge(1, 4, 1)  # Worker 1 can do job 4
mf.add_edge(1, 5, 1)  # Worker 1 can do job 5
mf.add_edge(2, 5, 1)  # Worker 2 can do job 5
mf.add_edge(3, 5, 1)  # Worker 3 can do job 5
mf.add_edge(3, 6, 1)  # Worker 3 can do job 6

# Jobs to super-sink (capacity 1 each)
mf.add_edge(4, 7, 1)
mf.add_edge(5, 7, 1)
mf.add_edge(6, 7, 1)

print(f"Maximum matching: {mf.max_flow(0, 7)}")  # Output: 3
```

Ford-Fulkerson and Edmonds-Karp are foundational, but more sophisticated algorithms exist for better performance in specific cases.
| Algorithm | Time Complexity | Key Technique | Best Use Case |
|---|---|---|---|
| Ford-Fulkerson (DFS) | O(E × max_flow) | Any augmenting path | Small capacities |
| Edmonds-Karp (BFS) | O(VE²) | Shortest path augmentation | General purpose |
| Dinic's Algorithm | O(V²E) | Blocking flows | Dense graphs |
| Push-Relabel | O(V²E) or O(V³) | Local operations | Very dense graphs |
| Dinic on unit capacity | O(E√V) | Special structure | Bipartite matching |
Dinic's Algorithm:
Dinic improves on Edmonds-Karp by finding ALL shortest paths of a given length before moving to longer paths. It constructs a "level graph" using BFS, then finds a blocking flow in it using DFS. This achieves O(V²E) in general and O(E√V) on the unit-capacity networks that arise from bipartite matching.
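For reference, here is a compact sketch of Dinic in the same style as the adjacency-list implementation above (a standard textbook formulation; the class and method names are illustrative):

```python
from collections import deque

class Dinic:
    """Compact sketch of Dinic's algorithm: BFS level graph + DFS blocking flow."""

    def __init__(self, n):
        self.n = n
        self.adj = [[] for _ in range(n)]  # adj[u] = list of [v, residual capacity, index of reverse edge]

    def add_edge(self, u, v, cap):
        self.adj[u].append([v, cap, len(self.adj[v])])
        self.adj[v].append([u, 0, len(self.adj[u]) - 1])

    def _bfs_levels(self, s, t):
        """Build the level graph: level[v] = BFS distance from s in the residual graph."""
        self.level = [-1] * self.n
        self.level[s] = 0
        queue = deque([s])
        while queue:
            u = queue.popleft()
            for v, cap, _ in self.adj[u]:
                if cap > 0 and self.level[v] == -1:
                    self.level[v] = self.level[u] + 1
                    queue.append(v)
        return self.level[t] != -1

    def _dfs_block(self, u, t, pushed):
        """Push flow along edges that go exactly one level deeper (part of a blocking flow)."""
        if u == t:
            return pushed
        while self.it[u] < len(self.adj[u]):
            v, cap, rev = self.adj[u][self.it[u]]
            if cap > 0 and self.level[v] == self.level[u] + 1:
                d = self._dfs_block(v, t, min(pushed, cap))
                if d > 0:
                    self.adj[u][self.it[u]][1] -= d   # use forward residual capacity
                    self.adj[v][rev][1] += d          # grow backward residual capacity
                    return d
            self.it[u] += 1                           # this edge is exhausted for this phase
        return 0

    def max_flow(self, s, t):
        flow = 0
        while self._bfs_levels(s, t):        # one phase per level graph
            self.it = [0] * self.n           # per-vertex edge pointers for the blocking flow
            while True:
                pushed = self._dfs_block(s, t, float('inf'))
                if pushed == 0:
                    break
                flow += pushed
        return flow

# The worst-case network from earlier: 0 = S, 1 = A, 2 = B, 3 = T.
d = Dinic(4)
d.add_edge(0, 1, 1000); d.add_edge(0, 2, 1000)
d.add_edge(1, 2, 1); d.add_edge(1, 3, 1000); d.add_edge(2, 3, 1000)
print(d.max_flow(0, 3))  # 2000
```

On the worst-case network from earlier, a single phase of blocking flow already finds all 2000 units, which is why Dinic is far less sensitive to capacity values.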
Push-Relabel:
Instead of finding paths globally, push-relabel works locally: vertices have heights, and flow is "pushed" downhill. When no downhill neighbor exists, the vertex is "relabeled" (raised). This paradigm avoids the overhead of repeated BFS/DFS and excels on dense graphs.
Practical Considerations:
Start with Edmonds-Karp. If performance is insufficient, try Dinic. For bipartite matching or unit capacities, Dinic's O(E√V) is hard to beat. Push-relabel is most practical when you have a high-quality library implementation.
We've explored the Ford-Fulkerson method comprehensively. Let's consolidate the key insights:
- Residual graphs track both remaining forward capacity and cancelable flow (backward edges).
- An augmenting path is any s→t path in the residual graph; pushing its bottleneck yields a valid flow of strictly greater value.
- When no augmenting path exists, the flow value equals the capacity of some cut, so the flow is maximum (Max-Flow Min-Cut).
- With integer (or rational) capacities the method terminates, but plain Ford-Fulkerson is pseudo-polynomial: O(E × max_flow).
- Edmonds-Karp (Ford-Fulkerson with BFS) guarantees O(VE²), polynomial in the network size regardless of capacity values.
What's Next:
We've mentioned the Max-Flow Min-Cut Theorem several times—it's time to explore it deeply. This beautiful duality between flows and cuts is one of the most important results in combinatorial optimization, with applications far beyond network flow. Understanding this theorem provides intuition for why flow algorithms work and enables recognizing new problems as flow problems.
You now understand the mechanics of finding maximum flow: the augmenting path framework, the role of residual graphs, correctness arguments, complexity analysis, and practical implementation. The Ford-Fulkerson method is your algorithmic workhorse for flow problems.