Imagine you're climbing a staircase with 50 steps. At each step, you can choose to climb 1 or 2 steps at a time. How many unique ways can you reach the top?
A naive approach would enumerate every possible combination—but with 50 steps there are over 20 billion distinct paths, far too many to check one by one. Yet with Dynamic Programming, you can solve this in microseconds.
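As a preview, here is a minimal sketch of the DP solution to the staircase puzzle (the function name `count_ways` is our own illustration): the number of ways to reach step n is the sum of the ways to reach steps n-1 and n-2.

```python
def count_ways(n):
    """Count distinct ways to climb n steps, taking 1 or 2 steps at a time."""
    if n <= 1:
        return 1
    prev, curr = 1, 1  # ways to reach steps 0 and 1
    for _ in range(2, n + 1):
        prev, curr = curr, prev + curr  # ways(i) = ways(i-1) + ways(i-2)
    return curr

print(count_ways(50))  # 20365011074: over 20 billion ways, computed instantly
```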
This is the power of DP: transforming seemingly intractable problems into elegantly solvable ones. Dynamic Programming isn't just an algorithm—it's a paradigm for thinking about optimization. It's the reason modern computing can solve problems that would otherwise require astronomical computation time.
By the end of this page, you will understand what makes Dynamic Programming an optimization technique, when it applies, and why it's fundamentally different from both brute force enumeration and greedy algorithms. You'll see DP as the bridge between naive recursion and optimal computation.
Dynamic Programming is a method for solving complex problems by breaking them down into simpler, overlapping subproblems, solving each subproblem only once, and storing the solutions for future use. The term was coined by mathematician Richard Bellman in the 1950s, and despite its mathematical origins, DP has become one of the most practical and widely-used algorithmic techniques in software engineering.
The formal definition:
Dynamic Programming is an algorithmic paradigm that solves a given complex problem by breaking it into subproblems and storing the results of subproblems to avoid computing the same results again.
But this definition, while accurate, doesn't capture the essence of DP. Let's build a deeper understanding.
Bellman chose the name partly because 'dynamic' sounded impressive and partly because it suggested the multi-stage, sequential nature of the decision process. The 'programming' refers to planning and scheduling (like in 'linear programming'), not computer programming. The name stuck, even though it doesn't directly describe what the technique does.
The three pillars of Dynamic Programming:
Optimal Substructure — An optimal solution to the problem contains optimal solutions to its subproblems. If you can build the best overall solution by combining the best solutions to smaller pieces, the problem has optimal substructure.
Overlapping Subproblems — The same subproblems are encountered multiple times when solving the main problem. Unlike divide-and-conquer where subproblems are independent, DP subproblems recur, creating redundant computation that can be eliminated.
Memoization or Tabulation — The mechanism by which we avoid recomputation. Either cache results as we compute them (memoization, top-down) or build up solutions iteratively from base cases (tabulation, bottom-up).
When all three conditions are present, Dynamic Programming transforms exponential-time algorithms into polynomial-time solutions—often the difference between 'impossible' and 'instantaneous'.
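To make the third pillar concrete, here is a minimal sketch of both mechanisms applied to Fibonacci (the names `fib_memo` and `fib_tab` are ours):

```python
from functools import lru_cache

# Memoization (top-down): recurse as usual, but cache each result.
@lru_cache(maxsize=None)
def fib_memo(n):
    if n <= 1:
        return n
    return fib_memo(n - 1) + fib_memo(n - 2)

# Tabulation (bottom-up): build a table iteratively from the base cases.
def fib_tab(n):
    if n <= 1:
        return n
    table = [0, 1] + [0] * (n - 1)
    for i in range(2, n + 1):
        table[i] = table[i - 1] + table[i - 2]
    return table[n]

assert fib_memo(50) == fib_tab(50) == 12586269025
```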
Many problems in computing ask us to find the best solution among many possibilities: the shortest route between two points, the most valuable set of items that fits in a backpack, the fewest edits to transform one string into another.
These are optimization problems—we're not just finding a solution, we're finding the optimal one. And this is precisely where Dynamic Programming shines.
The optimization principle:
DP leverages a profound insight: if we knew the optimal solutions to all smaller subproblems, we could combine them to find the optimal solution to the larger problem. This principle—Bellman's Principle of Optimality—is the theoretical foundation of DP.
An optimal policy has the property that whatever the initial state and initial decision are, the remaining decisions must constitute an optimal policy with regard to the state resulting from the first decision. In simpler terms: if you're on the optimal path, every remaining segment of that path is also optimal for its starting point.
Consider the shortest path problem:
If the shortest path from A to C goes through B, then the A-to-B portion must be the shortest path from A to B, and the B-to-C portion must be the shortest path from B to C.
If either portion weren't optimal, we could substitute a better sub-path and improve the overall path—contradicting our assumption that the A-to-C path was shortest.
This seemingly simple observation is revolutionary. It means we can solve each subproblem once, record its optimal answer, and combine those stored answers into the optimal solution of the whole problem.
No need to enumerate all possibilities. No need for exhaustive search. The optimal structure of the problem guarantees that building from optimal subparts yields an optimal whole.
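Here is a minimal sketch of this idea for shortest paths in a small directed acyclic graph (the graph itself is a made-up example): each node's distance is computed once, from the already-optimal distances of its predecessors.

```python
import math

# Nodes listed in topological order; edges_into maps v -> list of (u, weight).
topo_order = ["A", "B", "C", "D"]
edges_into = {
    "A": [],
    "B": [("A", 1)],
    "C": [("A", 4), ("B", 2)],
    "D": [("B", 6), ("C", 1)],
}

# DP recurrence: dist[v] = min over edges (u, v) of dist[u] + w(u, v)
dist = {v: math.inf for v in topo_order}
dist["A"] = 0  # Base case: the start node
for v in topo_order:
    for u, w in edges_into[v]:
        dist[v] = min(dist[v], dist[u] + w)

print(dist["D"])  # 4: the path A -> B -> C -> D, built from optimal subpaths
```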
| Problem Type | Optimization Goal | DP Applicability |
|---|---|---|
| Shortest Path | Minimize total distance/cost | ✅ Optimal subpaths form optimal paths |
| Knapsack | Maximize value within weight limit | ✅ Optimal packing at smaller capacities builds the optimal packing at larger ones |
| Edit Distance | Minimize operations to transform string | ✅ Optimal edits for prefixes compose to optimal for full strings |
| Longest Increasing Subsequence | Maximize subsequence length | ✅ Optimal LIS ending at each position builds optimal overall |
| Matrix Chain Multiplication | Minimize scalar multiplications | ✅ Optimal parenthesization of sub-chains yields optimal overall |
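To make one table row concrete, here is a minimal sketch of the Longest Increasing Subsequence recurrence (the function name is our own): the optimal LIS ending at each position is built from optimal answers at earlier positions.

```python
def longest_increasing_subsequence(nums):
    """LIS via DP: best[i] = length of the longest increasing subsequence ending at index i."""
    if not nums:
        return 0
    best = [1] * len(nums)
    for i in range(1, len(nums)):
        for j in range(i):
            if nums[j] < nums[i]:
                best[i] = max(best[i], best[j] + 1)
    return max(best)

print(longest_increasing_subsequence([10, 9, 2, 5, 3, 7, 101, 18]))  # 4, e.g. 2, 3, 7, 18
```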
Brute force enumeration is the most straightforward approach to optimization: generate every possible solution, evaluate each one, and pick the best. It's conceptually simple but computationally catastrophic for most problems.
Consider computing the nth Fibonacci number, defined by F(n) = F(n-1) + F(n-2) with F(0) = 0 and F(1) = 1:
A naive recursive implementation directly mirrors this definition. But watch what happens as n grows:
```python
def fib_naive(n):
    """
    Naive recursive Fibonacci - O(2^n) time complexity.
    This is brute force: we recompute the same values
    exponentially many times.
    """
    if n <= 1:
        return n
    return fib_naive(n - 1) + fib_naive(n - 2)

# For n = 40, this makes over 300 million recursive calls
# For n = 50, it would take hours
# For n = 100, it would take millions of years
```

The visualization of redundant computation:
When computing fib(5), the call tree looks like this:
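```
fib(5)
├── fib(4)
│   ├── fib(3)
│   │   ├── fib(2)
│   │   │   ├── fib(1)
│   │   │   └── fib(0)
│   │   └── fib(1)
│   └── fib(2)
│       ├── fib(1)
│       └── fib(0)
└── fib(3)
    ├── fib(2)
    │   ├── fib(1)
    │   └── fib(0)
    └── fib(1)
```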
Notice how fib(3) is computed twice and fib(2) three times. For larger n, this redundancy explodes exponentially: fib(40) computes fib(2) over 63 million times.
This is the key insight DP exploits: The subproblems overlap massively. If we simply remember each value after computing it once, we eliminate all redundant work.
Exponential algorithms don't just 'slow down' as input grows—they become fundamentally impossible. O(2ⁿ) with n = 100 means roughly 10³⁰ operations; at a billion operations per second, that is thousands of times the age of the universe. This isn't a hardware limitation; it's a mathematical barrier that no computer, no matter how powerful, can overcome. DP is often the only way to cross this barrier.
Both Dynamic Programming and Greedy algorithms are used to solve optimization problems. Both require optimal substructure. So what's the difference?
Greedy algorithms make locally optimal choices at each step, hoping they lead to a globally optimal solution. They never reconsider decisions once made.
Dynamic Programming considers all possible choices at each step, uses the stored solutions to subproblems to evaluate each option, and makes the globally optimal choice by comparing all alternatives.
The fundamental difference:
Greedy: Makes one choice per step based on local information. Fast, but only works when local optimality guarantees global optimality.
DP: Explores all choices per step, using stored subproblem solutions to efficiently evaluate each. Slower than greedy (when greedy works), but guaranteed to find the optimal solution.
Consider making change for amount 6 with coins [1, 3, 4]:

```
amount = 6, coins = [1, 3, 4]
Minimum coins needed = 2 (using 3 + 3)
```

Greedy approach (largest coin first): take 4 (2 remaining), then 1 (1 remaining), then 1 again. Total: 3 coins.

DP approach (consider all options): for every amount from 1 to 6, try each coin and keep the cheapest result; at amount 6 this compares starting with 4, 3, or 1 and discovers that 3 + 3 needs only 2 coins.
The greedy choice of 4 seemed optimal locally but led to a suboptimal global solution. DP avoids this trap by considering all options.
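A minimal sketch of that greedy strategy, for comparison (the name `greedy_coins` is our own; its DP counterpart `min_coins` appears later on this page):

```python
def greedy_coins(amount, coins):
    """Greedy change-making: always take the largest coin that still fits."""
    count = 0
    for coin in sorted(coins, reverse=True):
        while coin <= amount:
            amount -= coin
            count += 1
    return count if amount == 0 else -1  # -1 if greedy gets stuck

print(greedy_coins(6, [1, 3, 4]))  # 3 coins (4 + 1 + 1), not the optimal 2
```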
When to use which:
| Criterion | Greedy | Dynamic Programming |
|---|---|---|
| Greedy Choice Property | Required — local choice must be globally safe | Not required — considers all choices |
| Subproblem Overlap | Irrelevant — doesn't revisit subproblems | Essential — exploits overlap for efficiency |
| Time Complexity | Often O(n) or O(n log n) | Often O(n²) or O(n × capacity) |
| Correctness Guarantee | Must be proven per problem | Always finds optimal if applicable |
| Examples | Activity Selection, Huffman Coding | Knapsack, Edit Distance, Matrix Chain |
The key insight: Every problem solvable by greedy is also solvable by DP (DP could explore all options and arrive at the same answer). But many DP problems are NOT solvable by greedy—the greedy choice may not be globally optimal.
Think of optimization approaches on a spectrum: Brute Force (explore everything naively) → Dynamic Programming (explore everything smartly with memoization) → Greedy (explore nothing, trust local choices). DP sits in the sweet spot—efficient enough to be practical, thorough enough to guarantee optimality.
How does DP actually achieve optimization? The mechanism is elegant: we express the optimal solution to a problem in terms of optimal solutions to smaller subproblems, then build up from the smallest (base) cases.
This is formalized through a recurrence relation — a mathematical equation that defines the solution to a problem in terms of solutions to smaller instances of the same problem.
General form of a DP recurrence:
OPT(problem) = optimize_over_all_choices(cost_of_choice + OPT(resulting_subproblem))
Where optimize_over_all_choices is typically min or max depending on whether we're minimizing or maximizing.
```python
# Example 1: Minimum coins to make amount
# OPT(amount) = min over all coins c of (1 + OPT(amount - c))
# where OPT(0) = 0 (base case)

def min_coins(amount, coins):
    """
    DP recurrence for minimum coin change.
    For each amount, we try every coin and pick the option
    that minimizes total coins used.
    """
    dp = [float('inf')] * (amount + 1)
    dp[0] = 0  # Base case: 0 coins needed for amount 0

    for amt in range(1, amount + 1):
        for coin in coins:
            if coin <= amt and dp[amt - coin] + 1 < dp[amt]:
                dp[amt] = dp[amt - coin] + 1

    return dp[amount] if dp[amount] != float('inf') else -1


# Example 2: Maximum value in 0/1 Knapsack
# OPT(i, w) = max(
#     OPT(i-1, w),                          # Don't take item i
#     value[i] + OPT(i-1, w - weight[i])    # Take item i
# )
# Base case: OPT(0, w) = 0 for all w

def knapsack(values, weights, capacity):
    """
    DP recurrence for 0/1 knapsack.
    For each item, we decide: take it or leave it.
    We pick the choice that maximizes total value.
    """
    n = len(values)
    dp = [[0] * (capacity + 1) for _ in range(n + 1)]

    for i in range(1, n + 1):
        for w in range(capacity + 1):
            # Option 1: Don't take item i
            dp[i][w] = dp[i-1][w]
            # Option 2: Take item i (if it fits)
            if weights[i-1] <= w:
                take_value = values[i-1] + dp[i-1][w - weights[i-1]]
                dp[i][w] = max(dp[i][w], take_value)

    return dp[n][capacity]
```

The optimization loop:
Every DP solution follows this pattern:
Define the state — What information do we need to describe a subproblem? (e.g., 'amount remaining' for coin change, '(item index, remaining capacity)' for knapsack)
Write the recurrence — How does the optimal solution for a state relate to optimal solutions of smaller states? This is where optimization happens—we take the min or max over all choices.
Identify base cases — What are the smallest subproblems we can solve directly? (e.g., amount=0 needs 0 coins, no items means 0 value)
Compute solutions — Either top-down with memoization (recursion + cache) or bottom-up with tabulation (iteration + table)
Extract the answer — Read the optimal value from the appropriate cell of our DP table
Once you correctly formulate the recurrence relation, the algorithm is essentially complete. The recurrence captures both the structure of the problem and the optimization logic. Implementation is just mechanical translation of this mathematical relationship into code.
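As a sketch of step 4's top-down alternative, the same coin-change recurrence can be computed with recursion plus a cache (the name `min_coins_topdown` is our own):

```python
from functools import lru_cache

def min_coins_topdown(amount, coins):
    """Top-down version of the coin-change recurrence: recursion plus a cache."""
    coins = tuple(coins)

    @lru_cache(maxsize=None)
    def opt(amt):
        if amt == 0:
            return 0  # Base case: amount 0 needs 0 coins
        best = float('inf')
        for c in coins:
            if c <= amt:
                best = min(best, 1 + opt(amt - c))
        return best

    result = opt(amount)
    return result if result != float('inf') else -1

print(min_coins_topdown(6, [1, 3, 4]))  # 2, matching the bottom-up min_coins
```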
Dynamic Programming works because of a beautiful mathematical property: a complex optimization problem can be decomposed into a collection of simpler problems, where each subproblem is solved only once, its answer is reused wherever the subproblem recurs, and optimal answers to subproblems combine into the optimal answer to the whole.
Proof sketch for correctness:
Consider any DP solution. We claim it finds the optimal answer. Here's why:
Base cases are correct — We explicitly handle the smallest subproblems (e.g., 0 remaining, empty input)
Inductive step (optimal substructure) — For any state, our recurrence considers ALL ways to reach it from smaller states. Since each smaller state contains the optimal solution (by induction), and we take the best among all options, the current state must also be optimal.
Complete coverage — By iterating through all states (bottom-up) or exploring all reachable states (top-down), we ensure every relevant subproblem is solved.
This inductive argument mirrors the mathematical principle of strong induction: if we can solve base cases, and we can solve any case given solutions to all smaller cases, then we can solve all cases.
DP is not a universal optimization tool. It fails when: (1) Problems lack optimal substructure (e.g., longest simple path in general graphs), (2) The state space is exponential (e.g., Traveling Salesman without clever formulation), (3) Subproblems don't overlap (then divide-and-conquer is simpler). Always verify these conditions before attempting DP.
The optimization power of DP is most vividly seen in its complexity improvements. By eliminating redundant computation, DP often achieves exponential speedups:
Time complexity improvements:
| Problem | Naive Recursive | With DP | Improvement Factor |
|---|---|---|---|
| Fibonacci(n) | O(2ⁿ) | O(n) | Exponential → Linear |
| Coin Change (amount n, k coins) | O(kⁿ) | O(n × k) | Exponential → Pseudo-polynomial |
| Longest Common Subsequence | O(2^(m+n)) | O(m × n) | Exponential → Quadratic |
| 0/1 Knapsack | O(2ⁿ) | O(n × W) | Exponential → Pseudo-polynomial |
| Matrix Chain Multiplication | O(4ⁿ/n^(3/2)) | O(n³) | Exponential → Cubic |
Why are the gains so dramatic?
The key is the relationship between total subproblems and overlap:
Total distinct subproblems — This determines DP's time complexity. For Fibonacci, there are only n+1 distinct values: F(0), F(1), ..., F(n).
Overlap ratio — How many times would naive recursion recompute each subproblem? For Fibonacci, this ratio is exponential. F(2) is computed an exponentially increasing number of times as n grows.
DP's time complexity = (number of subproblems) × (time per subproblem)
Naive recursion's time = (number of subproblems) × (overlap ratio) × (time per subproblem)
By eliminating the overlap ratio, DP collapses exponential computation into polynomial.
```python
import time

def fib_naive(n):
    """O(2^n) - Exponential time"""
    if n <= 1:
        return n
    return fib_naive(n - 1) + fib_naive(n - 2)

def fib_dp(n):
    """O(n) - Linear time with DP"""
    if n <= 1:
        return n
    dp = [0] * (n + 1)
    dp[1] = 1
    for i in range(2, n + 1):
        dp[i] = dp[i-1] + dp[i-2]
    return dp[n]

# Time the DP version to see the difference for yourself:
start = time.perf_counter()
fib_dp(100)
print(f"fib_dp(100): {time.perf_counter() - start:.6f} seconds")

# Demonstration of the difference
# For n = 35:
#   - fib_naive takes about 3-5 seconds
#   - fib_dp takes about 0.00001 seconds (10,000x+ faster)
# For n = 45:
#   - fib_naive takes about 5-10 MINUTES
#   - fib_dp takes about 0.00001 seconds (millions of times faster)
# For n = 100:
#   - fib_naive would take millions of years
#   - fib_dp still takes about 0.00001 seconds
```

In computational complexity theory, polynomial-time algorithms are considered 'efficient' while exponential-time algorithms are considered 'intractable'. DP frequently bridges this gap—taking problems that seem exponential and revealing polynomial structure hidden within the overlap of subproblems.
We've established Dynamic Programming as a powerful optimization technique. Let's consolidate the key insights:

- DP applies when a problem has optimal substructure and overlapping subproblems.
- Memoization (top-down) or tabulation (bottom-up) ensures each subproblem is solved exactly once.
- Unlike brute force, DP eliminates redundant work; unlike greedy, it evaluates every choice before committing.
- The recurrence relation captures both the problem's structure and its optimization logic.
- The payoff is often an exponential-to-polynomial speedup.
What's next:
Now that we understand DP as an optimization technique, we'll explore the first crucial requirement: breaking problems into subproblems. The next page examines how to decompose complex problems into simpler, self-similar pieces—the essential first step in any DP solution.
You now understand Dynamic Programming as an optimization paradigm. You've seen how it differs from brute force (by eliminating redundancy) and greedy (by considering all choices). You've learned about recurrence relations and the exponential speedups DP provides. Next, we'll dive deeper into the art of problem decomposition.