Imagine being asked to calculate 47 × 23. You work it out: 1,081. Now imagine being asked the same question a hundred times. Would you recalculate it each time, or would you simply remember the answer?
This seemingly obvious insight—remember what you've already computed—is the mechanism that transforms Dynamic Programming from a theoretical framework into a practical powerhouse. Without it, DP would be no better than brute force. With it, DP achieves exponential speedups.
This page explores the two implementation strategies that bring DP to life: memoization (remember as you compute) and tabulation (compute systematically in order). Understanding both is essential to becoming a DP master.
By the end of this page, you will understand why storing solutions is essential, how memoization (top-down) and tabulation (bottom-up) work, when to use each approach, and how to implement both in practice. You'll see identical problems solved both ways.
Let's revisit a critical observation from earlier: naive recursion recomputes the same subproblems exponentially many times.
Quantifying the redundancy:
Consider computing Fibonacci(50) naively: the recursion performs about 40 billion function calls (2 × F(51) − 1 of them, to be exact), yet there are only 51 distinct subproblems, F(0) through F(50).
This is staggeringly wasteful. We're computing the same values over and over, each time throwing away the result.
Why does this happen?
It's a consequence of the overlapping subproblem property. When multiple branches of the recursion tree need the same subproblem, each branch independently computes it. Without memory of past work, each computation starts from scratch.
# Let's count how many times each subproblem is computed naively

call_count = {}

def fib_naive_counted(n):
    """Fibonacci with call counting to demonstrate redundancy."""
    call_count[n] = call_count.get(n, 0) + 1
    if n <= 1:
        return n
    return fib_naive_counted(n - 1) + fib_naive_counted(n - 2)

# Example: Computing F(20)
call_count.clear()
result = fib_naive_counted(20)

print(f"F(20) = {result}")
print(f"Total calls: {sum(call_count.values())}")
print(f"Unique subproblems: {len(call_count)}")

# Sample output:
# F(20) = 6765
# Total calls: 21891
# Unique subproblems: 21
#
# That's over 1000 calls per unique subproblem on average!

# Call counts for specific subproblems:
# F(2) is called 4181 times
# F(3) is called 2584 times
# F(10) is called 89 times
# F(15) is called 8 times

The solution is obvious but profound:
Instead of discarding computed values, store them. Before computing any subproblem, check whether its solution is already stored; if it is, return it immediately, and if not, compute it once and save the result.
This simple change eliminates all redundant computation. Each of the 51 distinct Fibonacci subproblems is computed exactly once, regardless of how many times it's needed.
Storing solutions requires memory. For Fibonacci, we need O(n) space. For 2D problems like LCS, we need O(m × n) space. This is the fundamental tradeoff: we exchange memory for time. In almost all practical scenarios, this tradeoff is overwhelmingly favorable—memory is cheap, time is expensive.
Memoization is the technique of caching the results of expensive function calls and returning the cached result when the same inputs occur again. The name comes from 'memo'—like writing a note to yourself.
The memoization pattern:
memo = {} # Cache storage
def solve(state):
# Check cache first
if state in memo:
return memo[state]
# Compute the answer
if state is base case:
result = base_value
else:
result = combine(solve(smaller_state_1), solve(smaller_state_2), ...)
# Store in cache before returning
memo[state] = result
return result
Key characteristics of memoization:
Top-down — We start with the main problem and recursively break it down, computing subproblems as needed.
Demand-driven — Only subproblems actually needed are computed. If some branches of the recursion tree are never explored, their subproblems are never computed (see the sketch after this list).
Recursive — The code structure mirrors the mathematical recurrence directly, making it intuitive to write.
Lazy — Computation happens on first access; cached results are returned on subsequent accesses.
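The demand-driven point is easy to observe through lru_cache's statistics. Below is a minimal sketch, assuming a hypothetical reachable function with a single coin of 5 and a target of 100 (not one of this page's main examples):

from functools import lru_cache

@lru_cache(maxsize=None)
def reachable(amount):
    """Min coins to make 'amount' using only a coin of 5 (inf if impossible)."""
    if amount == 0:
        return 0
    if amount < 0:
        return float('inf')
    return reachable(amount - 5) + 1

reachable(100)
# Only the amounts actually reached are cached: 100, 95, 90, ..., 0
print(reachable.cache_info().currsize)  # 21 entries, not 101

A tabulated version of the same problem would fill all 101 table entries, reachable or not.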
# Example 1: Fibonacci with Memoization

def fib_memo(n, memo=None):
    """
    Memoized Fibonacci: O(n) time, O(n) space.
    Each subproblem is computed exactly once.
    """
    if memo is None:
        memo = {}

    # Check cache
    if n in memo:
        return memo[n]

    # Base cases
    if n <= 1:
        return n

    # Recursive case with memoization
    result = fib_memo(n - 1, memo) + fib_memo(n - 2, memo)
    memo[n] = result
    return result

# Python also supports @functools.lru_cache for automatic memoization:
from functools import lru_cache

@lru_cache(maxsize=None)
def fib_lru(n):
    """Fibonacci with automatic memoization via lru_cache."""
    if n <= 1:
        return n
    return fib_lru(n - 1) + fib_lru(n - 2)

# Example 2: Minimum Coin Change with Memoization

@lru_cache(maxsize=None)
def min_coins_memo(amount, coins):
    """
    Minimum coins needed to make 'amount'.
    coins is a tuple (hashable for caching).
    """
    if amount == 0:
        return 0
    if amount < 0:
        return float('inf')

    min_result = float('inf')
    for coin in coins:
        result = min_coins_memo(amount - coin, coins)
        min_result = min(min_result, result + 1)
    return min_result

# Usage: min_coins_memo(11, (1, 5, 6)) returns 2
#   Try coin 6: 1 + min_coins(5)  = 1 + 1 = 2
#   Try coin 5: 1 + min_coins(6)  = 1 + 1 = 2
#   Try coin 1: 1 + min_coins(10) = 1 + 2 = 3
# So the answer is 2 (11 = 6 + 5).

Think of memoization as adding a 'memory layer' on top of naive recursion. You write the recursive logic naturally, then wrap it with cache checks. This makes memoization easy to implement—just add cache lookup at the start and cache update at the end of your recursive function.
Tabulation is the technique of systematically filling a table (array) with solutions to subproblems, starting from the smallest (base cases) and building up to the final answer. Unlike memoization, there's no recursion—just iteration.
The tabulation pattern:
# Create table with dimensions matching state space
dp = create_table(dimensions, default_value)
# Fill base cases
dp[base_indices] = base_values
# Fill table in dependency order (smallest to largest)
for each state in topological_order:
dp[state] = combine(dp[dependency_1], dp[dependency_2], ...)
# Answer is in the cell corresponding to the original problem
return dp[answer_index]
Key characteristics of tabulation:
Bottom-up — We start with base cases and build up to the final answer, never looking 'ahead'.
Eager — All subproblems are computed, regardless of whether they're needed for the final answer.
Iterative — No recursion means no call stack overhead and no stack overflow risk.
Explicit ordering — We must determine the correct order to fill the table (topological order of the subproblem DAG).
# Example 1: Fibonacci with Tabulation

def fib_tab(n):
    """
    Tabulated Fibonacci: O(n) time, O(n) space.
    We fill the table from F(0) to F(n) in order.
    """
    if n <= 1:
        return n

    # Create table
    dp = [0] * (n + 1)

    # Base cases
    dp[0] = 0
    dp[1] = 1

    # Fill table bottom-up
    for i in range(2, n + 1):
        dp[i] = dp[i - 1] + dp[i - 2]

    # Answer
    return dp[n]

# Example 2: Minimum Coin Change with Tabulation

def min_coins_tab(amount, coins):
    """
    Minimum coins needed to make 'amount'.
    We compute min coins for every amount from 0 to target.
    """
    # Create table: dp[a] = min coins for amount a
    dp = [float('inf')] * (amount + 1)

    # Base case: 0 coins needed for amount 0
    dp[0] = 0

    # Fill table for all amounts from 1 to target
    for a in range(1, amount + 1):
        for coin in coins:
            if coin <= a and dp[a - coin] != float('inf'):
                dp[a] = min(dp[a], dp[a - coin] + 1)

    return dp[amount] if dp[amount] != float('inf') else -1

# Example 3: Longest Common Subsequence with Tabulation

def lcs_tab(s1, s2):
    """
    Length of Longest Common Subsequence.
    2D table where dp[i][j] = LCS of s1[0:i] and s2[0:j].
    """
    m, n = len(s1), len(s2)

    # Create (m+1) x (n+1) table, initialized to 0
    dp = [[0] * (n + 1) for _ in range(m + 1)]

    # Base cases: dp[0][j] = dp[i][0] = 0 (already done by initialization)

    # Fill table row by row
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            if s1[i - 1] == s2[j - 1]:
                # Characters match: extend LCS
                dp[i][j] = dp[i - 1][j - 1] + 1
            else:
                # No match: take best of excluding either char
                dp[i][j] = max(dp[i - 1][j], dp[i][j - 1])

    return dp[m][n]

# Usage: lcs_tab("ABCDGH", "AEDFHR") returns 3

In tabulation, the order you fill the table must respect dependencies. For 1D problems growing left-to-right, iterate i from 0 to n. For 2D grid problems, row-by-row or column-by-column works. For interval DP, iterate by interval length. Getting the order wrong causes incorrect reads of unfilled cells.
Both approaches solve DP problems correctly and efficiently, but they have different characteristics that make each preferable in different situations.
Structural comparison:
| Aspect | Memoization | Tabulation |
|---|---|---|
| Approach | Start from problem, recurse down | Start from base cases, build up |
| Implementation | Recursive with cache | Iterative with table |
| Order of computation | Determined by recursion (automatic) | Must be explicitly specified |
| Subproblems computed | Only those needed (lazy) | All possible (eager) |
| Call stack usage | Can be deep (recursion limit) | None (no recursion) |
| Space for storage | Hash map (dict) or array | Array (table) |
| Cache lookup overhead | Hash map lookup O(1) average | Direct array access O(1) |
| Code style | Matches recurrence naturally | More verbose but explicit |
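The lookup-overhead row is easy to check empirically. Here is a minimal sketch, assuming the fib_memo and fib_tab functions defined on this page; exact numbers depend on your machine and Python version:

import sys
import timeit

sys.setrecursionlimit(10_000)  # Headroom for fib_memo's ~n-deep recursion

n = 2000
t_memo = timeit.timeit(lambda: fib_memo(n), number=100)
t_tab = timeit.timeit(lambda: fib_tab(n), number=100)
print(f"memoization: {t_memo:.3f}s  tabulation: {t_tab:.3f}s")
# Tabulation typically wins by a constant factor: plain loops and list
# indexing are cheaper than recursive calls plus dict lookups.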
When to prefer Memoization:
Sparse subproblem access — If many subproblems aren't needed for the final answer, memoization avoids computing them. Example: certain search problems where pruning eliminates branches.
Complex state dependencies — When the order of computation isn't obvious, memoization's demand-driven nature handles it automatically.
Quick implementation — Memoization often requires minimal changes to a recursive solution—just add caching.
Readability — The code directly mirrors the mathematical recurrence, making it easier to verify correctness.
When to prefer Tabulation:
Deep recursion — Languages with limited stack size (like Python's default ~1000 frames) can overflow with deep memoization. Tabulation avoids this (see the sketch after this list).
Space optimization — Tabulation makes it easier to see and exploit the opportunity to reduce space (e.g., only keeping the previous row).
Constant factor speed — Iterative loops with array access are faster than recursive calls with hash map lookups.
Dense computation — If all subproblems are needed anyway, tabulation is cleaner and faster.
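A minimal sketch of the stack-depth contrast from the first point above, again assuming this page's fib_memo and fib_tab:

import sys

n = 5000  # Well past CPython's default recursion limit (~1000 frames)

try:
    fib_memo(n)  # One stack frame per recursion level: ~5000 deep
except RecursionError:
    print(f"fib_memo({n}) overflowed the stack "
          f"(limit: {sys.getrecursionlimit()})")

result = fib_tab(n)  # Iterative: no call stack growth, no overflow
print(f"fib_tab({n}) succeeded: {len(str(result))} digits")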
In interviews and competitive programming, memoization is often faster to write and easier to debug. In production systems where performance is critical, tabulation with space optimization is preferred. Many practitioners start with memoization for correctness, then convert to tabulation for performance if needed.
One of the most valuable skills in DP implementation is space optimization—reducing the memory footprint without sacrificing correctness or time complexity.
The key insight:
Many DP problems have limited dependency scope. If dp[i] only depends on dp[i-1] and dp[i-2], we don't need to store all n values—just the last two suffice.
Common space optimization patterns:
# Pattern 1: Direct variable replacement
# When dp[i] depends only on dp[i-1] and dp[i-2]

def fib_optimized(n):
    """
    Fibonacci with O(1) space.
    We only need the last two values, not all n.
    """
    if n <= 1:
        return n

    prev2, prev1 = 0, 1  # F(i-2), F(i-1)
    for i in range(2, n + 1):
        current = prev1 + prev2
        prev2 = prev1
        prev1 = current
    return prev1

# Space: O(n) → O(1)

# Pattern 2: Rolling array (keep only necessary rows)
# When dp[i][j] depends only on dp[i-1][...] (previous row)

def min_path_sum_optimized(grid):
    """
    Minimum path sum with O(n) space instead of O(m × n).
    Each row only needs the previous row's values.
    """
    if not grid:
        return 0

    m, n = len(grid), len(grid[0])
    prev = [float('inf')] * n

    for i in range(m):
        curr = [float('inf')] * n
        for j in range(n):
            if i == 0 and j == 0:
                curr[j] = grid[0][0]
            else:
                from_above = prev[j] if i > 0 else float('inf')
                from_left = curr[j - 1] if j > 0 else float('inf')
                curr[j] = grid[i][j] + min(from_above, from_left)
        prev = curr

    return prev[n - 1]

# Space: O(m × n) → O(n)

# Pattern 3: In-place modification with careful iteration direction
# When we can reuse the same array, updating in the right direction

def coin_change_optimized(amount, coins):
    """
    Coin change with minimal space.
    The dp array is reused; each amount is revisited once per coin.
    """
    dp = [float('inf')] * (amount + 1)
    dp[0] = 0

    for coin in coins:
        for a in range(coin, amount + 1):
            dp[a] = min(dp[a], dp[a - coin] + 1)

    return dp[amount] if dp[amount] != float('inf') else -1

# Pattern 4: 0/1 Knapsack with 1D array
# Iterate backwards to avoid using already-updated values

def knapsack_optimized(values, weights, capacity):
    """
    0/1 Knapsack with O(W) space instead of O(n × W).
    Trick: iterate capacity backwards so we don't use the 'new' values.
    """
    dp = [0] * (capacity + 1)

    for i in range(len(values)):
        # BACKWARDS! w runs from capacity down to weights[i]
        for w in range(capacity, weights[i] - 1, -1):
            dp[w] = max(dp[w], values[i] + dp[w - weights[i]])

    return dp[capacity]

# Space: O(n × W) → O(W)

When reducing 2D DP to 1D, the direction of iteration is critical. If you iterate forward but depend on values that just changed, you'll use the current row's values instead of the previous row's—breaking correctness. For 0/1 Knapsack, iterate backwards; for unbounded coin change, iterate forwards.
| Original Space | Optimization | Result | When Applicable |
|---|---|---|---|
| O(n) — 1D array | Two variables | O(1) | Only last 1-2 values needed |
| O(m × n) — 2D grid | Rolling array (1 row) | O(n) | Each row depends only on previous row |
| O(n × W) — 2D table | 1D array with careful iteration | O(W) | 0/1 choice problems with backwards iteration |
| O(m × n) — 2 sequences | Single row + update in place | O(min(m,n)) | LCS-style problems, diagonal dependencies |
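The last row describes a pattern not shown in the code above, so here is a minimal sketch of LCS length in O(min(m, n)) space (lcs_two_rows is an illustrative name):

def lcs_two_rows(s1, s2):
    """LCS length keeping only two rows of length min(m, n) + 1."""
    # Make s2 the shorter string so each row has length min(m, n) + 1
    if len(s2) > len(s1):
        s1, s2 = s2, s1
    n = len(s2)

    prev = [0] * (n + 1)  # Holds dp[i-1][*]
    for ch in s1:
        curr = [0] * (n + 1)  # Holds dp[i][*]
        for j in range(1, n + 1):
            if ch == s2[j - 1]:
                curr[j] = prev[j - 1] + 1
            else:
                curr[j] = max(prev[j], curr[j - 1])
        prev = curr
    return prev[n]

# lcs_two_rows("ABCDGH", "AEDFHR") returns 3, matching lcs_tab above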
After implementing many DP solutions, certain best practices emerge that help avoid bugs and improve clarity:
1. Define state semantics precisely
Before writing code, write a comment stating exactly what dp[i] or dp[i][j] represents. Ambiguity here causes bugs.
# dp[i] = minimum cost to reach step i from step 0
# dp[i][j] = LCS length of s1[0..i-1] and s2[0..j-1]
# dp[i][w] = max value using items 0..i-1 with capacity w
2. Handle base cases explicitly
Don't rely on implicit initialization. Set base cases clearly:
dp[0] = 0 # Base case: cost to reach start is 0
dp[1] = cost[0] # Base case: cost to reach step 1 is cost of first step
3. Use meaningful variable names
Instead of dp, consider min_cost, max_profit, or ways_to_reach. The name documents intent.
4. Validate indices before access
Off-by-one errors are the most common DP bugs. Check your loop bounds and array accesses.
if i - 1 >= 0: # Make sure we don't access dp[-1]
result = min(result, dp[i - 1] + cost)
# Example: Well-documented DP solution for House Robber

def rob_houses(money):
    """
    Maximum money that can be robbed without robbing adjacent houses.

    State definition:
        max_profit[i] = maximum money obtainable from houses 0..i

    Recurrence:
        max_profit[i] = max(
            max_profit[i-1],            # Skip house i
            money[i] + max_profit[i-2]  # Rob house i
        )

    Base cases:
        max_profit[0] = money[0]                 # Only one house
        max_profit[1] = max(money[0], money[1])  # Best of first two

    Time: O(n), Space: O(1) after optimization
    """
    # Edge cases
    if not money:
        return 0
    if len(money) == 1:
        return money[0]

    n = len(money)

    # Instead of a full array, use two variables (space optimization)
    # prev2 = max_profit[i-2], prev1 = max_profit[i-1]
    prev2 = money[0]
    prev1 = max(money[0], money[1])

    for i in range(2, n):
        # Current = max(skip house i, rob house i)
        current = max(prev1, money[i] + prev2)
        prev2 = prev1
        prev1 = current

    return prev1

# Example: Proper memoization with explicit cache check

def min_edit_distance(s1, s2):
    """
    Minimum operations (insert, delete, replace) to transform s1 to s2.
    State: dp[i][j] = edit distance of s1[0..i-1] and s2[0..j-1]
    """
    # Use a dictionary for memoization with a clear key structure
    memo = {}

    def dp(i, j):
        # State key
        key = (i, j)

        # Cache check
        if key in memo:
            return memo[key]

        # Base cases
        if i == 0:
            result = j  # Insert all of s2
        elif j == 0:
            result = i  # Delete all of s1
        elif s1[i - 1] == s2[j - 1]:
            result = dp(i - 1, j - 1)  # No operation needed
        else:
            result = 1 + min(
                dp(i - 1, j),      # Delete from s1
                dp(i, j - 1),      # Insert into s1
                dp(i - 1, j - 1)   # Replace
            )

        # Cache before return
        memo[key] = result
        return result

    return dp(len(s1), len(s2))

DP is notorious for subtle bugs. Here are the most common pitfalls and how to avoid them:
Pitfall 1: Off-by-one errors in indexing
Problem: Using dp[i] to represent 'first i elements' vs 'element at index i' inconsistently.
Solution: Choose one convention and stick to it. Using dp[i] = 'solution for first i elements' (1-indexed semantically) often reduces edge case complexity.
Pitfall 2: Wrong initialization
Problem: Initializing to 0 when you should use infinity (for minimization) or -infinity (for maximization).
Solution: For minimization, initialize base cases to their known values and all other states to infinity, so unreached states never win a min(). For maximization, use negative infinity for unreached states instead.
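A minimal sketch of this pitfall in the coin-change setting from earlier (the variable names are illustrative):

INF = float('inf')
amount = 11  # Arbitrary example target

dp_wrong = [0] * (amount + 1)    # Bug: every amount already looks free, so
                                 # min(dp[a], dp[a - coin] + 1) never improves
dp_right = [INF] * (amount + 1)  # Unreached states lose every min()
dp_right[0] = 0                  # Only the base case starts achievable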
Pitfall 3: Forgetting base cases
Problem: Recursion or loops try to access undefined base cases.
Solution: Always explicitly handle i=0, j=0, empty input, and boundary conditions.
Pitfall 4: Wrong iteration order (tabulation)
Problem: Filling table in an order that violates dependencies.
Solution: Draw the dependency DAG for a small case. Ensure your iteration visits dependencies before dependents.
Pitfall 5: Mutating memoization keys
Problem: Using mutable objects (lists) as dictionary keys in memoization.
Solution: Convert to immutable types (tuples) or use indices only as keys.
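A minimal sketch of the fix, using a hypothetical subset-counting problem whose natural argument is a list:

memo = {}

def count_subsets(nums, target):
    """Count subsets of nums (a list) that sum to target."""
    key = (tuple(nums), target)  # tuple(nums): lists can't be dict keys
    if key in memo:
        return memo[key]
    if not nums:
        result = 1 if target == 0 else 0
    else:
        result = (count_subsets(nums[1:], target - nums[0])  # Take nums[0]
                  + count_subsets(nums[1:], target))         # Skip nums[0]
    memo[key] = result
    return result

print(count_subsets([2, 3, 5], 5))  # 2: {2, 3} and {5}

Using indices as keys (an index into nums plus the remaining target) avoids the conversion cost entirely and is usually the better choice.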
DP bugs are particularly insidious because the code may work on small tests but fail on larger inputs. A wrong base case might be masked by other values. An incorrect transition might produce 'close enough' answers. Always test with edge cases: empty input, single element, and inputs that exercise all branches of your recurrence.
Debugging strategy:
Print the DP table — Especially for 2D DP, printing the entire table reveals patterns and errors.
Trace by hand — Pick a small example and manually compute each cell. Compare with code output.
Add assertions — Assert bounds, assert that values make sense (non-negative, within expected range).
Test incrementally — Verify base cases before testing full recurrence. Test with n=1, n=2 before n=100.
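A minimal sketch combining the first three tactics, rebuilding the lcs_tab table from earlier for a case small enough to trace by hand (print_dp_table is an illustrative helper):

def print_dp_table(dp):
    """Print each row of a 2D DP table, aligned for easy inspection."""
    for i, row in enumerate(dp):
        print(f"row {i}: " + " ".join(f"{v:>3}" for v in row))

# Rebuild the LCS table for "ABC" vs "AC", as in lcs_tab
s1, s2 = "ABC", "AC"
m, n = len(s1), len(s2)
dp = [[0] * (n + 1) for _ in range(m + 1)]
for i in range(1, m + 1):
    for j in range(1, n + 1):
        if s1[i - 1] == s2[j - 1]:
            dp[i][j] = dp[i - 1][j - 1] + 1
        else:
            dp[i][j] = max(dp[i - 1][j], dp[i][j - 1])

assert dp[m][n] == 2  # Hand check: LCS("ABC", "AC") = "AC"
print_dp_table(dp)
# row 0:   0   0   0
# row 1:   0   1   1
# row 2:   0   1   1
# row 3:   0   1   2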
We've explored the mechanism that makes Dynamic Programming practical: storing solutions to avoid recomputation. Let's consolidate the key insights:

Storing solutions — caching turns exponential recomputation into one computation per distinct subproblem, trading memory for time.

Memoization — top-down, recursive, and lazy; it mirrors the recurrence directly and computes only the subproblems actually needed.

Tabulation — bottom-up, iterative, and eager; it requires an explicit fill order but avoids recursion overhead and makes space optimization easier.

Space optimization — when dependencies are local, full tables can often shrink to a few rows or even a few variables.
What's next:
We've now covered three of the four pillars of Dynamic Programming: what it is (optimization technique), how to decompose problems (subproblems), and how to store solutions (memoization/tabulation). The final page compares DP with its close relatives—plain recursion and greedy algorithms—to complete your conceptual foundation.
You now understand the implementation mechanics of Dynamic Programming. You can implement both memoization and tabulation, optimize space usage, and avoid common pitfalls. Next, we'll compare DP with recursion and greedy algorithms to understand when each approach is appropriate.