At the heart of algorithmic design lies a fundamental trade-off: memory and time are interchangeable resources. You can often reduce computation time by using more memory (caching, precomputation, lookup tables), or reduce memory usage by doing more computation (recalculating values instead of storing them).
This isn't merely a technical detail—it's a philosophical stance that shapes every algorithmic decision. The engineer who understands space-time trade-offs can navigate constraints that would stymie others. Need to process a terabyte of data on a machine with 8GB of RAM? You'll trade time for space. Need sub-millisecond response times at any cost? You'll trade space for time.
This page makes the implicit explicit. You'll learn to recognize trade-off opportunities, quantify them precisely, and make informed decisions based on your specific constraints. Whether you're optimizing for a resource-constrained embedded system or a memory-rich cloud server, these principles will guide your choices.
By the end of this page, you will be able to:

• Quantify space-time trade-offs and reason about break-even points
• Implement memoization to trade memory for recursive efficiency
• Design caching strategies with appropriate eviction policies
• Apply space-optimized DP techniques that sacrifice some speed for memory
• Recognize when streaming/on-the-fly computation beats storage
• Make informed decisions about which resource to optimize given your constraints
Space-time trade-offs exist on a spectrum. At one extreme: compute everything on demand, store nothing. At the other: precompute everything, store all possible results. Most practical solutions fall somewhere in between.
The Spectrum Visualized:
```
← More Time, Less Space                               More Space, Less Time →
|--------------------------------------------------------------------------|
Compute         Partial         Cache           Full            Lookup
on demand       cache           hot paths       precompute      tables
```
Examples Across the Spectrum:
| Strategy | Time per Query | Space | Best When |
|---|---|---|---|
| Compute on demand | O(n) | O(1) | Few queries, varied n, tight memory |
| Memoization | O(1) if cached, O(n) on a miss | O(distinct inputs cached) | Repeated queries with locality |
| Full precompute | O(1) always | O(max_n) | Many queries, bounded n, ample memory |
The Decision Framework:
Choosing a position on the spectrum depends on how often results are queried, how expensive each computation is, how much memory you can spare, and how often the underlying data changes.

Real systems often use hybrid strategies: precompute common cases, compute rare cases on demand, and cache results for intermediate reuse, as sketched below. This captures most of the benefit of full precomputation while respecting memory limits. The art is choosing which cases to precompute based on access frequency and computation cost.
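A minimal sketch of such a hybrid lookup, assuming a hypothetical `compute(key)` function and a small set of known-hot keys chosen purely for illustration:

```python
from functools import lru_cache

# Hypothetical expensive computation; stands in for whatever you would
# otherwise precompute or recompute on every request.
def compute(key):
    return sum(i * i for i in range(key))

HOT_KEYS = [10, 100, 1000]                       # known-frequent cases
PRECOMPUTED = {k: compute(k) for k in HOT_KEYS}  # precompute common cases

@lru_cache(maxsize=256)                          # bounded cache for the rest
def _cached_compute(key):
    return compute(key)

def lookup(key):
    """Hybrid strategy: precomputed table -> bounded cache -> on-demand."""
    if key in PRECOMPUTED:
        return PRECOMPUTED[key]   # O(1), paid for at startup
    return _cached_compute(key)   # cached if seen recently, else computed

print(lookup(100))   # served from the precomputed table
print(lookup(37))    # computed once, then cached
```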
Memoization is the canonical technique for trading space for time. By storing results of expensive function calls and returning cached results for repeated inputs, we avoid redundant computation.
The Classic Example: Fibonacci
Without memoization, recursive Fibonacci has exponential time complexity due to redundant computation:
```python
def fib_naive(n):
    """
    Naive recursive Fibonacci.

    Time: O(2^n) - exponential due to overlapping subproblems
    Space: O(n) - recursion stack depth

    fib(5) computes fib(3) twice, fib(2) three times, etc.
    """
    if n <= 1:
        return n
    return fib_naive(n - 1) + fib_naive(n - 2)


def fib_memoized(n, memo=None):
    """
    Memoized Fibonacci: trade space for time.

    Time: O(n) - each value computed once
    Space: O(n) - memo dictionary + recursion stack

    The space cost of the memo dictionary eliminates
    exponential redundant computation.
    """
    if memo is None:
        memo = {}
    if n in memo:
        return memo[n]  # O(1) lookup instead of O(2^n) recomputation
    if n <= 1:
        return n
    result = fib_memoized(n - 1, memo) + fib_memoized(n - 2, memo)
    memo[n] = result  # Store for future use
    return result


# Using Python's built-in memoization
from functools import lru_cache

@lru_cache(maxsize=None)  # Unlimited cache
def fib_cached(n):
    """
    LRU-cached Fibonacci.

    Same O(n) time, O(n) space as manual memoization.
    Cleaner syntax, handles cache management automatically.
    """
    if n <= 1:
        return n
    return fib_cached(n - 1) + fib_cached(n - 2)


# Comparison (approximate, CPython):
# fib_naive(40):    ~30 seconds, roughly 3 * 10^8 recursive calls
# fib_memoized(40): ~0.0001 seconds, ~40 values computed
# Space cost: a few kilobytes for the memo dictionary
# Trade-off: a tiny amount of memory eliminates hundreds of millions of calls
```

When Memoization Shines:
Memoization is most effective when subproblems overlap heavily, the same inputs recur (temporal locality), and each computation is expensive relative to a cache lookup. The patterns below illustrate common cases.
```python
from functools import lru_cache

# Pattern 1: Memoization for combinatorics
@lru_cache(maxsize=None)
def count_paths(m, n):
    """
    Count paths in an m×n grid from top-left to bottom-right.

    Without memoization: O(2^(m+n)) - exponential
    With memoization:    O(m × n)   - polynomial
    Space: O(m × n) for the cache
    """
    if m == 1 or n == 1:
        return 1
    return count_paths(m - 1, n) + count_paths(m, n - 1)


# Pattern 2: Memoization with complex keys
def edit_distance(s1, s2):
    """
    Minimum edit distance between strings.
    Memoize on (i, j) positions in the strings.
    """
    @lru_cache(maxsize=None)
    def dp(i, j):
        if i == 0:
            return j
        if j == 0:
            return i
        if s1[i - 1] == s2[j - 1]:
            return dp(i - 1, j - 1)
        return 1 + min(
            dp(i - 1, j),      # Delete
            dp(i, j - 1),      # Insert
            dp(i - 1, j - 1),  # Replace
        )

    return dp(len(s1), len(s2))


# Pattern 3: Bounded memoization for memory control
@lru_cache(maxsize=1000)  # Keep only the 1000 most recent results
def expensive_api_call(query):
    """
    Cache expensive calls with a size limit.

    LRU eviction ensures bounded memory while still
    benefiting from temporal locality.
    """
    # Simulate an expensive external call (external_api is a placeholder)
    return external_api.fetch(query)


# Pattern 4: Memoization with unhashable types
def memoize_list_input():
    """
    For unhashable inputs (lists), convert to a hashable form.
    """
    cache = {}

    def solve(items):  # items is a list
        key = tuple(items)  # Convert to a hashable tuple
        if key in cache:
            return cache[key]
        # Expensive computation (heavy_computation is a placeholder)
        result = heavy_computation(items)
        cache[key] = result
        return result

    return solve
```

Not all recursions benefit from memoization. If subproblems don't overlap, memoization just adds overhead. If inputs are unique or rarely repeat, the cache provides no benefit. Always verify that overlapping subproblems exist before adding memoization.
While memoization caches function results, general caching stores any computed or fetched data for reuse. Caching is pervasive in systems: CPU caches, database query caches, CDNs, and application-level caches all trade memory for reduced latency.
Cache Design Decisions:
| Policy | Description | Best For | Complexity |
|---|---|---|---|
| LRU (Least Recently Used) | Evict entry not accessed longest | Temporal locality patterns | O(1) with hash map + list |
| LFU (Least Frequently Used) | Evict entry accessed fewest times | Frequency-based access patterns | O(log n), or O(1) with a more involved structure |
| FIFO (First In First Out) | Evict oldest entry | Simple, predictable workloads | O(1) with queue |
| Random | Evict random entry | Uniform access, simplicity | O(1) |
| TTL (Time To Live) | Evict after fixed time | Data with known staleness | O(1) per entry |
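Of these policies, LRU and TTL are implemented in full below. As a quick companion, here is a minimal LFU sketch (a simplified illustration, not a production cache) that tracks access counts with `collections.Counter` and accepts an O(n) scan at eviction time:

```python
from collections import Counter


class SimpleLFUCache:
    """Minimal LFU cache sketch: evict the least-frequently-used entry.

    Eviction scans all counts (O(n)); constant-time LFU needs a more
    involved frequency-bucket structure.
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.cache = {}
        self.freq = Counter()

    def get(self, key):
        if key not in self.cache:
            return None
        self.freq[key] += 1        # record the access
        return self.cache[key]

    def put(self, key, value):
        if key not in self.cache and len(self.cache) >= self.capacity:
            # Evict the key with the smallest access count
            victim = min(self.cache, key=lambda k: self.freq[k])
            del self.cache[victim]
            del self.freq[victim]
        self.cache[key] = value
        self.freq[key] += 1


cache = SimpleLFUCache(capacity=2)
cache.put("a", 1)
cache.put("b", 2)
cache.get("a")            # "a" now has a higher access count than "b"
cache.put("c", 3)         # evicts "b", the least frequently used
print(cache.get("b"))     # None
print(cache.get("a"))     # 1
```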
```python
import time
from collections import OrderedDict


class LRUCache:
    """
    LRU Cache implementation: O(1) get and put.

    Trade-off quantified:
    - Space: O(capacity) for storing entries
    - Time: O(1) for all operations (vs O(computation) without the cache)
    - Trade: each entry uses on the order of ~100 bytes;
      saves potentially seconds of work
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.cache = OrderedDict()  # Maintains insertion order

    def get(self, key):
        """
        Get value if in cache, moving it to the end (most recent).

        Cache hit:  O(1)
        Cache miss: returns None (caller must compute and put)
        """
        if key not in self.cache:
            return None
        # Move to end (most recently used)
        self.cache.move_to_end(key)
        return self.cache[key]

    def put(self, key, value):
        """Insert or update an entry, evicting the LRU entry if at capacity."""
        if key in self.cache:
            self.cache.move_to_end(key)
        elif len(self.cache) >= self.capacity:
            # Evict least recently used (first item)
            self.cache.popitem(last=False)
        self.cache[key] = value


class TTLCache:
    """
    Time-based cache: entries expire after TTL seconds.
    Useful when data becomes stale, e.g., API responses.
    """

    def __init__(self, ttl_seconds):
        self.ttl = ttl_seconds
        self.cache = {}  # {key: (value, expiry_time)}

    def get(self, key):
        if key not in self.cache:
            return None
        value, expiry = self.cache[key]
        if time.time() > expiry:
            # Entry expired
            del self.cache[key]
            return None
        return value

    def put(self, key, value):
        expiry = time.time() + self.ttl
        self.cache[key] = (value, expiry)


# Application-level caching pattern
def cached_api_call(api_client, cache, cache_key):
    """Pattern: check cache -> compute on miss -> store result."""
    # Try the cache first
    cached = cache.get(cache_key)
    if cached is not None:
        return cached  # Cache hit: O(1)

    # Cache miss: make the expensive call
    result = api_client.expensive_call()  # O(network_latency)

    # Store for future use
    cache.put(cache_key, result)
    return result
```

A cache's value depends on its hit rate. With a 90% hit rate, each request is fast 9 times out of 10. With a 10% hit rate, the cache is mostly overhead. Analyze your access patterns: do requests repeat? Is there temporal or spatial locality? Size your cache based on the working set—the set of entries accessed within a time window.
Dynamic programming often requires storing intermediate results. But if we only need recent rows/columns, we can reduce space by discarding older data—at the cost of preventing certain query patterns.
The Standard Trade-off in DP:
Fibonacci: Full Table vs. O(1) Space:
```python
# Full table approach: O(n) space
def fib_full_table(n):
    """
    Store all Fibonacci numbers up to n.

    Time:  O(n)
    Space: O(n)
    Benefit: can query any fib(k) for k <= n after computation
    """
    if n <= 1:
        return n
    dp = [0] * (n + 1)
    dp[1] = 1
    for i in range(2, n + 1):
        dp[i] = dp[i - 1] + dp[i - 2]
    return dp[n]


# Space-optimized: O(1) space
def fib_space_optimized(n):
    """
    Only keep the last two values.

    Time:  O(n) - same as the full table
    Space: O(1) - constant, regardless of n
    Trade-off: can only get fib(n), not intermediate values
    """
    if n <= 1:
        return n
    prev2, prev1 = 0, 1
    for _ in range(2, n + 1):
        current = prev1 + prev2
        prev2, prev1 = prev1, current
    return prev1


# For n = 1,000,000:
# Full table:      ~8 MB (1M integers × 8 bytes)
# Space-optimized: ~16 bytes (2 integers)
# Speed: nearly identical (the optimized version is marginally faster
#        due to cache locality)
```

2D DP Space Optimization:
Many 2D DP problems only need the previous row to compute the current row:
```python
# Edit Distance: full 2D table
def edit_distance_full(s1, s2):
    """
    Standard edit distance with a full DP table.

    Time:  O(m × n)
    Space: O(m × n)
    Benefit: can reconstruct the edit sequence by backtracking
    """
    m, n = len(s1), len(s2)
    dp = [[0] * (n + 1) for _ in range(m + 1)]

    # Base cases
    for i in range(m + 1):
        dp[i][0] = i
    for j in range(n + 1):
        dp[0][j] = j

    # Fill the table
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            if s1[i - 1] == s2[j - 1]:
                dp[i][j] = dp[i - 1][j - 1]
            else:
                dp[i][j] = 1 + min(dp[i - 1][j], dp[i][j - 1], dp[i - 1][j - 1])

    return dp[m][n]


# Edit Distance: two rows only
def edit_distance_optimized(s1, s2):
    """
    Space-optimized edit distance: only two rows.

    Time:  O(m × n) - unchanged
    Space: O(n)     - reduced from O(m × n)
    Trade-off: cannot reconstruct the sequence of edits
    """
    m, n = len(s1), len(s2)

    # Only need the previous row and the current row
    prev_row = list(range(n + 1))
    curr_row = [0] * (n + 1)

    for i in range(1, m + 1):
        curr_row[0] = i
        for j in range(1, n + 1):
            if s1[i - 1] == s2[j - 1]:
                curr_row[j] = prev_row[j - 1]
            else:
                curr_row[j] = 1 + min(
                    prev_row[j],      # Delete
                    curr_row[j - 1],  # Insert
                    prev_row[j - 1],  # Replace
                )
        prev_row, curr_row = curr_row, prev_row

    return prev_row[n]


# For s1, s2 each of length 10,000:
# Full table: 100M cells × 4 bytes = 400 MB
# Optimized:  10K cells × 4 bytes × 2 = 80 KB
# Reduction factor: ~5000x memory savings
```

Space optimization sacrifices the ability to reconstruct solutions. For edit distance, if you need the actual sequence of edits (not just the count), keep the full table. If you only need the final value, optimize aggressively. Sometimes you can use Hirschberg's algorithm to get both: O(n) space AND solution reconstruction through divide-and-conquer.
When data is too large to fit in memory, we must process it in a streaming fashion: reading elements one at a time, maintaining minimal state, and producing results without storing the entire input.
The Streaming Paradigm:
Instead of: Load all data → Process → Output
Streaming: Read one item → Update state → Repeat → Output from state
Streaming algorithms sit at the space-frugal end of the spectrum: each element is processed exactly once and only compact summary state is kept, giving up the ability to revisit past data or answer arbitrary queries about it.
```python
import random


class StreamingStats:
    """
    Compute statistics in O(1) space as data streams in.

    Trade-off: cannot compute the median (that would need sorted data)
               and cannot answer arbitrary queries about past data,
               but can process an unlimited data volume.
    """

    def __init__(self):
        self.count = 0
        self.sum = 0
        self.sum_sq = 0
        self.min_val = float('inf')
        self.max_val = float('-inf')

    def update(self, value):
        """Process one value in O(1) time and space."""
        self.count += 1
        self.sum += value
        self.sum_sq += value * value
        self.min_val = min(self.min_val, value)
        self.max_val = max(self.max_val, value)

    def mean(self):
        return self.sum / self.count if self.count > 0 else 0

    def variance(self):
        if self.count < 2:
            return 0
        mean = self.mean()
        return (self.sum_sq / self.count) - (mean * mean)

    def range(self):
        return self.max_val - self.min_val


# Process a 100GB file with O(1) memory
def process_huge_file(filename):
    stats = StreamingStats()
    with open(filename) as f:
        for line in f:
            value = float(line.strip())
            stats.update(value)  # O(1) per line
    return stats.mean(), stats.variance()


class ReservoirSampling:
    """
    Maintain a uniform random sample of k items from a stream.

    Problem: select k random items from a stream of unknown length.
    We cannot store all items, but still need a uniform random sample.

    Space: O(k) regardless of stream length
    """

    def __init__(self, k):
        self.k = k
        self.reservoir = []
        self.count = 0

    def update(self, item):
        self.count += 1
        if self.count <= self.k:
            # Fill the reservoir initially
            self.reservoir.append(item)
        else:
            # Replace an existing entry with decreasing probability k/count
            j = random.randint(1, self.count)
            if j <= self.k:
                self.reservoir[j - 1] = item

    def get_sample(self):
        return self.reservoir


# Sample 1000 items uniformly from a billion-item stream
def sample_from_huge_dataset(data_iterator, sample_size=1000):
    sampler = ReservoirSampling(sample_size)
    for item in data_iterator:
        sampler.update(item)  # O(1) per item
    return sampler.get_sample()  # Returns up to sample_size items
```

When exact answers require too much space, probabilistic structures offer space-efficient approximations. Bloom filters test set membership with possible false positives. HyperLogLog estimates cardinality. Count-Min Sketch estimates frequency. These trade exactness for dramatic space savings—often reducing gigabytes to kilobytes with <1% error.
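As an illustration of this idea, here is a minimal Bloom filter sketch (not a tuned implementation) that derives bit positions by salting `hashlib.sha256`; the bit-array size and hash count are arbitrary example parameters:

```python
import hashlib


class SimpleBloomFilter:
    """Minimal Bloom filter sketch: set membership in a fixed-size bit array.

    May report false positives, never false negatives. Real deployments
    size the bit array and hash count from the expected number of items
    and the target error rate.
    """

    def __init__(self, num_bits=8192, num_hashes=4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        # Derive several bit positions by salting one hash function
        for salt in range(self.num_hashes):
            digest = hashlib.sha256(f"{salt}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, item):
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(item))


bloom = SimpleBloomFilter()
bloom.add("alice@example.com")
print(bloom.might_contain("alice@example.com"))  # True
print(bloom.might_contain("bob@example.com"))    # False (with high probability)
```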
One of the most common space-time trade-off decisions is whether to precompute results or compute them on demand. This is central to database indexing, search engines, and caching layers.
The Precomputation Decision:
Precompute when queries are frequent, the underlying data changes rarely, and the full set of results fits within your memory budget.

Compute on-demand when queries are infrequent, the data changes often (making precomputed results stale), or the space of possible results is too large to store.
```python
import heapq


# Example: shortest paths in a graph
# (graph is an adjacency list: graph[u] is a list of (v, weight) pairs)

# PRECOMPUTE: Floyd-Warshall all-pairs
def precompute_all_pairs(graph):
    """
    Compute all-pairs shortest paths upfront.

    Precompute: O(V³) time
    Space:      O(V²) for the distance matrix
    Query:      O(1) for any pair

    Best when: many queries, and the graph doesn't change often.
    """
    n = len(graph)
    dist = [[float('inf')] * n for _ in range(n)]

    # Initialize from edges
    for u in range(n):
        dist[u][u] = 0
        for v, w in graph[u]:
            dist[u][v] = w

    # Floyd-Warshall
    for k in range(n):
        for i in range(n):
            for j in range(n):
                dist[i][j] = min(dist[i][j], dist[i][k] + dist[k][j])

    return dist


# ON-DEMAND: Dijkstra per query
def on_demand_shortest_path(graph, source, target):
    """
    Compute a shortest path only when needed.

    Precompute: none
    Space:      O(V) for single-source distances
    Query:      O((V + E) log V) per query (Dijkstra)

    Best when: few queries, or the graph changes often.
    """
    dist = {source: 0}
    heap = [(0, source)]

    while heap:
        d, u = heapq.heappop(heap)
        if u == target:
            return d  # Early termination
        if d > dist.get(u, float('inf')):
            continue
        for v, w in graph[u]:
            new_dist = d + w
            if new_dist < dist.get(v, float('inf')):
                dist[v] = new_dist
                heapq.heappush(heap, (new_dist, v))

    return dist.get(target, float('inf'))


# HYBRID: precompute from important nodes only
def precompute_landmark_distances(graph, landmarks):
    """
    Precompute distances from/to k landmark nodes.

    Precompute: O(k × (V + E) log V) - k Dijkstras
    Space:      O(k × V) - much less than O(V²)
    Query:      use the triangle inequality for bounds, Dijkstra for exact

    Trade-off: approximate or selective precomputation.
    (dijkstra_all and reverse_graph are assumed helpers: a full
    single-source Dijkstra and an edge-reversal routine.)
    """
    from_landmark = {}
    to_landmark = {}

    for landmark in landmarks:
        from_landmark[landmark] = dijkstra_all(graph, landmark)
        to_landmark[landmark] = dijkstra_all(reverse_graph(graph), landmark)

    return from_landmark, to_landmark


# Analysis for V = 10,000 nodes, E = 50,000 edges:
#
# Full precompute:
#   Time:  O(V³) = 10^12 operations ≈ hours
#   Space: O(V²) = 100M entries = 400 MB (4-byte entries)
#   Query: O(1)
#
# On-demand:
#   Time:  none upfront
#   Space: O(V) per query = 40 KB
#   Query: O((V + E) log V) ≈ 1M operations ≈ 1 ms
#
# For 100 queries: on-demand wins (~100 ms vs hours)
# For 10M queries: precompute might win (assuming the setup cost amortizes)
```

In databases, this trade-off is called "materialized views": precomputed query results stored as tables. They speed up reads but must be updated when the underlying data changes. The decision depends on the read/write ratio: high reads favor materialization; high writes favor on-demand computation.
Effective decision-making requires quantifying trade-offs. Abstract comparisons ("uses more memory") aren't enough—you need numbers to make informed choices.
The Quantification Framework:
For any space-time trade-off, quantify the extra space consumed, the time saved per operation, the number of operations over which the precomputation or cache amortizes, and the break-even point at which one approach overtakes the other.
Example Analysis: Hash Map vs. Sorted Array for Lookups
```python
import math


def analyze_tradeoff(n, num_queries):
    """
    Compare a hash map vs a sorted array for n items and Q queries.
    (Operation counts are an abstract model, not wall-clock measurements.)
    """
    # === APPROACH 1: HASH MAP ===
    # Space: ~64 bytes per entry (key + value + overhead)
    hash_space = n * 64
    # Time: O(n) to build, O(1) per query
    hash_build = n          # operations
    hash_query = 1          # operations per query
    hash_total = hash_build + num_queries * hash_query

    # === APPROACH 2: SORTED ARRAY ===
    # Space: ~16 bytes per entry (key + value, no overhead)
    array_space = n * 16
    # Time: O(n log n) to sort, O(log n) per query
    array_build = n * math.log2(max(n, 2))   # sort
    array_query = math.log2(max(n, 2))       # binary search per query
    array_total = array_build + num_queries * array_query

    print(f"For n={n:,} items, Q={num_queries:,} queries:")
    print(f"  Hash Map:     {hash_space / 1e6:.1f} MB, {hash_total / 1e6:.1f}M ops")
    print(f"  Sorted Array: {array_space / 1e6:.1f} MB, {array_total / 1e6:.1f}M ops")

    # Break-even analysis:
    #   hash_total < array_total
    #   n + Q < n log n + Q log n
    #   Q (1 - log n) < n (log n - 1)
    # For n > 2 the hash map needs fewer operations in this model;
    # the sorted array's advantage is its smaller memory footprint.
    if hash_total < array_total:
        print("  Winner: Hash Map (faster)")
    else:
        print("  Winner: Sorted Array (less memory, acceptable speed)")


# Example runs (figures approximate):
#
# analyze_tradeoff(1_000_000, 100)
#   For n=1,000,000 items, Q=100 queries:
#     Hash Map:     64.0 MB, 1.0M ops
#     Sorted Array: 16.0 MB, 20.0M ops  (sort dominates)
#     Winner: Hash Map (faster)
#
# analyze_tradeoff(1_000_000, 1_000_000)
#   For n=1,000,000 items, Q=1,000,000 queries:
#     Hash Map:     64.0 MB, 2.0M ops
#     Sorted Array: 16.0 MB, 40.0M ops
#     Winner: Hash Map (faster despite 4x more memory)
#
# analyze_tradeoff(1_000, 10)
#   For n=1,000 items, Q=10 queries:
#     Hash Map:     0.1 MB, 0.001M ops
#     Sorted Array: 0.016 MB, 0.01M ops
#     Winner: Hash Map (both trivial, but the hash map is faster)
```

| Scenario | Favor Space | Favor Time |
|---|---|---|
| Memory-constrained systems | ✓ Recompute, stream | Only if fits in budget |
| Latency-critical applications | Only if latency acceptable | ✓ Precompute, cache |
| Infrequent queries | ✓ Compute on demand | Overhead wasted |
| High query volume | Compute cost dominates | ✓ Amortize precompute |
| Frequently changing data | ✓ On-demand avoids invalidation | Cache invalidation complexity |
| Static data | Precompute once, use forever | ✓ Full precomputation |
Theoretical analysis guides decisions, but real systems have cache effects, memory bandwidth limits, and other factors. Prototype both approaches and measure. A theoretically slower algorithm might be faster due to better cache utilization. Always validate with benchmarks on realistic data.
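A minimal measurement sketch using Python's built-in `timeit`; the Fibonacci functions and repeat counts here are illustrative placeholders for whichever two approaches you are comparing:

```python
import timeit
from functools import lru_cache


def fib_plain(n):
    return n if n <= 1 else fib_plain(n - 1) + fib_plain(n - 2)


@lru_cache(maxsize=None)
def fib_memo(n):
    return n if n <= 1 else fib_memo(n - 1) + fib_memo(n - 2)


if __name__ == "__main__":
    # Measure both approaches on the same input; repeat to smooth out noise.
    plain = timeit.timeit(lambda: fib_plain(25), number=20)
    fib_memo.cache_clear()  # start the cached version from a cold cache
    memo = timeit.timeit(lambda: fib_memo(25), number=20)
    print(f"naive:    {plain:.4f}s for 20 runs")
    print(f"memoized: {memo:.6f}s for 20 runs")
```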
Space-time trade-offs are fundamental to algorithmic design. The ability to consciously choose where on the spectrum to position your solution—and to change that position as requirements evolve—is a hallmark of engineering maturity.
Key Trade-off Techniques Mastered:

• Memoization and caching (with LRU/TTL eviction) to trade memory for repeated work
• Space-optimized DP that keeps only the rows or values still needed
• Streaming and sampling algorithms that process unbounded data in bounded memory
• Precompute vs. on-demand decisions, including hybrid landmark-style strategies
• Quantifying trade-offs and finding break-even points before committing
The Trade-off Mindset:
Every algorithm exists in a space-time context. When you see an algorithm using O(n²) space, ask: "Can we reduce this by recomputing?" When you see slow queries, ask: "Can we precompute or cache?" The trade-off is always available—the question is whether it's worthwhile given your constraints.
Module Complete:
You've now mastered the core optimization techniques: recognizing opportunities, preprocessing data, early termination, pruning search spaces, and navigating space-time trade-offs. These techniques form the foundation of efficient algorithm design and will serve you throughout your engineering career.
Congratulations! You've completed the Optimization Techniques module. You now possess a systematic toolkit for making algorithms faster and more efficient. From recognizing when optimization is needed, through preprocessing, early termination, and pruning, to the fundamental space-time trade-off—you have the knowledge to transform working solutions into optimal ones. Apply these techniques deliberately, measure their impact, and continue refining your optimization instincts through practice.