When someone asks "How fast is this algorithm?" the honest answer is: "It depends."
Consider linear search—scanning an array for a target value. If the target is the first element, we find it instantly. If it's the last element, we scan the entire array. If it doesn't exist, we scan everything and find nothing. Same algorithm, same code, vastly different performance.
Saying "linear search is O(n)" captures something important, but it's incomplete. O(n) describes the worst case—what if we have to check every element? But real inputs aren't always worst-case. Sometimes we get lucky.
Case analysis provides the complete picture. Instead of a single complexity number, we characterize how an algorithm behaves across the spectrum of possible inputs: the best luck can bring, the worst that can happen, and what we typically expect.
By the end of this page, you will understand best-case, average-case, and worst-case analysis in depth: how to identify each for any algorithm, what drives the differences between them, and how to communicate algorithm performance with precision rather than oversimplification.
For any algorithm, performance can vary based on the specific input. Case analysis systematically captures this variation by examining three scenarios:
Best Case (Ω notation):
The minimum resources (time or space) an algorithm requires across all possible inputs of size n.
Best case represents the most favorable input—the scenario where everything aligns perfectly for the algorithm. It answers: "What's the absolute best performance we could hope for?"
Worst Case (O notation):
The maximum resources an algorithm requires across all possible inputs of size n.
Worst case represents the most adversarial input—the scenario that makes the algorithm work hardest. It answers: "What's the absolute worst performance we must prepare for?"
Average Case (Θ notation, typically):
The expected resources an algorithm requires across a probability distribution of inputs of size n.
Average case represents typical performance—what we expect on "normal" inputs. It answers: "What performance should we expect in practice?"
| Case | Question It Answers | Input Type | Notation Often Used |
|---|---|---|---|
| Best Case | What's the minimum possible work? | Most favorable input | Ω (Omega) |
| Worst Case | What's the maximum possible work? | Most adversarial input | O (Big-O) |
| Average Case | What work do we expect typically? | Random/typical input | Θ (Theta) for expected value |
Why three cases matter:
Different contexts demand different analyses: real-time systems need worst-case guarantees, throughput-oriented services care about what happens on average, and adaptive algorithms are chosen precisely to exploit common best-case inputs.
No single case tells the whole story. Expert analysis considers all three.
Don't confuse cases with bounds. O, Ω, and Θ are notations for describing bounds—upper, lower, and tight. Each case (best, worst, average) can have its own set of bounds. For example, quicksort's worst case is Θ(n²)—that's a tight bound on worst-case behavior.
What best case captures:
Best case describes the minimum resources an algorithm uses when given the most favorable possible input. It establishes a floor—performance cannot be better than best case.
How to identify best-case inputs:
For any algorithm, ask: "What input would let this finish as fast as possible?"
Detailed example: Linear search best case
```python
def linear_search(arr, target):
    for i in range(len(arr)):
        if arr[i] == target:
            return i   # Found it!
    return -1          # Not found
```
Best case scenario: target == arr[0]
No matter how large the array, if the target is first, we finish in constant time.
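To make the gap concrete, here is a small sketch using an instrumented copy of the function above (the linear_search_steps helper and its step counter are our additions) that reports how many elements each call examines:

```python
def linear_search_steps(arr, target):
    """Like linear_search, but also report how many elements were examined."""
    steps = 0
    for i in range(len(arr)):
        steps += 1
        if arr[i] == target:
            return i, steps
    return -1, steps

data = list(range(1_000))
print(linear_search_steps(data, 0))     # best case: (0, 1), a single comparison
print(linear_search_steps(data, 999))   # last element: (999, 1000)
print(linear_search_steps(data, -5))    # not found: (-1, 1000), scans everything
```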
Why best case matters:
Establishes fundamental limits: If even the best case requires linear time, then the average and worst cases must take at least linear time as well. Best case is the floor that every other case sits on.
Optimistic planning: In some systems, best-case behavior is common enough to optimize for.
Algorithm comparison: If Algorithm A's best case is O(n) but Algorithm B's best case is O(1), B might be preferred when favorable inputs are common.
Best case is about what could happen with fortunate inputs, not what will happen. Designing systems around best-case assumptions is dangerous. Just because linear search can complete in O(1) doesn't mean it will. Your target might be last, or not present at all.
When best case analysis is useful: when favorable inputs are genuinely common, for example nearly-sorted data fed to an adaptive sort, best-case behavior is worth measuring and exploiting.
When best case analysis is misleading: when it is quoted as if it were typical performance, or used to size systems that must also survive unfavorable inputs.
What worst case captures:
Worst case describes the maximum resources an algorithm uses when given the most adversarial possible input. It establishes a ceiling—performance cannot be worse than worst case.
Why worst case dominates analysis:
Worst case is the default in computer science for good reasons:
Guarantees: Worst case provides a guarantee. "This algorithm never takes more than O(n²) time" is actionable.
Safety: Systems must handle worst-case loads. Servers sized for average load crash during peaks.
Adversarial thinking: In security, attackers deliberately trigger worst-case behavior. Denial-of-service attacks often exploit slow algorithmic paths.
Simplicity: Worst case is often easier to analyze than average case (no probability distributions needed).
How to identify worst-case inputs:
Ask: "What input would make this algorithm work as hard as possible?"
| Algorithm | Worst-Case Input | Why It's Worst | Worst-Case Complexity |
|---|---|---|---|
| Linear search | Target not in array | Must check every element | Θ(n) |
| Binary search | Target not in array | Must halve until empty | Θ(log n) |
| Bubble sort | Array in reverse order | Maximum swaps needed | Θ(n²) |
| Quicksort (bad pivot) | Already sorted array | Partitions are 0:n-1 splits | Θ(n²) |
| Insertion sort | Array in reverse order | Each insert goes to position 0 | Θ(n²) |
| Hash table lookup (chaining) | All keys hash to same bucket | Degenerates to linked list | Θ(n) |
Detailed example: Quicksort worst case
Quicksort partitions an array around a pivot, recursively sorting partitions:
```python
def quicksort(arr):
    if len(arr) <= 1:
        return arr
    pivot = arr[0]  # First element as pivot
    left = [x for x in arr[1:] if x < pivot]
    right = [x for x in arr[1:] if x >= pivot]
    return quicksort(left) + [pivot] + quicksort(right)
```
Best case: Pivot consistently divides array in half → T(n) = 2T(n/2) + O(n) = O(n log n)
Worst case: Array is already sorted (or reverse sorted). Every partition puts zero elements on one side → T(n) = T(n-1) + O(n) = O(n²)
The same algorithm goes from O(n log n) to O(n²) depending on input—a dramatic difference!
Knowing worst-case inputs enables mitigation. Quicksort's O(n²) worst case on sorted input can be avoided by: (1) Random pivot selection, (2) Median-of-three pivot selection, (3) Introspective sort (switch to heapsort when recursion depth suggests worst case). Analysis drives engineering.
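As a sketch of mitigation (1), here is the same quicksort with a random pivot; the three-way partition and the function name are our choices, not a canonical implementation:

```python
import random

def quicksort_random(arr):
    if len(arr) <= 1:
        return arr
    pivot = random.choice(arr)  # random pivot: sorted input is no longer a reliable worst case
    left = [x for x in arr if x < pivot]
    equal = [x for x in arr if x == pivot]
    right = [x for x in arr if x > pivot]
    return quicksort_random(left) + equal + quicksort_random(right)

# An already-sorted input now splits roughly in half on average instead of 0 : n-1.
print(quicksort_random(list(range(20))))
```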
When worst case matters most:
Real-time systems: Aircraft control software must respond within a deadline regardless of input. Worst-case guarantees are mandatory.
SLA-bound services: "99.9th percentile latency < 100ms" is effectively worst-case thinking at the tail.
Adversarial environments: Attackers craft inputs to trigger worst-case behavior. Hash table denial-of-service attacks exploit predictable hash collisions.
Resource planning: How much memory to allocate? How many servers to provision? Worst case answers these.
The worst-case mindset:
Engineers often think: "But worst case is unlikely!" True—but unlikely events happen. Across millions of operations, even a 0.01% probability yields hundreds or thousands of worst-case instances. Design for the likely case; prepare for the worst one.
What average case captures:
Average case describes the expected resources an algorithm uses over a probability distribution of inputs. Unlike best and worst cases (which are specific inputs), average case is a weighted average across all possible inputs.
The mathematical formulation:
Average Time = Σ (Probability of input I × Time to process input I)
For all possible inputs I of size n.
The critical assumption:
Average-case analysis requires assuming a probability distribution over inputs. Different assumptions yield different results: a uniform distribution over all inputs, a distribution skewed toward nearly-sorted data, and the distribution of your real production traffic can each produce a different "average."
The result depends entirely on what distribution you assume.
Detailed example: Linear search average case
Assume: an array of n elements and a target whose location (or absence) we know nothing about in advance.
Simplified analysis (target is in array, position uniformly random):
Average comparisons = (1 + 2 + 3 + ... + n) / n = [n(n+1)/2] / n = (n+1)/2 ≈ n/2
On average, we search half the array. Average case is Θ(n)—same as worst case asymptotically, but with half the constant factor.
Including the "not found" case:
If the target has a 50% chance of being in the array: Average ≈ 0.5 × (n+1)/2 + 0.5 × n = (3n+1)/4 ≈ 3n/4
Still Θ(n), but the constant changes with assumptions.
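A quick Monte Carlo sketch (the comparison counter is our own helper; exact numbers vary run to run) checks both figures empirically under the stated assumptions:

```python
import random

def comparisons(arr, target):
    """Count how many elements linear search examines before stopping."""
    count = 0
    for x in arr:
        count += 1
        if x == target:
            break
    return count

n, trials = 1_000, 10_000
arr = list(range(n))

# Target always present, position uniformly random: expect about (n+1)/2.
present = sum(comparisons(arr, random.randrange(n)) for _ in range(trials)) / trials
print(f"always present: {present:.1f}  (theory ≈ {(n + 1) / 2})")

# Target present only half the time: expect about 3n/4.
mixed = sum(
    comparisons(arr, random.randrange(n) if random.random() < 0.5 else -1)
    for _ in range(trials)
) / trials
print(f"50% misses:     {mixed:.1f}  (theory ≈ {3 * n / 4})")
```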
| Algorithm | Average Case | Worst Case | Key Insight |
|---|---|---|---|
| Linear search | Θ(n) | Θ(n) | Same asymptotically; average scans half |
| Binary search | Θ(log n) | Θ(log n) | Same; binary search is consistent |
| Quicksort | Θ(n log n) | Θ(n²) | Dramatically different; worst case is far slower than average |
| Insertion sort | Θ(n²) | Θ(n²) | Same; consistently quadratic |
| Hash table lookup | Θ(1) | Θ(n) | Dramatically different; a good hash function makes Θ(1) the norm |
Quicksort averages O(n log n) even though worst case is O(n²). Why? Worst case requires consistently bad pivots. With random input (or random pivot selection), the probability of consistently bad pivots is astronomically low. The expected partition quality is good enough that we get O(n log n) on average.
Challenges with average-case analysis:
Distribution assumptions may be wrong: Real inputs often aren't uniformly random. If most searches target recently added items, linear search from the end might be better.
Mathematical complexity: Average-case analysis requires probability theory. For complex algorithms, the analysis can be extremely difficult.
Misleading expectations: Average hides variance. An algorithm with a good average can still be drastically slower on a small fraction of inputs, causing occasional severe slowdowns.
Doesn't guarantee anything: Average is not a bound. Individual operations can vastly exceed the average.
When average case matters most: in high-volume systems where total throughput, not any single operation, determines cost, and when real inputs genuinely resemble the assumed distribution.
Let's see how the three cases combine to give a complete picture of algorithm performance.
Visualization: Performance spectrum
Imagine plotting time (y-axis) against different inputs (x-axis, sorted by time taken):
```
Time
 |
 |         __________________  Worst case ceiling
 |        /
 |       /
 |      /
 | ~~~~/~~~~~~~~~~~~~~~~~~~~~  Average case (expected)
 |    /
 |   /
 |  /
 | /
 |/___________________________ Best case floor
 +----------------------------> Inputs (sorted by time)
```
Best case is the floor—no input does better. Worst case is the ceiling—no input does worse. Average case is somewhere in between, weighted by input probability.
Case study: Complete analysis of insertion sort
```python
def insertion_sort(arr):
    for i in range(1, len(arr)):
        key = arr[i]
        j = i - 1
        while j >= 0 and arr[j] > key:
            arr[j + 1] = arr[j]
            j -= 1
        arr[j + 1] = key
```
Best case: Array is already sorted. The inner while loop never executes, so each element costs O(1) and the total is Θ(n).
Worst case: Array is in reverse order. Every key is shifted all the way to the front, for 1 + 2 + ... + (n-1) = Θ(n²) shifts.
Average case: Random input (each permutation equally likely). Each key shifts past about half of the already-sorted prefix, which is still Θ(n²).
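A small experiment (an instrumented copy of the function above; the shift counter is our addition) makes the three cases visible:

```python
import random

def insertion_sort_shifts(arr):
    """Insertion sort that also counts how many element shifts it performs."""
    arr = list(arr)
    shifts = 0
    for i in range(1, len(arr)):
        key = arr[i]
        j = i - 1
        while j >= 0 and arr[j] > key:
            arr[j + 1] = arr[j]
            shifts += 1
            j -= 1
        arr[j + 1] = key
    return shifts

n = 1_000
print("sorted (best):    ", insertion_sort_shifts(range(n)))                    # 0 shifts
print("reversed (worst): ", insertion_sort_shifts(range(n, 0, -1)))             # n(n-1)/2 = 499,500
print("random (average): ", insertion_sort_shifts(random.sample(range(n), n)))  # about n(n-1)/4
```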
Complete characterization:
| Algorithm | Best Case | Average Case | Worst Case | Notable Pattern |
|---|---|---|---|---|
| Linear Search | Θ(1) | Θ(n) | Θ(n) | Best case dramatically better |
| Binary Search | Θ(1) | Θ(log n) | Θ(log n) | Consistent performance |
| Insertion Sort | Θ(n) | Θ(n²) | Θ(n²) | Excellent on nearly-sorted |
| Merge Sort | Θ(n log n) | Θ(n log n) | Θ(n log n) | Always consistent |
| Quicksort | Θ(n log n) | Θ(n log n) | Θ(n²) | Worst case rare in practice |
| Heap Sort | Θ(n log n) | Θ(n log n) | Θ(n log n) | Consistent but slower constant |
| Hash Table Insert | Θ(1) | Θ(1) | Θ(n) | Average is what matters |
When you see 'Quicksort is O(n log n),' understand this usually means average case. When you see 'Merge sort is O(n log n),' this holds for all cases. When someone says 'Hash tables are O(1),' ask: 'In what case?' Training yourself to ask 'under what conditions?' makes you a more precise analyst.
Not all algorithms have different best, average, and worst cases. Understanding what creates variance helps you predict when case analysis matters.
Algorithms with no variance (all cases equal):
Some algorithms do the same work regardless of input: merge sort always splits and merges the same way, heap sort always builds and drains a heap, and summing an array always touches every element.
These algorithms are input-oblivious—their structure doesn't respond to input characteristics.
Sources of case variance:
1. Early termination conditions:
```python
# Variance from early termination
def contains(array, target):
    for item in array:
        if item == target:
            return True  # Can exit early!
    return False
```
If target is found early, we stop. If not found, we scan everything. Early termination creates best-case O(1) vs worst-case O(n).
2. Data-dependent branching:
```python
# Variance from branching
if condition_from_input:
    expensive_operation()  # O(n²)
else:
    cheap_operation()      # O(n)
```
The input determines which branch executes, creating different complexities.
3. Recursive partition quality:
In divide-and-conquer algorithms, how the input divides matters: balanced splits (roughly equal halves) keep the recursion about log n deep, while lopsided splits (one element versus the rest) push it to depth n.
The input directly affects recursion depth and work distribution.
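To see the effect with the first-element-pivot quicksort shown earlier, here is a small depth-measuring sketch (our own helper; n is kept at 200 to stay well inside Python's default recursion limit):

```python
import random

def quicksort_depth(arr, depth=0):
    """Maximum recursion depth reached by the first-element-pivot scheme."""
    if len(arr) <= 1:
        return depth
    pivot = arr[0]
    left = [x for x in arr[1:] if x < pivot]
    right = [x for x in arr[1:] if x >= pivot]
    return max(quicksort_depth(left, depth + 1), quicksort_depth(right, depth + 1))

n = 200
print("sorted input depth:", quicksort_depth(list(range(n))))              # about n: every split is 0 : n-1
print("random input depth:", quicksort_depth(random.sample(range(n), n)))  # roughly a small multiple of log n
```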
High variance isn't always bad. Quicksort's variance means it's usually O(n log n) despite O(n²) worst case. Hash tables' variance means usually O(1) despite O(n) worst case. The question is: how likely is the bad case, and can we afford it when it happens?
Case analysis isn't academic—it directly guides engineering decisions. Let's see how professionals apply this framework.
Scenario 1: Choosing a sorting algorithm
Problem: You need to sort user-uploaded data. Users sometimes upload already-sorted files.
Analysis: an adaptive sort hits its Θ(n) best case on already-sorted input, which these users sometimes provide; a non-adaptive O(n log n) sort does the same work regardless of order.
Decision: Use Timsort (Python's default) or similar adaptive algorithm. When best case is common, exploit it.
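A rough timing sketch (machine-dependent; Python's built-in sorted uses Timsort, which is adaptive) shows the best case being exploited:

```python
import random
import timeit

n = 100_000
already_sorted = list(range(n))
shuffled = random.sample(range(n), n)

t_sorted = timeit.timeit(lambda: sorted(already_sorted), number=20)
t_shuffled = timeit.timeit(lambda: sorted(shuffled), number=20)

print(f"already sorted input: {t_sorted:.3f}s")   # typically several times faster
print(f"shuffled input:       {t_shuffled:.3f}s")
```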
Scenario 2: Web server response times
Problem: Your API must respond in < 100ms for 99.9% of requests.
Analysis: average latency can look healthy while the slowest 0.1% of requests blow the budget; the requirement is about the tail of the distribution, not its mean.
Decision: Average isn't enough. You need to analyze tail latency (effectively worst case for the tail). Optimize the slow path, not just the common path.
Scenario 3: Hash table design
Problem: Building a cache. Should you use chaining or open addressing?
Analysis:
| Aspect | Chaining | Open Addressing |
|---|---|---|
| Average lookup | O(1) | O(1) |
| Worst case | O(n) if all hash to same bucket | O(n) if table nearly full |
| Worst trigger | Adversarial input | High load factor |
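To make the "adversarial input" trigger concrete, here is a toy chaining table (the class and the deliberately constant hash function are ours) in which every key lands in one bucket and lookups degrade to a linear scan:

```python
class ChainedHashTable:
    """Toy chaining hash table with a pluggable hash function."""
    def __init__(self, buckets=64, hash_fn=hash):
        self.buckets = [[] for _ in range(buckets)]
        self.hash_fn = hash_fn

    def insert(self, key, value):
        self.buckets[self.hash_fn(key) % len(self.buckets)].append((key, value))

    def lookup(self, key):
        bucket = self.buckets[self.hash_fn(key) % len(self.buckets)]
        probes = 0
        for k, v in bucket:
            probes += 1
            if k == key:
                return v, probes
        return None, probes

good = ChainedHashTable()                       # normal hash: short chains
bad = ChainedHashTable(hash_fn=lambda key: 0)   # adversarial: every key collides
for i in range(1_000):
    good.insert(f"key{i}", i)
    bad.insert(f"key{i}", i)

print("good hash probes:", good.lookup("key999")[1])  # a short chain, roughly 1000/64 entries
print("bad hash probes: ", bad.lookup("key999")[1])   # 1000: a full linear scan
```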
Decision: if the cache is exposed to untrusted keys, prefer a hash function attackers cannot predict (or a structure whose worst case they cannot trigger); if you control the inputs, either scheme works, provided open addressing keeps its load factor low.
Scenario 4: Real-time game physics
Problem: Physics simulation must complete in 16ms (60 FPS) every frame.
Analysis: a frame that occasionally takes 50ms is a visible stutter; what matters here is the worst frame, not the average frame.
Decision: prefer algorithms with tight worst-case bounds, or cap the work done per frame, even if an alternative has a better average but an unbounded worst case.
Rule of thumb: Optimize for average case, but protect against worst case. Know your worst case, how likely it is, and what happens when it occurs. If worst case is rare and recoverable, accept it. If worst case is catastrophic or exploitable, mitigate it.
Algorithm performance isn't a single number—it's a spectrum. Case analysis provides the framework for understanding this spectrum completely.
What's next:
Worst-case analysis dominates engineering practice for good reasons. The next page explores why worst case is often preferred—examining the engineering, safety, and business considerations that make worst-case thinking the default in professional software development.
You now understand best-case, average-case, and worst-case analysis as complementary views of algorithm performance. You can identify what drives each case, compare algorithms across all three cases, and apply case thinking to real engineering decisions. Next, we'll explore why worst-case analysis is the engineering default.