Analyzing the time complexity of rotated array search requires more nuance than a simple "O(log n)" answer. The complexity depends on multiple factors: whether the array is actually rotated, whether duplicates are present, and even the specific input distribution.
A strong software engineer doesn't just memorize complexity bounds—they understand why those bounds hold, when they might degrade, and how to communicate this nuance to colleagues and interviewers. This page provides that complete picture.
By the end of this page, you will be able to rigorously analyze and explain the time and space complexity of rotated array search across all scenarios. You'll understand recurrence relations, be able to prove complexity bounds, and articulate the factors that affect performance.
Let's start with a comprehensive summary, then drill into each case with rigorous analysis.
| Scenario | Best Case | Average Case | Worst Case | Space |
|---|---|---|---|---|
| Distinct elements | O(1) | O(log n) | O(log n) | O(1) |
| With duplicates | O(1) | O(log n)* | O(n) | O(1) |
| Not rotated (sorted) | O(1) | O(log n) | O(log n) | O(1) |
| Linear search baseline | O(1) | O(n) | O(n) | O(1) |
*Average case for duplicates assumes duplicates are limited/sparse. If duplicates dominate, average case approaches O(n).
Key observations:
Distinct elements maintain logarithmic guarantee: The rotation doesn't affect complexity because we can always eliminate half the search space.
Duplicates fundamentally change the worst case: When we can't determine which half is sorted, we fall back to linear elimination.
Best case is always O(1): the target may land exactly at the middle position and be found on the first comparison.
Space is always O(1): Iterative implementation uses only a few variables regardless of input size.
Let's prove that search in a rotated sorted array with distinct elements runs in O(log n) time. We'll use the recurrence relation approach, which is the gold standard for analyzing divide-and-conquer algorithms.
Setting up the recurrence:
Let T(n) be the time to search in an array of n elements.
At each step, we do a constant amount of work: compare nums[mid] with the target, determine which half is sorted, and check whether the target lies within that half's range. We then discard half of the remaining elements.
This gives us the recurrence:
T(n) = T(n/2) + O(1)
This is the classic binary search recurrence.
Solving the recurrence:
Method 1: Unrolling
T(n) = T(n/2) + c
= T(n/4) + c + c
= T(n/8) + c + c + c
= ...
= T(n/2^k) + k·c
The recursion ends when n/2^k = 1, i.e., when k = log₂(n).
So: T(n) = T(1) + c·log₂(n) = O(log n)
Method 2: Master Theorem
For T(n) = aT(n/b) + f(n), we have a = 1, b = 2, and f(n) = O(1).
Compare f(n) = O(1) to n^(log_b a) = n^(log₂ 1) = n^0 = 1.
Since f(n) = Θ(n^(log_b a)), we're in Case 2 of the Master Theorem: T(n) = Θ(n^(log_b a) · log n) = Θ(log n)
Conclusion: T(n) = O(log n) for distinct elements. ∎
The key insight is that rotation doesn't prevent us from halving the search space. We always identify one sorted half and make a definitive decision. Whether the array is rotated or not, we eliminate n/2 elements each iteration. The rotation just changes HOW we decide which half to search, not WHETHER we can eliminate half.
```python
def count_iterations_distinct(n: int) -> int:
    """
    Simulates binary search to count iterations.
    Proves O(log n) empirically by showing iterations ~ log₂(n).
    """
    low, high = 0, n - 1
    iterations = 0
    while low <= high:
        iterations += 1
        mid = (low + high) // 2
        # Simulate going right (doesn't matter for iteration count)
        low = mid + 1
    return iterations

def verify_log_n_complexity():
    """Verify that iterations grow logarithmically with n."""
    import math
    print("Verifying O(log n) for distinct elements")
    print("=" * 50)
    print(f"{'n':>12} {'Iterations':>12} {'log₂(n)':>12} {'Ratio':>12}")
    print("-" * 50)
    sizes = [10, 100, 1000, 10000, 100000, 1000000]
    for n in sizes:
        iters = count_iterations_distinct(n)
        log_n = math.log2(n)
        ratio = iters / log_n if log_n > 0 else 0
        print(f"{n:>12} {iters:>12} {log_n:>12.2f} {ratio:>12.4f}")
    print()
    print("Observation: Iterations ≈ ⌈log₂(n)⌉ + 1")
    print("Ratio converges to ~1 as n grows, confirming O(log n)")

verify_log_n_complexity()
```

With duplicates, the worst case degrades to O(n). Let's prove this rigorously and understand exactly when it occurs.
The adversarial input:
Consider an array where all elements are the same except one:
[1, 1, 1, 1, 1, 2, 1, 1, 1, 1, 1, 1, 1] (n = 13, with the single 2 placed just left of the middle)
Searching for 2: every probe sees nums[low] = nums[mid] = nums[high] = 1, so the only safe move is to shrink the window by one from each end. Because the shrink is symmetric, mid stays pinned to the center, and we repeat this ambiguous step roughly n/2 times before low finally reaches the 2. (If the 2 sat exactly at the center, the very first probe would find it, which is why the adversary nudges it off-center.)
Recurrence for worst case with duplicates:
In the worst case, every iteration encounters the ambiguous case:
T(n) = T(n-2) + O(1)
This is NOT the binary search recurrence. Let's solve it:
T(n) = T(n-2) + c
= T(n-4) + 2c
= T(n-6) + 3c
= ...
= T(n-2k) + k·c
Recursion ends when n-2k ≤ 1, i.e., when k ≥ (n-1)/2 ≈ n/2.
So: T(n) = T(1) + (n/2)·c = O(n)
Conclusion: Worst case with duplicates is O(n). ∎
This O(n) worst case is not a failure of our algorithm—it's a FUNDAMENTAL limitation. When nums[low] = nums[mid] = nums[high], no comparison-based algorithm has enough information to eliminate more than a constant number of elements. This is information-theoretic, not algorithmic.
```python
import math

def count_iterations_duplicates(nums: list[int], target: int) -> int:
    """Count iterations for the duplicates algorithm."""
    if not nums:
        return 0
    low, high = 0, len(nums) - 1
    iterations = 0
    while low <= high:
        iterations += 1
        mid = low + (high - low) // 2
        if nums[mid] == target:
            return iterations
        if nums[low] == nums[mid] == nums[high]:
            low += 1
            high -= 1
            continue
        if nums[low] <= nums[mid]:
            if nums[low] <= target < nums[mid]:
                high = mid - 1
            else:
                low = mid + 1
        else:
            if nums[mid] < target <= nums[high]:
                low = mid + 1
            else:
                high = mid - 1
    return iterations

def demonstrate_worst_case():
    """Show O(n) behavior on adversarial inputs."""
    print("Demonstrating O(n) Worst Case with Duplicates")
    print("=" * 60)
    print(f"{'n':>10} {'Iterations':>12} {'n/2':>10} {'log₂(n)':>10}")
    print("-" * 60)
    for n in [11, 21, 51, 101, 201, 501, 1001]:
        # Adversarial input: all 1s except one 2 placed just left of the middle
        # (a 2 at the exact middle would be found on the very first probe)
        arr = [1] * (n // 2 - 1) + [2] + [1] * (n // 2 + 1)
        # Search for the unique element
        iterations = count_iterations_duplicates(arr, 2)
        print(f"{len(arr):>10} {iterations:>12} {len(arr)//2:>10} {math.log2(len(arr)):>10.2f}")
    print()
    print("Observation: Iterations ≈ n/2, confirming O(n) worst case")
    print("Compare to O(log n) which would be ~10 iterations for n=1001")

def compare_best_vs_worst():
    """Compare best and worst distributions of duplicates."""
    print("\nComparing Duplicate Distributions")
    print("=" * 60)

    # Best case: target at mid on first try
    arr_best = [1, 2, 3, 4, 5, 5, 5, 5, 5, 6, 7, 8, 9]
    iters_best = count_iterations_duplicates(arr_best, 5)

    # Good case: some duplicates, but the boundaries stay informative
    arr_good = [4, 5, 6, 7, 7, 7, 7, 0, 1, 2, 3]
    iters_good = count_iterations_duplicates(arr_good, 1)

    # Worst case: all 1s except a single 2 just left of the middle
    arr_worst = [1] * 49 + [2] + [1] * 51
    iters_worst = count_iterations_duplicates(arr_worst, 2)

    print(f"Best case (target at mid): {iters_best} iterations")
    print(f"Good case (limited duplicates): {iters_good} iterations")
    print(f"Worst case (all 1s except one): {iters_worst} iterations")

demonstrate_worst_case()
compare_best_vs_worst()
```

Average case analysis requires understanding the probability distribution of inputs. For rotated array search, we consider three models.
Model 1: Distinct Elements
With distinct elements, every iteration halves the search space regardless of input. The average case equals the worst case:
Average case = O(log n)
Model 2: Duplicates with Uniform Distribution
Assume each array position is filled with a value chosen uniformly from {1, 2, ..., k} where k is much smaller than n (i.e., many duplicates expected).
The probability that nums[low] = nums[mid] = nums[high] is approximately:
P(ambiguous) ≈ (1/k)² = 1/k²
If k = 10 (10 distinct values), P(ambiguous) ≈ 1%.
With rare ambiguous cases, most iterations still halve the search space, preserving O(log n) average behavior.
Model 3: Duplicates with High Repetition
If one value dominates (say, 90% of elements are the same), then:
P(ambiguous) ≈ 0.9³ = 72.9%
With 73% of iterations only eliminating 2 elements, the average case approaches O(n).
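As a quick, hedged sanity check on these estimates, the sketch below (my own, not part of the lesson) samples the three probed values independently under each model and measures how often all three coincide. Treating the three positions as independent draws is a simplification, but it matches the back-of-the-envelope math above.

```python
import random

def estimate_p_ambiguous(trials: int = 100_000) -> None:
    """Estimate P(nums[low] == nums[mid] == nums[high]) under two toy models."""
    # Model 2: each value drawn uniformly from {1, ..., k}
    for k in (10, 3):
        hits = sum(
            1 for _ in range(trials)
            if len({random.randint(1, k) for _ in range(3)}) == 1
        )
        print(f"uniform, k={k:<2}   observed {hits / trials:7.3%}   theory ~ {1 / k**2:7.3%}")

    # Model 3: one value dominates; each probe hits it with probability p
    for p in (0.9, 0.5):
        hits = sum(
            1 for _ in range(trials)
            if all(random.random() < p for _ in range(3))
        )
        print(f"dominant, p={p}  observed {hits / trials:7.3%}   theory ~ {p ** 3:7.3%}")

estimate_p_ambiguous()
```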
Practical Takeaway:
The average case complexity depends heavily on the duplicate distribution: with few or scattered duplicates it stays close to O(log n), while a single dominant value pushes it toward O(n).
In most real-world scenarios with reasonable data distributions, the algorithm remains efficient.
When asked about average case with duplicates, say: 'The average case depends on the duplicate distribution. With limited duplicates, it remains O(log n). With many duplicates of the same value, it can approach O(n). The worst case is O(n) regardless, but that requires adversarial input where boundary values are always equal.'
```python
import random
import math

def count_iterations_duplicates(nums, target):
    """Count iterations (same as before)."""
    if not nums:
        return 0
    low, high = 0, len(nums) - 1
    iterations = 0
    while low <= high:
        iterations += 1
        mid = low + (high - low) // 2
        if nums[mid] == target:
            return iterations
        if nums[low] == nums[mid] == nums[high]:
            low += 1
            high -= 1
            continue
        if nums[low] <= nums[mid]:
            if nums[low] <= target < nums[mid]:
                high = mid - 1
            else:
                low = mid + 1
        else:
            if nums[mid] < target <= nums[high]:
                low = mid + 1
            else:
                high = mid - 1
    return iterations

def generate_rotated_array(n, k, duplicate_bias=0.0):
    """
    Generate a rotated sorted array with controlled duplicates.

    n: size of array
    k: number of distinct values to choose from
    duplicate_bias: probability of repeating the previous value (0 to 1)
    """
    # Generate sorted array with potential duplicates
    arr = []
    prev_val = random.randint(1, k)
    for _ in range(n):
        if random.random() < duplicate_bias:
            arr.append(prev_val)
        else:
            prev_val = random.randint(1, k)
            arr.append(prev_val)
    arr.sort()
    # Rotate by random amount
    rotation = random.randint(0, n - 1)
    return arr[rotation:] + arr[:rotation]

def simulate_average_case(n=1000, trials=100):
    """Simulate average case under different duplicate distributions."""
    print("Average Case Simulation")
    print("=" * 70)
    print(f"Array size: {n}, Trials per config: {trials}")
    print(f"Expected O(log n) iterations: {math.log2(n):.1f}")
    print()
    configs = [
        ("No duplicates (k=n)", n, 0.0),
        ("Limited duplicates (k=100)", 100, 0.0),
        ("Moderate duplicates (k=10)", 10, 0.3),
        ("Heavy duplicates (k=3)", 3, 0.5),
        ("Extreme duplicates (k=2)", 2, 0.8),
    ]
    print(f"{'Configuration':<35} {'Avg Iters':>12} {'Max Iters':>12}")
    print("-" * 70)
    for desc, k, bias in configs:
        total_iters = 0
        max_iters = 0
        for _ in range(trials):
            arr = generate_rotated_array(n, k, bias)
            target = random.choice(arr)  # Search for an existing element
            iters = count_iterations_duplicates(arr, target)
            total_iters += iters
            max_iters = max(max_iters, iters)
        avg_iters = total_iters / trials
        print(f"{desc:<35} {avg_iters:>12.1f} {max_iters:>12}")
    print()
    print("Note: As duplicates increase, average iterations approach O(n)")

simulate_average_case()
```

Space complexity is often overlooked but is equally important in real systems. Let's analyze both iterative and recursive implementations.
Iterative Implementation: O(1)
The iterative version uses only:
- `low`: one integer
- `high`: one integer
- `mid`: one integer
- `target`: one integer (input)

Regardless of input size, we use a constant amount of extra space. This is optimal.
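To make that concrete, here is a minimal iterative sketch of the distinct-elements search, consistent with the approach described in this module (written for illustration rather than taken from the lesson's reference code). It touches only the three index variables above, no matter how large the array is.

```python
def search_rotated(nums: list[int], target: int) -> int:
    """Iterative search in a rotated sorted array of distinct elements.

    Returns the index of target, or -1 if absent.
    Extra space: O(1), just the three index variables.
    """
    low, high = 0, len(nums) - 1
    while low <= high:
        mid = low + (high - low) // 2
        if nums[mid] == target:
            return mid
        if nums[low] <= nums[mid]:                 # left half [low..mid] is sorted
            if nums[low] <= target < nums[mid]:
                high = mid - 1                     # target lies inside the sorted left half
            else:
                low = mid + 1
        else:                                      # right half [mid..high] is sorted
            if nums[mid] < target <= nums[high]:
                low = mid + 1                      # target lies inside the sorted right half
            else:
                high = mid - 1
    return -1

# Example: [0, 1, 2, 4, 5, 6, 7] rotated at index 4
print(search_rotated([4, 5, 6, 7, 0, 1, 2], 0))    # -> 4
```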
Recursive Implementation: O(log n) to O(n)
The recursive version uses the call stack: with distinct elements the recursion depth is O(log n) because the range halves on every call, while with duplicates the depth can reach O(n) on adversarial inputs where the range shrinks by only two per call (a recursive sketch follows the table below).
Each stack frame uses O(1) space, so:
| Implementation | Distinct Elements | With Duplicates |
|---|---|---|
| Iterative | O(1) | O(1) |
| Recursive | O(log n) | O(n) worst case |
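For comparison, here is a recursive sketch of the distinct-elements version (again my own, for illustration): each call adds one O(1) frame, and the depth equals the number of times the range is halved. A duplicates-aware version would add the `low += 1; high -= 1` shrink step, which is exactly what can drive the depth to O(n).

```python
def search_rotated_recursive(nums: list[int], target: int,
                             low: int = 0, high: int | None = None) -> int:
    """Recursive variant for distinct elements; call depth mirrors the iteration count."""
    if high is None:
        high = len(nums) - 1
    if low > high:
        return -1
    mid = low + (high - low) // 2
    if nums[mid] == target:
        return mid
    if nums[low] <= nums[mid]:                      # left half sorted
        if nums[low] <= target < nums[mid]:
            return search_rotated_recursive(nums, target, low, mid - 1)
        return search_rotated_recursive(nums, target, mid + 1, high)
    else:                                           # right half sorted
        if nums[mid] < target <= nums[high]:
            return search_rotated_recursive(nums, target, mid + 1, high)
        return search_rotated_recursive(nums, target, low, mid - 1)

print(search_rotated_recursive([4, 5, 6, 7, 0, 1, 2], 2))   # -> 6
```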
If using recursion with duplicates, you risk stack overflow on adversarial inputs. For an array of 1 million elements with heavy duplicates, you might need 500,000 stack frames. Most systems default to ~1MB stack, which limits you to roughly 10,000-100,000 frames. Always prefer iteration for production code in this scenario.
Why does this matter?
In embedded systems, memory-constrained environments, or highly concurrent systems, space efficiency matters: stack space is typically a fixed per-thread budget, and an O(n) recursion depth can exhaust it long before heap memory becomes a concern.
The iterative version is almost always preferred for production systems.
To fully appreciate the complexity of rotated array search, let's compare it with related search algorithms.
| Algorithm | Time (Best) | Time (Avg) | Time (Worst) | Space | Requirements |
|---|---|---|---|---|---|
| Linear Search | O(1) | O(n) | O(n) | O(1) | None |
| Binary Search (sorted) | O(1) | O(log n) | O(log n) | O(1) | Sorted |
| Binary Search (rotated, distinct) | O(1) | O(log n) | O(log n) | O(1) | Rotated sorted, distinct |
| Binary Search (rotated, dups) | O(1) | O(log n)* | O(n) | O(1) | Rotated sorted |
| Hash Table Lookup | O(1) | O(1) | O(n) | O(n) | Preprocessing |
| Interpolation Search | O(1) | O(log log n) | O(n) | O(1) | Uniform distribution |
Key insights from comparison:
Rotated array search with distinct elements has the same complexity as regular binary search. The rotation changes the decision logic, not the efficiency.
Duplicates degrade to linear search worst case. But the average case is often still good!
Hash table lookup is O(1) but requires O(n) space and preprocessing. If doing many searches, the hash table wins; for a single search, rotated binary search is better (see the sketch after these insights).
Interpolation search can be O(log log n) on uniform data but degrades to O(n) otherwise. Rotated array search is more robust.
Linear search is simple and reliable but doesn't leverage the sorted structure. For small arrays, the constant factors might make linear search competitive.
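To make that trade-off concrete, here is a small illustrative sketch (the array and values are my own): building a value-to-index map costs O(n) time and O(n) space once, after which each lookup is O(1) on average, whereas a one-off query is served more cheaply by the O(log n) search with no preprocessing at all.

```python
def build_index(nums: list[int]) -> dict[int, int]:
    """One-time O(n) time / O(n) space preprocessing: map each value to its index."""
    return {value: i for i, value in enumerate(nums)}

nums = [4, 5, 6, 7, 0, 1, 2]        # rotated sorted array, distinct elements

# Many queries: pay O(n) once, then each lookup is O(1) on average.
index = build_index(nums)
for target in (0, 5, 8):
    print(index.get(target, -1))    # -> 4, 1, -1

# For a one-off query, this preprocessing alone costs more than the whole
# O(log n) rotated binary search it would replace.
```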
In interviews, if asked 'when would you use linear search instead?', good answers include: (1) Very small arrays where O(n) < O(log n) in practice, (2) Arrays with many duplicates where binary search degrades anyway, (3) Unsorted arrays where preprocessing cost exceeds search savings, (4) Simple code requirements where binary search complexity isn't justified.
An important question in algorithm analysis: Is our algorithm optimal? Can we do better?
For distinct elements: YES, O(log n) is optimal
Any comparison-based search algorithm on n distinct elements requires Ω(log n) comparisons in the worst case. This is because such an algorithm corresponds to a decision tree with at least n + 1 leaves (one for each possible position of the target, plus one for "not found"), and a binary tree with that many leaves must have depth at least log₂(n + 1).
Our algorithm achieves this bound, so it's optimal.
For duplicates: O(n) worst case is also optimal
When nums[low] = nums[mid] = nums[high], a single comparison yields zero bits of information about which half contains the target. We literally cannot do better than linear elimination in this case.
Proof sketch:
Consider the adversary argument. An adversary constructing the array can answer every probe with the repeated value, so until we examine the position that actually holds the target, we gain no information about where it lies. The adversary can therefore place the target so that we examine Ω(n) positions before finding it.
The degradation to O(n) with duplicates isn't a failure of our algorithm—it's a fundamental information-theoretic barrier. When comparisons don't distinguish positions, no amount of algorithmic cleverness can help. This is important to communicate in interviews to show deep understanding.
What about non-comparison-based approaches?
If we could use other operations (like hashing or arithmetic), could we do better?
Hashing: We could hash all elements in O(n) time, then do O(1) lookup. But this requires O(n) preprocessing and O(n) space. Not better for single queries.
Parallel search: With O(n) processors, we could achieve O(1) time. But this changes the computation model.
Specialized hardware: GPU-accelerated search could be faster in wall-clock time but not in big-O complexity.
For the standard RAM model with comparison-based algorithms, our approach is optimal.
Big-O complexity tells part of the story. In practice, constant factors, cache behavior, and branch prediction also matter.
Constant factors:
Our rotated array search does more work per iteration than standard binary search: besides comparing nums[mid] with the target, it must determine which half is sorted and then test whether the target falls within that half's range, roughly four comparisons per iteration instead of two (see the sketch below).
This roughly doubles the constant factor but doesn't change the O(log n) classification.
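The sketch below (my own instrumentation, under the simplifying assumption that each element comparison costs the same) counts comparisons directly for a standard binary search on a sorted array and for the rotated-array variant on a rotated copy. It illustrates the roughly 2x per-iteration difference while both totals stay proportional to log₂(n).

```python
import math

def binary_search_comparisons(nums: list[int], target: int) -> int:
    """Standard binary search on a sorted array; returns element comparisons made."""
    comparisons, low, high = 0, 0, len(nums) - 1
    while low <= high:
        mid = low + (high - low) // 2
        comparisons += 1                       # nums[mid] == target?
        if nums[mid] == target:
            return comparisons
        comparisons += 1                       # nums[mid] < target?
        if nums[mid] < target:
            low = mid + 1
        else:
            high = mid - 1
    return comparisons

def rotated_search_comparisons(nums: list[int], target: int) -> int:
    """Rotated-array search (distinct elements); returns element comparisons made."""
    comparisons, low, high = 0, 0, len(nums) - 1
    while low <= high:
        mid = low + (high - low) // 2
        comparisons += 1                       # nums[mid] == target?
        if nums[mid] == target:
            return comparisons
        comparisons += 1                       # which half is sorted?
        if nums[low] <= nums[mid]:
            comparisons += 2                   # range test against the sorted left half
            if nums[low] <= target < nums[mid]:
                high = mid - 1
            else:
                low = mid + 1
        else:
            comparisons += 2                   # range test against the sorted right half
            if nums[mid] < target <= nums[high]:
                low = mid + 1
            else:
                high = mid - 1
    return comparisons

n = 1_000_000
sorted_arr = list(range(n))
rotated_arr = sorted_arr[n // 3:] + sorted_arr[:n // 3]

print(f"log2(n)         : {math.log2(n):.1f}")
print(f"standard search : {binary_search_comparisons(sorted_arr, 123_456)} comparisons")
print(f"rotated search  : {rotated_search_comparisons(rotated_arr, 123_456)} comparisons")
```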
Cache efficiency:
Binary search is cache-friendly in the early iterations (accessing widely distributed elements) and becomes more cache-friendly as the search narrows. Rotated array search has the same access pattern—no difference in cache behavior.
Branch prediction:
The extra conditionals in rotated array search might cause more branch mispredictions. Modern CPUs handle this well, but in extremely performance-critical code, this could matter.
| Factor | Standard Binary Search | Rotated Array Search | Impact |
|---|---|---|---|
| Comparisons per iteration | ~2 | ~4 | ~2x slower per iteration |
| Iterations | log₂(n) | log₂(n) | Equal |
| Cache behavior | Good | Good | Equal |
| Branch prediction | Good | Slightly worse | Minimal |
| Code complexity | Simple | Moderate | Maintenance cost |
For 99% of applications, the O(log n) complexity is what matters. The constant factor difference between rotated and standard binary search is negligible compared to algorithm choice (O(log n) vs O(n)). Only in extremely hot paths (called millions of times per second) would you consider micro-optimizations.
When to use linear search instead:
Despite its O(n) complexity, linear search can be the better choice in several situations:
For small arrays (n below roughly 64), where a sequential scan is cache-friendly, branch-predictable, and free of per-iteration bookkeeping
With many duplicates, where binary search degrades toward O(n) anyway
When the search is very small relative to the surrounding work, so simpler code wins
This is why some standard library implementations switch to linear search below a size threshold (see the sketch below).
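Here is a hypothetical hybrid sketch of that idea; the cutoff of 64 is an assumed constant rather than a measured one, since real libraries tune such thresholds empirically.

```python
LINEAR_THRESHOLD = 64   # assumed cutoff; real libraries tune this empirically

def hybrid_search(nums: list[int], target: int) -> int:
    """Linear scan for tiny inputs, rotated binary search (distinct elements) otherwise."""
    if len(nums) < LINEAR_THRESHOLD:
        for i, value in enumerate(nums):       # O(n), but n is tiny on this path
            if value == target:
                return i
        return -1

    low, high = 0, len(nums) - 1               # O(log n) path for larger arrays
    while low <= high:
        mid = low + (high - low) // 2
        if nums[mid] == target:
            return mid
        if nums[low] <= nums[mid]:             # left half sorted
            if nums[low] <= target < nums[mid]:
                high = mid - 1
            else:
                low = mid + 1
        else:                                  # right half sorted
            if nums[mid] < target <= nums[high]:
                low = mid + 1
            else:
                high = mid - 1
    return -1

print(hybrid_search([4, 5, 6, 7, 0, 1, 2], 1))                          # linear path -> 5
print(hybrid_search(list(range(500, 1000)) + list(range(500)), 250))    # binary path -> 750
```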
We've conducted a comprehensive analysis of rotated array search complexity. Let's consolidate the key insights:
Distinct elements: O(log n) time and O(1) space, exactly matching standard binary search.
Duplicates: the worst case degrades to O(n), and this is an information-theoretic limit rather than an implementation flaw.
Average case with duplicates: depends on the distribution: sparse duplicates keep it near O(log n), while a dominant value pushes it toward O(n).
Implementation choice: prefer iteration for guaranteed O(1) space and to avoid stack-overflow risk on adversarial inputs.
Optimality: both the O(log n) and O(n) bounds are the best possible for comparison-based search in the standard RAM model.
Module complete!
You now have a complete understanding of searching in rotated sorted arrays—from the foundational insight that one half is always sorted, through the modified binary search strategy, handling duplicates, and rigorous complexity analysis. This is one of the most frequently asked interview problems, and you're now equipped to handle it with confidence and depth.
Congratulations! You've mastered search in rotated arrays—a classic interview problem that tests deep understanding of binary search invariants. You can identify sorted portions, implement the search algorithm, handle duplicates, and articulate the complexity trade-offs. This knowledge will serve you well in interviews and in building efficient systems.