In 1960, computer scientist Edward Fredkin introduced a data structure he called a trie (from retrieval, though now commonly pronounced "try"). This elegant structure would become the backbone of nearly every typeahead system built in the following decades.
The trie's genius lies in its structure: rather than storing strings as complete units, it decomposes them into a tree where each node represents a character position. This decomposition enables O(m) lookup time where m is the query length—independent of how many strings are stored. Whether your corpus has 1,000 suggestions or 1 billion, finding prefixes takes the same amount of time.
Understanding tries deeply is essential for any engineer building search or autocomplete systems. This page will take you from first principles through production-grade optimizations.
By the end of this page, you will understand how tries work at a fundamental level, their time and space complexity, variants like radix tries and ternary search tries, and the practical considerations for implementing tries at scale. You'll see code examples, memory layouts, and the specific optimizations that make production tries efficient.
A trie (prefix tree) is a tree data structure where:

- Each node corresponds to a single character; the root corresponds to the empty string
- The path from the root to any node spells out a prefix of one or more stored strings
- Nodes marked as terminal indicate that the path from the root spells a complete word
- All descendants of a node share the prefix that node represents
Let's build a trie containing: ["car", "card", "care", "careful", "cat", "cats"]
```
            (root)
              |
             'c'
              |
             'a'
            /    \
         'r'●    't'●
        /    \       \
     'd'●    'e'●    's'●
              |
             'f'
              |
             'u'
              |
             'l'●
```
The ● symbol marks terminal nodes (complete words). Notice:

- "car", "card", "care", and "careful" all share the path c → a → r
- A terminal node can still have children: "car" is a complete word and also a prefix of "card" and "care"
- Adding "cats" after "cat" costs only a single new node
```typescript
class TrieNode {
  children: Map<string, TrieNode>;
  isTerminal: boolean;
  // Store the complete word at terminal nodes for easy retrieval
  word: string | null;
  // Metadata for ranking (frequency, score, etc.)
  metadata: {
    frequency: number;
    lastUpdated: number;
  } | null;

  constructor() {
    this.children = new Map();
    this.isTerminal = false;
    this.word = null;
    this.metadata = null;
  }
}

class Trie {
  private root: TrieNode;

  constructor() {
    this.root = new TrieNode();
  }

  /**
   * Insert a word into the trie
   * Time: O(m) where m = word length
   * Space: O(m) worst case (no shared prefix)
   */
  insert(word: string, frequency: number = 1): void {
    let current = this.root;
    for (const char of word.toLowerCase()) {
      if (!current.children.has(char)) {
        current.children.set(char, new TrieNode());
      }
      current = current.children.get(char)!;
    }
    current.isTerminal = true;
    current.word = word;
    current.metadata = {
      frequency,
      lastUpdated: Date.now(),
    };
  }

  /**
   * Find the node corresponding to a prefix
   * Time: O(m) where m = prefix length
   */
  private findPrefixNode(prefix: string): TrieNode | null {
    let current = this.root;
    for (const char of prefix.toLowerCase()) {
      if (!current.children.has(char)) {
        return null;
      }
      current = current.children.get(char)!;
    }
    return current;
  }

  /**
   * Get all words with the given prefix
   * Time: O(m + k) where m = prefix length, k = results
   */
  getWordsWithPrefix(prefix: string, limit: number = 10): string[] {
    const prefixNode = this.findPrefixNode(prefix);
    if (!prefixNode) {
      return [];
    }

    const results: { word: string; frequency: number }[] = [];
    this.collectWords(prefixNode, results, limit * 2); // Collect extra for sorting

    // Sort by frequency and return top results
    return results
      .sort((a, b) => b.frequency - a.frequency)
      .slice(0, limit)
      .map(r => r.word);
  }

  /**
   * Recursively collect all terminal words under a node
   */
  private collectWords(
    node: TrieNode,
    results: { word: string; frequency: number }[],
    limit: number
  ): void {
    if (results.length >= limit) return;
    if (node.isTerminal && node.word) {
      results.push({
        word: node.word,
        frequency: node.metadata?.frequency ?? 0,
      });
    }
    for (const child of node.children.values()) {
      this.collectWords(child, results, limit);
    }
  }
}
```

The trie's power is prefix sharing. For a corpus where many words share prefixes (which is true of natural language), the trie dramatically reduces memory compared to storing each word independently, while providing O(m) lookup regardless of corpus size.
Understanding the complexity characteristics of tries is essential for making informed architectural decisions.
| Operation | Time Complexity | Description |
|---|---|---|
| Insert | O(m) | m = length of word being inserted |
| Search (exact) | O(m) | m = length of word being searched |
| Prefix Search | O(m) | m = length of prefix; just finds the node |
| Collect All Matches | O(m + k) | m = prefix length, k = number of results |
| Delete | O(m) | m = length of word; may need cleanup |
The critical observation: lookup time is independent of corpus size. Whether you have 1,000 or 1 billion suggestions, finding "machine learning" takes exactly the same 16 character comparisons.
Compare to alternatives:
| Data Structure | Prefix Search | For 1B entries, prefix len 10 |
|---|---|---|
| Sorted Array + Binary Search | O(m log n) | 10 × 30 = 300 comparisons |
| Hash Table (all prefixes) | O(m) | Requires storing all prefixes → huge memory |
| Trie | O(m) | 10 comparisons |
| B-Tree | O(m log n) | 10 × 30 = 300 comparisons |
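A quick way to see this property in action: instrument a toy trie to count child-map hops per lookup. This `DemoTrie` is a stripped-down illustration (not the full implementation shown earlier), but it makes the point that lookup cost tracks prefix length, not corpus size.

```typescript
// Minimal trie with a hop counter: each child-map traversal is one "hop".
class DemoNode {
  children = new Map<string, DemoNode>();
  isTerminal = false;
}

class DemoTrie {
  root = new DemoNode();
  hops = 0; // instrumentation: child-map traversals in the last lookup

  insert(word: string): void {
    let cur = this.root;
    for (const ch of word) {
      if (!cur.children.has(ch)) cur.children.set(ch, new DemoNode());
      cur = cur.children.get(ch)!;
    }
    cur.isTerminal = true;
  }

  hasPrefix(prefix: string): boolean {
    this.hops = 0;
    let cur = this.root;
    for (const ch of prefix) {
      const next = cur.children.get(ch);
      if (!next) return false;
      cur = next;
      this.hops++;
    }
    return true;
  }
}

// A 3-word trie vs a 10,000-word trie: same lookup cost for "car"
const small = new DemoTrie();
["car", "card", "care"].forEach(w => small.insert(w));

const large = new DemoTrie();
for (let i = 0; i < 10_000; i++) large.insert(`car${i.toString(36)}`);

small.hasPrefix("car");
const smallHops = small.hops;
large.hasPrefix("car");
const largeHops = large.hops;
console.log(smallHops, largeHops); // 3 3
```

Both lookups perform exactly three hops, one per character of the prefix, regardless of how many words are stored beneath.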
Space analysis is more nuanced:

- Worst case: O(ALPHABET_SIZE × m × n), for n words of average length m with every node reserving a full set of child slots and no prefix sharing
- Practical case: much better, thanks to prefix sharing
For natural language with significant prefix overlap, actual space is typically 3-5× the raw string data, not the theoretical worst case.
```
Corpus: 1 billion suggestions
Average suggestion length: 25 characters
Raw string storage: 25 GB

Naive trie (Map-based, no optimization):
- Each node: ~100 bytes (Map overhead, pointers)
- Estimated nodes: 10B (due to prefix sharing, ~10 nodes/word)
- Total: ~1 TB (unacceptable)

Optimized trie (array-based, compressed):
- Each node: ~16-32 bytes
- Compressed paths: ~5B effective nodes
- Total: ~80-160 GB (fits in modern server RAM)

With radix compression:
- Merge single-child chains
- Further reduce to ~50-100 GB
```

Tries are fast but memory-hungry. Production systems must use compressed variants (radix tries, succinct tries) and careful memory management. We'll cover these optimizations in detail.
The basic trie we saw is rarely used in production. Several variants offer better space efficiency and performance characteristics.
The radix trie (also called Patricia trie or compact prefix tree) compresses chains of single-child nodes into single edges labeled with strings rather than characters:
```
Standard Trie for ["romane", "romanus", "romulus"]:

        (root)
          |
         'r'
          |
         'o'
          |
         'm'
        /    \
     'a'      'u'
      |        |
     'n'      'l'
    /   \      |
  'e'●  'u'   'u'
         |     |
        's'●  's'●

Radix Trie (compressed):

        (root)
          |
        "rom"
        /    \
     "an"   "ulus"●
     /   \
  "e"●   "us"●

Reduction: 13 nodes → 6 nodes (counting the root: over 50% fewer)
```
```typescript
interface RadixTrieNode {
  // Edge label can be multiple characters
  edgeLabel: string;
  // Children indexed by first character of edge
  children: Map<string, RadixTrieNode>;
  isTerminal: boolean;
  word: string | null;
  metadata: SuggestionMetadata | null;
}

// Sketch: edge-splitting and matching logic elided
class RadixTrie {
  root: RadixTrieNode;

  insert(word: string, metadata: SuggestionMetadata): void {
    // On insert, may need to split existing edges
    // Example: inserting "roman" when "romane" exists
    // requires splitting "romane" into "roman" + "e"
  }

  search(prefix: string): RadixTrieNode | null {
    // Navigate by matching edge labels
    // A single edge match can consume multiple characters
    return null; // sketch
  }
}
```

The ternary search trie combines trie efficiency with binary search tree characteristics. Each node has three children: a left child for characters smaller than the node's character, a middle child continuing to the next character of the same word, and a right child for larger characters.
Advantages:

- Far more space-efficient than array-based tries for sparse or large alphabets: three pointers per node instead of one slot per possible character
- Supports ordered traversal and range queries, like a binary search tree
- Handles large alphabets (e.g., Unicode) gracefully
```typescript
interface TSTNode {
  char: string;           // The character at this node
  left: TSTNode | null;   // Characters < char
  middle: TSTNode | null; // Continuation (next char of same word)
  right: TSTNode | null;  // Characters > char
  isTerminal: boolean;
  word: string | null;
}

// Search in TST
function searchTST(node: TSTNode | null, word: string, index: number): TSTNode | null {
  if (!node || index >= word.length) return null;

  const char = word[index];

  if (char < node.char) {
    return searchTST(node.left, word, index);
  } else if (char > node.char) {
    return searchTST(node.right, word, index);
  } else {
    // char === node.char
    if (index === word.length - 1) {
      return node; // Found the prefix
    }
    return searchTST(node.middle, word, index + 1);
  }
}
```

For fixed alphabets (like ASCII), using arrays instead of maps dramatically improves cache locality:
```typescript
const ALPHABET_SIZE = 128; // ASCII

interface AMTNode {
  // Fixed-size array indexed by character code
  // children[65] = child for 'A' (ASCII 65)
  children: (AMTNode | null)[];
  isTerminal: boolean;
  word: string | null;
  metadata: SuggestionMetadata | null;
}

function createNode(): AMTNode {
  return {
    children: new Array(ALPHABET_SIZE).fill(null),
    isTerminal: false,
    word: null,
    metadata: null,
  };
}

function insert(root: AMTNode, word: string): void {
  let current = root;
  for (const char of word) {
    const index = char.charCodeAt(0);
    if (!current.children[index]) {
      current.children[index] = createNode();
    }
    current = current.children[index]!;
  }
  current.isTerminal = true;
  current.word = word;
}

// Benefits:
// 1. O(1) child lookup vs O(log k) for Maps
// 2. Better cache locality (contiguous memory)
// 3. Simpler memory layout for serialization
//
// Tradeoffs:
// 1. Wastes space for sparse nodes
// 2. Fixed alphabet size
```

For typeahead with ASCII/UTF-8 text: Radix trie with array-mapped children is often optimal—good compression, excellent lookup speed, and reasonable memory. For multi-language with large Unicode ranges: TST or Map-based tries handle sparse alphabets better. Always benchmark with realistic data.
Memory optimization is critical for production tries. A naive implementation won't fit billion-entry corpora in memory. Here are proven techniques used by major tech companies.
We covered this above—merge single-child chains. This alone typically reduces node count by 50-70%.
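As a concrete sketch of what this pass looks like, here is a minimal map-based trie where a one-shot `compress` merges every chain of non-terminal, single-child nodes into a multi-character edge label. The node shape (`CNode`) and function names are illustrative, not from the implementations above.

```typescript
interface CNode {
  // Key = edge label: 1 char before compression, possibly more after
  children: Map<string, CNode>;
  isTerminal: boolean;
}

function insertWord(root: CNode, word: string): void {
  let cur = root;
  for (const ch of word) {
    if (!cur.children.has(ch)) {
      cur.children.set(ch, { children: new Map(), isTerminal: false });
    }
    cur = cur.children.get(ch)!;
  }
  cur.isTerminal = true;
}

// Merge every chain of non-terminal, single-child nodes below `node`
// into one edge whose label is the concatenated characters.
function compress(node: CNode): void {
  for (const [label, child] of [...node.children]) {
    compress(child);
    let mergedLabel = label;
    let current = child;
    while (!current.isTerminal && current.children.size === 1) {
      const [nextLabel, next] = [...current.children][0];
      mergedLabel += nextLabel;
      current = next;
    }
    if (current !== child) {
      node.children.delete(label);
      node.children.set(mergedLabel, current);
    }
  }
}

// The Patricia example from the previous section:
const root: CNode = { children: new Map(), isTerminal: false };
for (const w of ["romane", "romanus", "romulus"]) insertWord(root, w);
compress(root);
console.log([...root.children.keys()]); // ["rom"]
```

After compression, the root has a single "rom" edge, and the node below it has exactly the "an" and "ulus" edges from the radix diagram. Note the `!current.isTerminal` guard: a terminal node like the 'e' of "care" must stay addressable even if it has only one child.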
Many suggestions share common suffixes or substrings. Use string interning:
```typescript
class StringInterner {
  private pool: Map<string, string> = new Map();

  intern(s: string): string {
    const existing = this.pool.get(s);
    if (existing) {
      return existing; // Return the canonical instance
    }
    this.pool.set(s, s);
    return s;
  }
}

// Usage: Store only one instance of common substrings
const interner = new StringInterner();

// Instead of each node storing its own "the" string:
node.edgeLabel = interner.intern(rawLabel);

// Memory impact: If "the" appears in 1M edge labels,
// we store it once instead of 1M times
// Savings: ~3 bytes × 1M = 3 MB per common substring
```

Instead of storing nulls for non-existent children in arrays, use bitmaps:
```typescript
interface CompactNode {
  // Bitmap indicating which characters have children
  // For ASCII 128 chars, need 2 × 64-bit integers
  childBitmap: [bigint, bigint];

  // Compact array containing only non-null children
  // Position in array determined by popcount of bitmap
  children: CompactNode[];

  isTerminal: boolean;
  metadata: number; // Index into separate metadata array
}

function getChild(node: CompactNode, char: string): CompactNode | null {
  const charCode = char.charCodeAt(0);
  const bitmapIndex = charCode < 64 ? 0 : 1;
  const bitPosition = charCode % 64;

  // Check if child exists
  const mask = 1n << BigInt(bitPosition);
  if ((node.childBitmap[bitmapIndex] & mask) === 0n) {
    return null;
  }

  // Calculate position in children array using popcount
  let position = 0;
  if (bitmapIndex === 1) {
    position += popcount(node.childBitmap[0]);
  }
  position += popcount(node.childBitmap[bitmapIndex] & (mask - 1n));

  return node.children[position];
}

function popcount(n: bigint): number {
  // Count number of 1 bits (CPU has dedicated instruction)
  let count = 0;
  while (n > 0n) {
    count += Number(n & 1n);
    n >>= 1n;
  }
  return count;
}

// Memory savings:
// Standard: 128 pointers × 8 bytes = 1024 bytes/node
// Bitmap: 16 bytes bitmap + (avg 3 children × 8 bytes) = 40 bytes/node
// 25× reduction!
```

For maximum compression, succinct data structures represent the trie using information-theoretically optimal space while maintaining O(1) operations. The LOUDS (Level-Order Unary Degree Sequence) encoding is popular:
```
Standard trie:
        (root)
       /  |  \
      A   B   C
     / \       \
    D   E       F
    |
    G

LOUDS encoding (bit sequence):
Level 0 (root):   1110      (3 children: A, B, C)
Level 1 (A,B,C):  110 0 10  (A: 2 children, B: 0, C: 1)
Level 2 (D,E,F):  10 0 0    (D: 1 child, E: 0, F: 0)
Level 3 (G):      0         (G: 0 children)

Full sequence: 1110 110 0 10 10 0 0 0

Space: 2 bits per node (vs ~100+ bytes per node in naive impl)
Navigation uses rank/select operations on bit vectors
```

Succinct tries offer ~50× compression over naive implementations but come with complexity costs: harder to update, require specialized libraries (like SDSL or Succinct), and have higher constant factors for operations. Use them for read-heavy, rarely-updated indices. For frequently-updated tries, radix tries with bitmap indexing are usually the better trade-off.
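LOUDS navigation is built on two primitives, rank and select. Here is a minimal sketch of both in their naive O(n) form (production succinct structures answer these in O(1) using precomputed block indexes), applied to the example bit sequence:

```typescript
type BitVector = boolean[];

// rank1(i): number of 1-bits in positions [0, i)
function rank1(bits: BitVector, i: number): number {
  let count = 0;
  for (let j = 0; j < i; j++) if (bits[j]) count++;
  return count;
}

// select0(k): position of the k-th 0-bit (1-indexed), or -1 if absent
function select0(bits: BitVector, k: number): number {
  let seen = 0;
  for (let j = 0; j < bits.length; j++) {
    if (!bits[j] && ++seen === k) return j;
  }
  return -1;
}

// The full LOUDS sequence from the example: 1110 110 0 10 10 0 0 0
const louds: BitVector = [..."111011001010000"].map(b => b === "1");

// One 1-bit per edge; one 0-bit terminates each node's child list
const edges = rank1(louds, louds.length);  // 7 edges
const nodesCount = louds.length - edges;   // 8 nodes (root, A..G)
console.log(edges, nodesCount, select0(louds, 1)); // 7 8 3
```

Counting bits recovers the tree's shape: 7 one-bits for the 7 edges, 8 zero-bits for the 8 nodes, and the first 0 (position 3) closes the root's child list.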
Typeahead systems don't need all matches—they need the top K most relevant suggestions (typically 5-10). Naively collecting all matches then sorting is expensive for popular prefixes like "the" or "how to".
For prefix "a", there might be millions of matching suggestions. We can't collect and sort millions of candidates for every keystroke.
Store the top-K suggestions at each node during insertion:
```typescript
const TOP_K = 15; // K value (typically 10-20)

interface TrieNodeWithTopK {
  children: Map<string, TrieNodeWithTopK>;
  isTerminal: boolean;

  // Pre-computed top-K for this prefix
  topK: {
    word: string;
    score: number;
  }[];
}

class TrieWithTopK {
  private root: TrieNodeWithTopK = this.createNode();

  private createNode(): TrieNodeWithTopK {
    return { children: new Map(), isTerminal: false, topK: [] };
  }

  insert(word: string, score: number): void {
    let current = this.root;

    for (const char of word.toLowerCase()) {
      // Update top-K at every node along the path
      this.updateTopK(current, word, score);

      if (!current.children.has(char)) {
        current.children.set(char, this.createNode());
      }
      current = current.children.get(char)!;
    }

    this.updateTopK(current, word, score);
    current.isTerminal = true;
  }

  private updateTopK(node: TrieNodeWithTopK, word: string, score: number): void {
    // Check if this word should be in top-K
    const existingIndex = node.topK.findIndex(e => e.word === word);

    if (existingIndex !== -1) {
      // Update existing entry
      node.topK[existingIndex].score = score;
    } else if (node.topK.length < TOP_K) {
      // Add new entry
      node.topK.push({ word, score });
    } else if (score > node.topK[node.topK.length - 1].score) {
      // Replace lowest entry
      node.topK.pop();
      node.topK.push({ word, score });
    }

    // Keep sorted by score
    node.topK.sort((a, b) => b.score - a.score);
  }

  private findPrefixNode(prefix: string): TrieNodeWithTopK | null {
    let current = this.root;
    for (const char of prefix.toLowerCase()) {
      const next = current.children.get(char);
      if (!next) return null;
      current = next;
    }
    return current;
  }

  getSuggestions(prefix: string): string[] {
    const node = this.findPrefixNode(prefix);
    if (!node) return [];

    // O(1) retrieval - just return the pre-computed list!
    return node.topK.map(e => e.word);
  }
}
```

Trade-offs of pre-computed Top-K:
Pros:

- O(1) suggestion retrieval at query time: just read the pre-computed list
- Uniform, predictable latency regardless of prefix popularity
Cons:

- Memory overhead: up to K entries duplicated at every node along every path
- Slower inserts: each insert updates the top-K list at every node on its path
- Score changes must be re-propagated through all affected prefixes
Traverse the subtree in score order, stopping after K results:
```typescript
import { MinPriorityQueue, MaxPriorityQueue } from '@datastructures-js/priority-queue';

// Assumes each node stores maxDescendantScore: the highest suggestion
// score anywhere in its subtree (maintained at insert time).
function getTopKSuggestions(
  prefixNode: TrieNode,
  k: number
): Suggestion[] {
  // Min-heap of the current top-K: front() is the weakest entry
  const topK = new MinPriorityQueue<Suggestion>((s) => s.score);

  // Priority queue for DFS - explore high-score subtrees first
  const toExplore = new MaxPriorityQueue<{
    node: TrieNode;
    maxPossibleScore: number;
  }>((item) => item.maxPossibleScore);

  toExplore.enqueue({
    node: prefixNode,
    maxPossibleScore: prefixNode.maxDescendantScore ?? Infinity,
  });

  while (!toExplore.isEmpty()) {
    const { node, maxPossibleScore } = toExplore.dequeue()!;

    // Pruning: if the best possible score in this subtree is no better
    // than our weakest top-K entry, skip the subtree entirely
    if (topK.size() >= k && maxPossibleScore <= topK.front()!.score) {
      continue;
    }

    // Process terminal node
    if (node.isTerminal && node.word && node.metadata) {
      const suggestion = {
        word: node.word,
        score: node.metadata.score,
      };

      if (topK.size() < k) {
        topK.enqueue(suggestion);
      } else if (suggestion.score > topK.front()!.score) {
        topK.dequeue(); // evict the weakest entry
        topK.enqueue(suggestion);
      }
    }

    // Add children to explore
    for (const child of node.children.values()) {
      toExplore.enqueue({
        node: child,
        maxPossibleScore: child.maxDescendantScore ?? 0,
      });
    }
  }

  return topK.toArray().sort((a, b) => b.score - a.score);
}
```

Production systems often use a hybrid: pre-compute top-K for the most common prefixes (top 10,000 prefixes cover 80%+ of traffic), and use priority-based DFS for the long tail. This optimizes the common case while handling edge cases correctly.
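The hybrid dispatch itself is simple. Here is a minimal sketch; `HybridSuggester` and `dfsTopK` are illustrative names (the fallback stands in for the priority-based traversal), not a real library API.

```typescript
// Route hot prefixes to a precomputed table; everything else falls
// through to trie traversal.
class HybridSuggester {
  constructor(
    private hotPrefixes: Map<string, string[]>, // precomputed top-K lists
    private dfsTopK: (prefix: string, k: number) => string[]
  ) {}

  suggest(prefix: string, k = 10): string[] {
    const hot = this.hotPrefixes.get(prefix);
    if (hot) return hot.slice(0, k); // O(1) hot path
    return this.dfsTopK(prefix, k);  // long-tail fallback
  }
}

// Usage with stub data:
const hybrid = new HybridSuggester(
  new Map([["the", ["the weather", "the office", "theory"]]]),
  (prefix, k) => [`${prefix}-fallback`].slice(0, k)
);
console.log(hybrid.suggest("the", 2));   // ["the weather", "the office"]
console.log(hybrid.suggest("zebra", 2)); // ["zebra-fallback"]
```

In practice the hot-prefix table is rebuilt periodically from query logs, so the 80%+ of traffic it covers tracks actual user behavior.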
Production typeahead systems handle thousands of concurrent readers and writers. The trie must support this without becoming a bottleneck.
Typical ratio: 99.9% reads, 0.1% writes. This heavily influences design:

- The read path must be as close to lock-free as possible; it serves every keystroke
- Writers can afford extra work (copying, batching, rebuilding) to keep reads cheap
- Slightly stale suggestions are acceptable; a blocked read is not
Never modify existing nodes. Create new versions on update:
```typescript
class ImmutableTrieNode {
  readonly children: ReadonlyMap<string, ImmutableTrieNode>;
  readonly isTerminal: boolean;
  readonly word: string | null;
  readonly metadata: SuggestionMetadata | null;

  constructor(
    children: Map<string, ImmutableTrieNode>,
    isTerminal: boolean,
    word: string | null,
    metadata: SuggestionMetadata | null
  ) {
    this.children = children;
    this.isTerminal = isTerminal;
    this.word = word;
    this.metadata = metadata;
  }

  // Create new node with updated child
  withChild(char: string, child: ImmutableTrieNode): ImmutableTrieNode {
    const newChildren = new Map(this.children);
    newChildren.set(char, child);
    return new ImmutableTrieNode(
      newChildren,
      this.isTerminal,
      this.word,
      this.metadata
    );
  }
}

class COWTrie {
  private root: ImmutableTrieNode;

  insert(word: string, metadata: SuggestionMetadata): void {
    // Build path of new nodes from leaf to root
    const path: { char: string; node: ImmutableTrieNode }[] = [];
    let current = this.root;

    for (const char of word) {
      path.push({ char, node: current });
      current = current.children.get(char) ??
        new ImmutableTrieNode(new Map(), false, null, null);
    }

    // Create terminal node
    let newNode = new ImmutableTrieNode(
      new Map(current.children),
      true,
      word,
      metadata
    );

    // Walk back up, creating new nodes along the path
    for (let i = path.length - 1; i >= 0; i--) {
      const { char, node } = path[i];
      newNode = node.withChild(char, newNode);
    }

    // Atomic swap of root reference
    this.root = newNode;
  }
}

// Benefits:
// 1. Zero contention on reads - readers see consistent snapshot
// 2. Readers never block writers, writers never block readers
// 3. Easy snapshots for backup/replication
//
// Costs:
// 1. Memory churn from creating new nodes
// 2. GC pressure in managed languages
// 3. Higher write latency
```
```typescript
const NUM_STRIPES = 64; // Power of 2 for fast modulo

// RWLock is an assumed async reader-writer lock primitive;
// searchInternal/insertInternal are the unsynchronized operations (elided).
class StripedTrie {
  private root: TrieNode;
  private locks: RWLock[];

  constructor() {
    this.root = new TrieNode();
    this.locks = Array.from(
      { length: NUM_STRIPES },
      () => new RWLock()
    );
  }

  private getLockIndex(prefix: string): number {
    // Hash first few characters to determine stripe
    const hashChars = prefix.slice(0, 3).padEnd(3, '\0');
    const hash =
      hashChars.charCodeAt(0) * 31 * 31 +
      hashChars.charCodeAt(1) * 31 +
      hashChars.charCodeAt(2);
    return hash & (NUM_STRIPES - 1); // Fast modulo for power of 2
  }

  async search(prefix: string): Promise<Suggestion[]> {
    const lockIndex = this.getLockIndex(prefix);
    await this.locks[lockIndex].acquireRead();
    try {
      return this.searchInternal(prefix);
    } finally {
      this.locks[lockIndex].releaseRead();
    }
  }

  async insert(word: string, metadata: SuggestionMetadata): Promise<void> {
    const lockIndex = this.getLockIndex(word);
    await this.locks[lockIndex].acquireWrite();
    try {
      this.insertInternal(word, metadata);
    } finally {
      this.locks[lockIndex].releaseWrite();
    }
  }
}

// Benefits:
// 1. Parallel writes to different prefixes
// 2. Lower memory overhead than COW
// 3. In-place updates
//
// Costs:
// 1. Risk of lock contention on popular prefixes
// 2. Readers must acquire locks (even if read-only)
// 3. Careful deadlock prevention needed
```

Many production systems sidestep concurrent modification entirely: build a complete new trie in the background, then atomically swap the reference. Queries use the current trie until the swap, then seamlessly use the new one. This trades freshness (updates batch every N minutes) for simpler concurrency.
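A sketch of that rebuild-and-swap pattern, generic over the index type (in single-threaded JavaScript the reference assignment is naturally atomic; native implementations would use an atomic pointer swap):

```typescript
// Queries always read `active`; a background job builds a fresh index
// and replaces the reference in one assignment. No locks on the read path.
class SwappableIndex<T> {
  private active: T;

  constructor(initial: T) {
    this.active = initial;
  }

  read<R>(fn: (index: T) => R): R {
    return fn(this.active); // always sees one consistent snapshot
  }

  rebuild(build: () => T): void {
    const fresh = build(); // expensive work, off the query path
    this.active = fresh;   // single-reference swap
  }
}

// Usage with a plain Set standing in for a trie:
const index = new SwappableIndex(new Set(["car", "cat"]));
const before = index.read(s => s.has("care"));
index.rebuild(() => new Set(["car", "cat", "care"]));
const after = index.read(s => s.has("care"));
console.log(before, after); // false true
```

Readers started before the swap keep their snapshot; readers started after see the new index. The old trie is reclaimed once no reader references it.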
Tries must be persisted for durability and transferred across network for replication. Efficient serialization is critical.
A simple but effective format:
```typescript
interface SerializedTrie {
  // Header
  magic: number;             // 4 bytes: Format identifier
  version: number;           // 2 bytes: Format version
  nodeCount: number;         // 4 bytes: Total nodes
  stringTableOffset: number; // 4 bytes: Offset to string table

  // Node array (fixed-size entries for random access)
  nodes: SerializedNode[];

  // String table (variable-length strings)
  stringTable: Uint8Array;
}

interface SerializedNode {
  childBitmap: bigint;    // 16 bytes: Which children exist
  childrenOffset: number; // 4 bytes: Offset in nodes array
  flags: number;          // 1 byte: isTerminal, etc.
  wordOffset: number;     // 4 bytes: Offset in string table
  score: number;          // 4 bytes: Suggestion score
}

// Serialization
// (computeBitmap/packToBuffer/mmap: platform-specific helpers, elided)
function serializeTrie(root: TrieNode): ArrayBuffer {
  const strings: string[] = [];
  const stringOffsets = new Map<string, number>();
  let stringTableLength = 0;

  // Intern a word: store each distinct string once
  function internWord(word: string | null): number {
    if (!word) return -1;
    if (!stringOffsets.has(word)) {
      stringOffsets.set(word, stringTableLength);
      strings.push(word + '\0');
      stringTableLength += word.length + 1;
    }
    return stringOffsets.get(word)!;
  }

  // Phase 1: BFS to assign indices. BFS guarantees each node's children
  // occupy a contiguous run, so a single childrenOffset (plus the
  // bitmap) is enough to address all of them.
  const order: TrieNode[] = [];
  const queue: TrieNode[] = [root];
  while (queue.length > 0) {
    const node = queue.shift()!;
    order.push(node);
    for (const child of node.children.values()) {
      queue.push(child);
    }
  }
  const indexOf = new Map<TrieNode, number>();
  order.forEach((node, i) => indexOf.set(node, i));

  // Phase 2: emit fixed-size records in BFS order
  const nodes: SerializedNode[] = order.map(node => {
    const firstChild = node.children.values().next().value;
    return {
      childBitmap: computeBitmap(node.children),
      childrenOffset: firstChild ? indexOf.get(firstChild)! : -1,
      flags: node.isTerminal ? 1 : 0,
      wordOffset: internWord(node.word),
      score: node.metadata?.score ?? 0,
    };
  });

  // Pack into ArrayBuffer
  return packToBuffer(nodes, strings);
}

// Deserialization: Memory-map the file for zero-copy access
function loadTrie(path: string): MemoryMappedTrie {
  const buffer = mmap(path);
  return new MemoryMappedTrie(buffer);
}

class MemoryMappedTrie {
  private buffer: ArrayBuffer;
  private nodeView: DataView;
  private stringTable: Uint8Array;

  search(prefix: string): Suggestion[] {
    // Navigate using offsets directly into the buffer
    // No deserialization needed - read nodes in-place
    return []; // sketch
  }
}
```

For large tries, memory-mapping the serialized file offers significant advantages:

- Near-instant startup: no parsing or object construction; the OS pages data in on demand
- The page cache is shared across processes, so multiple workers read one physical copy
- Zero-copy queries: navigation reads offsets directly in the mapped buffer
- Cold regions of the trie never consume physical memory
This is how systems like Lucene/Elasticsearch handle their FST (Finite State Transducer) indices.
For memory-mapped access, ensure fields are properly aligned (4-byte fields at 4-byte boundaries, 8-byte at 8-byte). Misaligned access is significantly slower on most CPUs and may cause crashes on some architectures.
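To make the layout concrete, here is a hypothetical 16-byte record written and read through `DataView`, with each field placed at an offset that is a multiple of its size. (`DataView` itself tolerates unaligned offsets; the alignment discipline matters when native code maps the same file.)

```typescript
// Illustrative fixed-size record: fields padded to natural alignment,
// record size kept a multiple of 8 for array indexing.
const RECORD_SIZE = 16;

function writeRecord(
  view: DataView,
  i: number,
  childrenOffset: number,
  flags: number,
  score: number
): void {
  const base = i * RECORD_SIZE;
  view.setUint32(base + 0, childrenOffset, true); // bytes 0-3
  view.setUint32(base + 4, flags, true);          // bytes 4-7 (1 byte used, 3 padding)
  view.setFloat32(base + 8, score, true);         // bytes 8-11
  // bytes 12-15 reserved for future fields / padding
}

function readScore(view: DataView, i: number): number {
  return view.getFloat32(i * RECORD_SIZE + 8, true);
}

const buf = new ArrayBuffer(RECORD_SIZE * 2);
const view = new DataView(buf);
writeRecord(view, 0, 1, 1, 0.5);
writeRecord(view, 1, -1, 0, 0.25);
console.log(readScore(view, 0), readScore(view, 1)); // 0.5 0.25
```

Fixed-size, aligned records also make random access trivial: node i lives at byte offset `i * RECORD_SIZE`, with no index structure needed.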
We've covered the trie data structure comprehensively. To consolidate, here is a decision guide for choosing a variant:
```
Corpus size < 1M AND updates infrequent?
├── Yes: Standard radix trie in memory
└── No: Evaluate further
    │
Corpus size < 100M AND updates < 1K/sec?
├── Yes: Radix trie with bitmap children + COW updates
└── No: Evaluate further
    │
Corpus > 100M OR memory constrained?
├── Yes: Succinct trie (memory-mapped)
└── No: Distributed trie (next module)
```
What's next:
With tries understood, the next page explores Prefix Matching in detail—handling multi-word queries, word boundaries, case sensitivity, Unicode normalization, and the edge cases that trip up naive implementations.
You now have deep knowledge of tries as the foundational data structure for typeahead systems. This understanding enables you to make informed trade-offs between memory, speed, and complexity based on your specific requirements.