Database Management SystemsB-Tree Operations

B-Tree Operations: Search, Insert, and Delete

LevelIntermediate

Duration90 mins

TopicB-Tree Operations

4 / 5

Delete Algorithm

Maintaining Balance: The Art of B-Tree Deletion

B-tree deletion is arguably the most complex of the three core operations. While search is straightforward and insertion needs only splitting, deletion must handle underflow through multiple strategies: borrowing from siblings, merging with siblings, and rebalancing separators in ancestors.

Every DELETE FROM statement that affects an indexed column triggers B-tree deletions across all relevant indexes. Understanding this operation is essential for maintaining database performance and ensuring index integrity.

What You Will Master

By completing this page, you will understand: (1) The complete deletion algorithm with all cases, (2) When and how to borrow keys from siblings, (3) When and why node merging occurs, (4) Handling deletion of keys in internal nodes, (5) Cascading rebalancing through the tree, and (6) Production considerations including lazy deletion and tombstones.

Deletion Fundamentals and Invariants

Deletion must preserve all B-tree invariants while removing a key. The challenge arises when removal would leave a node with fewer than the minimum required keys.

Invariants that Deletion Must Maintain:

Minimum Occupancy: Every non-root node must have at least ⌈t/2⌉-1 keys (typically t-1 for minimum degree t)
Maximum Occupancy: No node exceeds 2t-1 keys
Ordering: Keys within nodes and across the tree remain sorted
Balance: All leaves remain at the same depth
Root Occupancy: Root can have as few as 1 key (or 0 if tree becomes empty)

The Underflow Problem:

When a node has exactly t-1 keys (minimum allowed) and we delete one key:

The node would have t-2 keys
This violates the minimum occupancy invariant
We must rebalance by borrowing from siblings or merging

Node States Before and After Deletion
Scenario	Before Deletion	After Naive Delete	Invariant Violated?	Action Needed
Node has spare keys	k keys (k > t-1)	k-1 keys	No	None—simple delete
Node at minimum	t-1 keys	t-2 keys	Yes—underflow	Borrow or merge
Root with 1 key	1 key	0 keys	Depends	Collapse tree height

Deletion vs. Insertion Symmetry

Deletion is the mirror of insertion: insertion causes overflow (→ split), deletion causes underflow (→ merge). However, deletion is more complex because we may need to delete from internal nodes (not just leaves), requiring predecessor/successor strategies.

The Three Deletion Cases

B-tree deletion breaks down into three fundamental cases based on where the key is located:

Case 1: Key is in a Leaf Node

The simplest case—if removing the key doesn't cause underflow, simply remove it. If underflow occurs, rebalance with siblings.

Case 2: Key is in an Internal Node

We cannot simply remove from an internal node because it serves as a separator. Instead, replace it with its predecessor or successor from a leaf, then delete that replacement from the leaf.

Case 3: Key is Not in Current Node

The key might be in a subtree. Ensure the child we descend into has enough keys to handle a potential deletion (proactive rebalancing), then recurse.

High-Level Algorithm:

BTREE-DELETE(node, key):
    i ← FindKeyIndex(node, key)  // Position where key is or would be
    
    IF key is found at node.keys[i]:  // Key in this node
        IF node.isLeaf:
            CASE 1: Delete from leaf
        ELSE:
            CASE 2: Delete from internal node
    ELSE:  // Key not in this node
        IF node.isLeaf:
            RETURN "Key not found"  // Base case: key doesn't exist
        ELSE:
            CASE 3: Ensure child has enough keys, then recurse

Let's examine each case in detail.

Proactive Rebalancing

Similar to proactive splitting during insertion, proactive rebalancing ensures that when we descend into a child, it has at least t keys (one more than minimum). This guarantees we can always delete without needing to propagate rebalancing upward.

Case 1: Deletion from Leaf Nodes

Leaf deletion is the base case—the key and its associated data are directly removed. The complexity arises from maintaining minimum occupancy.

Sub-case 1a: Leaf has spare keys

If the leaf has more than t-1 keys, simply remove the key:

DELETE-FROM-LEAF(leaf, keyIndex):
    // Shift keys and values left to close the gap
    FOR i ← keyIndex TO leaf.numKeys - 2:
        leaf.keys[i] ← leaf.keys[i + 1]
        leaf.values[i] ← leaf.values[i + 1]
    leaf.numKeys ← leaf.numKeys - 1
    DISK-WRITE(leaf)

Example:

Before: [10, 20, 30, 40]  (t=3, min=2 keys, current=4)
Delete 20: [10, 30, 40]  (3 keys ≥ 2, OK)

Sub-case 1b: Leaf will underflow—borrow from sibling

If deleting would cause underflow but a sibling has spare keys, borrow:

BORROW-FROM-LEFT-SIBLING(parent, nodeIndex):
    node ← parent.children[nodeIndex]
    leftSibling ← parent.children[nodeIndex - 1]
    
    // Make room in node by shifting keys right
    FOR i ← node.numKeys - 1 DOWNTO 0:
        node.keys[i + 1] ← node.keys[i]
        node.values[i + 1] ← node.values[i]
    
    // Pull separator from parent into node's first position
    node.keys[0] ← parent.keys[nodeIndex - 1]
    
    // Move sibling's last key up to parent as new separator
    parent.keys[nodeIndex - 1] ← leftSibling.keys[leftSibling.numKeys - 1]
    
    // Transfer sibling's last value to node
    node.values[0] ← leftSibling.values[leftSibling.numKeys - 1]
    
    leftSibling.numKeys ← leftSibling.numKeys - 1
    node.numKeys ← node.numKeys + 1
    
    DISK-WRITE(leftSibling)
    DISK-WRITE(node)
    DISK-WRITE(parent)

Example:

Before:
  Parent: [..., 30, ...]
            /      \
   [10, 20, 25]    [40]  ← Need to delete 40, will underflow

Borrow from left sibling:
  Parent: [..., 25, ...]
            /      \
   [10, 20]        [30, 40]  
   
Now delete 40:
  Parent: [..., 25, ...]
            /      \
   [10, 20]        [30]  ← Valid (≥ min keys)

Borrowing Direction

Check both left and right siblings for spare keys. Prefer the sibling with more keys (reduces future borrowing) or the one that avoids expensive I/O (if one is already cached). Some implementations always try left first for consistency.

Case 1c: Leaf Merging When Siblings Can't Spare

When no sibling has spare keys to borrow, we must merge the underflowing node with a sibling. This is the inverse of splitting.

Merge Algorithm:

MERGE-NODES(parent, leftIndex):
    // Merge children[leftIndex] and children[leftIndex + 1]
    leftNode ← parent.children[leftIndex]
    rightNode ← parent.children[leftIndex + 1]
    
    // Pull separator key from parent into left node
    leftNode.keys[leftNode.numKeys] ← parent.keys[leftIndex]
    leftNode.numKeys ← leftNode.numKeys + 1
    
    // Copy all keys from right node to left node
    FOR j ← 0 TO rightNode.numKeys - 1:
        leftNode.keys[leftNode.numKeys] ← rightNode.keys[j]
        leftNode.values[leftNode.numKeys] ← rightNode.values[j]
        leftNode.numKeys ← leftNode.numKeys + 1
    
    // If internal nodes, also copy child pointers
    IF NOT leftNode.isLeaf:
        FOR j ← 0 TO rightNode.numKeys:
            leftNode.children[leftNode.numKeys - rightNode.numKeys + j] ← rightNode.children[j]
    
    // Remove separator and right child from parent
    FOR j ← leftIndex TO parent.numKeys - 2:
        parent.keys[j] ← parent.keys[j + 1]
        parent.children[j + 1] ← parent.children[j + 2]
    parent.numKeys ← parent.numKeys - 1
    
    // Free the right node
    FREE-NODE(rightNode)
    
    DISK-WRITE(leftNode)
    DISK-WRITE(parent)

Merge Example (t=3, min keys=2):

Before (need to delete 60 from right leaf):
        Parent: [50]
               /    \
        [30, 40]    [60]  ← Will underflow (has minimum)

Both siblings at minimum, cannot borrow.
Merge left + separator + right:

        Parent: []   ← Empty! Might collapse if root
              |
    [30, 40, 50, 60]

Now delete 60:
        Parent: []   
              |
    [30, 40, 50]  ← Valid leaf

Key insight: Merging reduces the parent's key count by 1 (the separator is consumed). If the parent was at minimum, this triggers cascading rebalancing.

Merge + Delete Ordering

The merge must happen BEFORE the actual key deletion. Merge ensures the node has enough keys; then deletion removes one key, leaving at least t-1. Reversing this order would temporarily violate invariants.

B+-Tree Leaf Merge Specifics:

For B+-tree leaves, merging also requires updating the sibling linked list:

MERGE-LEAF-NODES(parent, leftIndex):
    // ... standard merge operations ...
    
    // Update leaf linked list
    leftNode.nextLeaf ← rightNode.nextLeaf
    IF rightNode.nextLeaf ≠ NULL:
        rightNode.nextLeaf.prevLeaf ← leftNode

Forgetting this update breaks range query traversal after the merge point.

Case 2: Deletion from Internal Nodes

Deleting a key from an internal node is more complex because internal keys serve as separators—they define the boundaries between subtrees. We cannot simply remove them without breaking the tree structure.

Strategy: Replace and Delete

Instead of removing the key directly, replace it with its in-order predecessor or in-order successor (both are in leaves), then delete that replacement from the leaf.

    [key to delete: 50]
           /          \
    [..., 45]        [55, ...]
           ↑              ↑
    Predecessor    Successor
      (45)           (55)

Predecessor: The rightmost key in the left subtree Successor: The leftmost key in the right subtree

Both are guaranteed to be in leaf nodes, converting Case 2 to Case 1.

Algorithm for Case 2:

DELETE-FROM-INTERNAL(node, keyIndex):
    key ← node.keys[keyIndex]
    
    // Case 2a: Left child has enough keys
    IF node.children[keyIndex].numKeys >= t:
        predecessor ← GET-PREDECESSOR(node.children[keyIndex])
        node.keys[keyIndex] ← predecessor.key
        BTREE-DELETE(node.children[keyIndex], predecessor.key)
    
    // Case 2b: Right child has enough keys
    ELSE IF node.children[keyIndex + 1].numKeys >= t:
        successor ← GET-SUCCESSOR(node.children[keyIndex + 1])
        node.keys[keyIndex] ← successor.key
        BTREE-DELETE(node.children[keyIndex + 1], successor.key)
    
    // Case 2c: Both children have minimum keys—merge, then delete
    ELSE:
        MERGE-NODES(node, keyIndex)  // Merges children[keyIndex] and [keyIndex+1]
        BTREE-DELETE(node.children[keyIndex], key)  // Key now in merged node

GET-PREDECESSOR(node):
    // Navigate to rightmost leaf, return rightmost key
    WHILE NOT node.isLeaf:
        node ← node.children[node.numKeys]
    RETURN node.keys[node.numKeys - 1]

GET-SUCCESSOR(node):
    // Navigate to leftmost leaf, return leftmost key
    WHILE NOT node.isLeaf:
        node ← node.children[0]
    RETURN node.keys[0]

Internal Deletion Sub-cases
Sub-case	Condition	Action
2a	Left child has ≥ t keys	Replace with predecessor, delete predecessor from left subtree
2b	Right child has ≥ t keys	Replace with successor, delete successor from right subtree
2c	Both children have t-1 keys	Merge children with separator, then delete from merged node

Why t Keys, Not t-1?

We check for ≥ t keys (not just > t-1) because we're about to delete from that subtree. Having t keys means after deletion the child will have t-1 keys (still valid). Having exactly t-1 keys means deletion would cause underflow.

Case 3: Proactive Rebalancing During Descent

When the key to delete is not in the current node, we must descend to a child. But what if that child has exactly t-1 keys? Deleting from it would cause underflow.

Proactive Rebalancing: Before descending, ensure the target child has at least t keys. This guarantees that deletion can proceed without upward propagation.

CASE-3-ENSURE-CHILD-HAS-ENOUGH(node, childIndex):
    child ← node.children[childIndex]
    
    IF child.numKeys >= t:
        RETURN  // Already has enough keys
    
    // Child has exactly t-1 keys; need to add one
    leftSibling ← childIndex > 0 ? node.children[childIndex - 1] : NULL
    rightSibling ← childIndex < node.numKeys ? node.children[childIndex + 1] : NULL
    
    // Case 3a: Borrow from left sibling
    IF leftSibling ≠ NULL AND leftSibling.numKeys >= t:
        BORROW-FROM-LEFT(node, childIndex)
    
    // Case 3b: Borrow from right sibling
    ELSE IF rightSibling ≠ NULL AND rightSibling.numKeys >= t:
        BORROW-FROM-RIGHT(node, childIndex)
    
    // Case 3c: Can't borrow; merge with a sibling
    ELSE:
        IF leftSibling ≠ NULL:
            MERGE-NODES(node, childIndex - 1)  // Merge child with left sibling
            childIndex ← childIndex - 1        // Merged node is at original left sibling's position
        ELSE:
            MERGE-NODES(node, childIndex)      // Merge child with right sibling
    
    // Now descend into the child (which now has enough keys)
    RETURN node.children[childIndex]

Borrowing in Detail:

BORROW-FROM-LEFT(parent, childIndex):
    child ← parent.children[childIndex]
    leftSibling ← parent.children[childIndex - 1]
    
    // Shift child's keys/children right to make room
    FOR i ← child.numKeys DOWNTO 0:
        child.keys[i + 1] ← child.keys[i]
    IF NOT child.isLeaf:
        FOR i ← child.numKeys + 1 DOWNTO 0:
            child.children[i + 1] ← child.children[i]
    
    // Parent's separator moves down to child
    child.keys[0] ← parent.keys[childIndex - 1]
    
    // Sibling's rightmost key moves up to parent
    parent.keys[childIndex - 1] ← leftSibling.keys[leftSibling.numKeys - 1]
    
    // If internal, also transfer sibling's rightmost child
    IF NOT child.isLeaf:
        child.children[0] ← leftSibling.children[leftSibling.numKeys]
    
    leftSibling.numKeys ← leftSibling.numKeys - 1
    child.numKeys ← child.numKeys + 1

The rotation brings the separator down and the sibling's key up, maintaining order.

Single-Pass Deletion

Like proactive splitting for insertion, proactive rebalancing enables single-pass deletion. We never need to backtrack up the tree—all potential underflow is fixed during descent. This simplifies implementation and improves performance.

Root Collapsing: Reducing Tree Height

When deletion causes the root to become empty (0 keys), the tree height must decrease. This is the inverse of root splitting.

Condition: Root has 0 keys and 1 child (can only happen after a merge that consumed the root's last separator).

Action: The single child becomes the new root.

COLLAPSE-ROOT-IF-NEEDED(tree):
    IF tree.root.numKeys = 0 AND NOT tree.root.isLeaf:
        oldRoot ← tree.root
        tree.root ← tree.root.children[0]  // Sole child becomes root
        FREE-NODE(oldRoot)
        tree.height ← tree.height - 1

Example:

Before (delete 50, causes merge, root has 1 key consumed):
         [50]           ← Root with 1 key
        /    \
    [30]     [70]

After merge of root's children:
         []              ← Root empty!
         |
    [30, 50, 70]         ← Merged child

After collapse:
    [30, 50, 70]         ← This is now the root

Now delete 50:
    [30, 70]             ← Tree height reduced by 1

Edge Case: Deleting the Last Key

When the tree contains a single key in a single-node root:

Tree: [42]  ← Only key in tree

Delete 42:
  - Root becomes empty
  - Root is a leaf (no children)
  - Tree is empty

EMPTY-TREE(tree):
    IF tree.root.numKeys = 0 AND tree.root.isLeaf:
        tree.root ← NULL
        // Or keep empty root for future insertions

Some implementations maintain an empty root node rather than nullifying it, avoiding allocation on the next insert.

Height Reduction is Rare

Just as root splits (height increase) are rare, root collapses (height decrease) are equally rare. For a tree with t=100, the root must be reduced from 1 key to 0 through deletion—requiring approximately half the tree's content to be deleted to trigger a height reduction.

The Complete Deletion Algorithm

Let's consolidate all cases into a complete, production-ready algorithm:

BTREE-DELETE(tree, key):
    // Handle root collapse after deletion
    DELETE-FROM-NODE(tree.root, key)
    
    // Check if root needs collapsing
    IF tree.root.numKeys = 0 AND NOT tree.root.isLeaf:
        tree.root ← tree.root.children[0]
        FREE-NODE(oldRoot)

DELETE-FROM-NODE(node, key):
    i ← FindKeyPosition(node, key)
    
    // Key found in this node
    IF i < node.numKeys AND node.keys[i] = key:
        IF node.isLeaf:
            // Case 1: Simple leaf deletion
            DeleteFromLeaf(node, i)
        ELSE:
            // Case 2: Internal node deletion
            DeleteFromInternal(node, i)
    
    // Key not in this node
    ELSE:
        IF node.isLeaf:
            RETURN  // Key not found in tree
        
        // Ensure child has enough keys before descending
        EnsureChildHasEnoughKeys(node, i)
        
        // Recurse into appropriate child
        DELETE-FROM-NODE(node.children[i], key)

btree-delete.ts
TypeScript
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
interface BTreeNode {
    keys: Key[];
    values?: Value[];  // Only for leaves
    children?: BTreeNode[];  // Only for internal nodes
    isLeaf: boolean;
    numKeys: number;
}
 
class BTree {
    root: BTreeNode;
    t: number;  // Minimum degree
 
    delete(key: Key): boolean {
        if (!this.root) return false;
        
        const deleted = this.deleteFromNode(this.root, key);
        
        // Handle root collapse
        if (this.root.numKeys === 0 && !this.root.isLeaf) {
            this.root = this.root.children![0];
        }
        
        return deleted;
    }
 
    private deleteFromNode(node: BTreeNode, key: Key): boolean {
        let i = this.findKeyPosition(node, key);
        
        // Key found in this node
        if (i < node.numKeys && node.keys[i] === key) {
            if (node.isLeaf) {
                return this.deleteFromLeaf(node, i);
            } else {
                return this.deleteFromInternal(node, i);
            }
        }
        
        // Key not in this node; must be in subtree
        if (node.isLeaf) {
            return false;  // Key not found
        }
        
        // Ensure child at position i has enough keys
        this.ensureChildHasEnoughKeys(node, i);
        
        // Recurse; note that child index might have changed after merge
        return this.deleteFromNode(node.children![i], key);
    }
 
    private deleteFromLeaf(leaf: BTreeNode, keyIndex: number): boolean {
        // Shift keys left
        for (let j = keyIndex; j < leaf.numKeys - 1; j++) {
            leaf.keys[j] = leaf.keys[j + 1];
            if (leaf.values) leaf.values[j] = leaf.values[j + 1];
        }
        leaf.numKeys--;
        return true;
    }
    
    // ... deleteFromInternal, ensureChildHasEnoughKeys implementations
}

Index Adjustment After Merge

When ensureChildHasEnoughKeys performs a merge with the left sibling, the target child shifts left by one position. The algorithm must account for this; otherwise, recursion proceeds into the wrong child. This is a common bug source.

Lazy Deletion and Tombstones

Full deletion (with rebalancing) is expensive. Many production systems use lazy deletion strategies that defer or avoid structural modifications.

Strategy 1: Tombstone Marking

Instead of removing keys, mark them as deleted:

LAZY-DELETE(tree, key):
    node ← FindLeafContaining(tree.root, key)
    i ← FindKeyInNode(node, key)
    
    IF key found:
        node.deleted[i] ← TRUE  // Tombstone marker
        tree.deletedCount ← tree.deletedCount + 1
        DISK-WRITE(node)

Advantages:

O(log n) deletion with single write
No structural changes
Easy to implement concurrent deletion

Disadvantages:

Wasted space for deleted keys
Searches must skip tombstones
Eventually need compaction

Strategy 2: Deferred Compaction

Combine tombstone marking with periodic compaction:

PERIODIC-COMPACTION(tree):
    IF tree.deletedCount / tree.totalKeys > THRESHOLD:
        FOR each node in tree:
            CompactNode(node)  // Remove tombstones, rebalance if needed
        RebuildIfNecessary(tree)
        tree.deletedCount ← 0

This amortizes structural modification cost across many deletions.

Strategy 3: MVCC (Multi-Version Concurrency Control)

Rather than deleting, create a new version of the tree without the key:

Old version remains for active transactions
Garbage collection removes unreachable versions
Common in PostgreSQL, CockroachDB, etc.

Strategy 4: Log-Structured Merge (LSM) Trees

An entirely different approach:

All writes go to an in-memory buffer
Deletions are "delete markers" in the buffer
Periodic merge compacts and applies deletions
Used by LevelDB, RocksDB, Cassandra

Deletion Strategy Comparison
Strategy	Delete Latency	Space Overhead	Complexity	Best For
Immediate (full)	O(log n)	None	High	Read-heavy, low delete rate
Tombstone	O(log n)	Growing	Low	Balanced workloads
Deferred compaction	O(log n) + batch	Temporary	Medium	Write-heavy with batch windows
MVCC	O(log n)	Per version	High	Concurrent transactions
LSM	O(1) amortized	Write buffers	High	Write-heavy workloads

Choosing a Strategy

Most databases use hybrid approaches: immediate deletion for small trees, tombstones for hot paths with periodic compaction, and MVCC for transaction support. Understand your workload's read/write/delete ratio to choose appropriately.

Summary: Mastering B-Tree Deletion

B-tree deletion is the most complex core operation, requiring careful handling of underflow through borrowing and merging. Let's consolidate the essential knowledge:

Key Takeaways

•Three cases govern deletion — Leaf deletion (simple), internal node deletion (replace with predecessor/successor), and recursive descent with proactive rebalancing.
•Borrowing maintains minimum occupancy — When a node would underflow, try borrowing from siblings first; this is cheaper than merging.
•Merging is splitting's inverse — When borrowing isn't possible, merge the underflowing node with a sibling, consuming a separator from the parent.
•Proactive rebalancing enables single-pass deletion — By ensuring children have enough keys before descending, we avoid backtracking up the tree.
•Root collapsing reduces height — When the root becomes empty, its single child becomes the new root, decreasing tree height.
•Lazy deletion trades correctness for performance — Tombstones and deferred compaction reduce deletion cost for write-heavy workloads.

Ready for Node Merging Deep Dive

You now understand the B-tree deletion algorithm comprehensively. The next page examines node merging in greater detail—exploring the mechanics, edge cases, and optimization techniques that production systems employ.

4 / 5

Loading learning content...

Database Management SystemsB-Tree Operations

B-Tree Operations: Search, Insert, and Delete

LevelIntermediate

Duration90 mins

TopicB-Tree Operations

4 / 5

Delete Algorithm

Maintaining Balance: The Art of B-Tree Deletion

What You Will Master

Deletion Fundamentals and Invariants

Deletion must preserve all B-tree invariants while removing a key. The challenge arises when removal would leave a node with fewer than the minimum required keys.

Invariants that Deletion Must Maintain:

Minimum Occupancy: Every non-root node must have at least ⌈t/2⌉-1 keys (typically t-1 for minimum degree t)
Maximum Occupancy: No node exceeds 2t-1 keys
Ordering: Keys within nodes and across the tree remain sorted
Balance: All leaves remain at the same depth
Root Occupancy: Root can have as few as 1 key (or 0 if tree becomes empty)

The Underflow Problem:

When a node has exactly t-1 keys (minimum allowed) and we delete one key:

The node would have t-2 keys
This violates the minimum occupancy invariant
We must rebalance by borrowing from siblings or merging

Node States Before and After Deletion
Scenario	Before Deletion	After Naive Delete	Invariant Violated?	Action Needed
Node has spare keys	k keys (k > t-1)	k-1 keys	No	None—simple delete
Node at minimum	t-1 keys	t-2 keys	Yes—underflow	Borrow or merge
Root with 1 key	1 key	0 keys	Depends	Collapse tree height

Deletion vs. Insertion Symmetry

The Three Deletion Cases

B-tree deletion breaks down into three fundamental cases based on where the key is located:

Case 1: Key is in a Leaf Node

The simplest case—if removing the key doesn't cause underflow, simply remove it. If underflow occurs, rebalance with siblings.

Case 2: Key is in an Internal Node

We cannot simply remove from an internal node because it serves as a separator. Instead, replace it with its predecessor or successor from a leaf, then delete that replacement from the leaf.

Case 3: Key is Not in Current Node

The key might be in a subtree. Ensure the child we descend into has enough keys to handle a potential deletion (proactive rebalancing), then recurse.

High-Level Algorithm:

BTREE-DELETE(node, key):
    i ← FindKeyIndex(node, key)  // Position where key is or would be
    
    IF key is found at node.keys[i]:  // Key in this node
        IF node.isLeaf:
            CASE 1: Delete from leaf
        ELSE:
            CASE 2: Delete from internal node
    ELSE:  // Key not in this node
        IF node.isLeaf:
            RETURN "Key not found"  // Base case: key doesn't exist
        ELSE:
            CASE 3: Ensure child has enough keys, then recurse

Let's examine each case in detail.

Proactive Rebalancing

Case 1: Deletion from Leaf Nodes

Leaf deletion is the base case—the key and its associated data are directly removed. The complexity arises from maintaining minimum occupancy.

Sub-case 1a: Leaf has spare keys

If the leaf has more than t-1 keys, simply remove the key:

DELETE-FROM-LEAF(leaf, keyIndex):
    // Shift keys and values left to close the gap
    FOR i ← keyIndex TO leaf.numKeys - 2:
        leaf.keys[i] ← leaf.keys[i + 1]
        leaf.values[i] ← leaf.values[i + 1]
    leaf.numKeys ← leaf.numKeys - 1
    DISK-WRITE(leaf)

Example:

Before: [10, 20, 30, 40]  (t=3, min=2 keys, current=4)
Delete 20: [10, 30, 40]  (3 keys ≥ 2, OK)

Sub-case 1b: Leaf will underflow—borrow from sibling

If deleting would cause underflow but a sibling has spare keys, borrow:

BORROW-FROM-LEFT-SIBLING(parent, nodeIndex):
    node ← parent.children[nodeIndex]
    leftSibling ← parent.children[nodeIndex - 1]
    
    // Make room in node by shifting keys right
    FOR i ← node.numKeys - 1 DOWNTO 0:
        node.keys[i + 1] ← node.keys[i]
        node.values[i + 1] ← node.values[i]
    
    // Pull separator from parent into node's first position
    node.keys[0] ← parent.keys[nodeIndex - 1]
    
    // Move sibling's last key up to parent as new separator
    parent.keys[nodeIndex - 1] ← leftSibling.keys[leftSibling.numKeys - 1]
    
    // Transfer sibling's last value to node
    node.values[0] ← leftSibling.values[leftSibling.numKeys - 1]
    
    leftSibling.numKeys ← leftSibling.numKeys - 1
    node.numKeys ← node.numKeys + 1
    
    DISK-WRITE(leftSibling)
    DISK-WRITE(node)
    DISK-WRITE(parent)

Example:

Before:
  Parent: [..., 30, ...]
            /      \
   [10, 20, 25]    [40]  ← Need to delete 40, will underflow

Borrow from left sibling:
  Parent: [..., 25, ...]
            /      \
   [10, 20]        [30, 40]  
   
Now delete 40:
  Parent: [..., 25, ...]
            /      \
   [10, 20]        [30]  ← Valid (≥ min keys)

Borrowing Direction

Case 1c: Leaf Merging When Siblings Can't Spare

When no sibling has spare keys to borrow, we must merge the underflowing node with a sibling. This is the inverse of splitting.

Merge Algorithm:

MERGE-NODES(parent, leftIndex):
    // Merge children[leftIndex] and children[leftIndex + 1]
    leftNode ← parent.children[leftIndex]
    rightNode ← parent.children[leftIndex + 1]
    
    // Pull separator key from parent into left node
    leftNode.keys[leftNode.numKeys] ← parent.keys[leftIndex]
    leftNode.numKeys ← leftNode.numKeys + 1
    
    // Copy all keys from right node to left node
    FOR j ← 0 TO rightNode.numKeys - 1:
        leftNode.keys[leftNode.numKeys] ← rightNode.keys[j]
        leftNode.values[leftNode.numKeys] ← rightNode.values[j]
        leftNode.numKeys ← leftNode.numKeys + 1
    
    // If internal nodes, also copy child pointers
    IF NOT leftNode.isLeaf:
        FOR j ← 0 TO rightNode.numKeys:
            leftNode.children[leftNode.numKeys - rightNode.numKeys + j] ← rightNode.children[j]
    
    // Remove separator and right child from parent
    FOR j ← leftIndex TO parent.numKeys - 2:
        parent.keys[j] ← parent.keys[j + 1]
        parent.children[j + 1] ← parent.children[j + 2]
    parent.numKeys ← parent.numKeys - 1
    
    // Free the right node
    FREE-NODE(rightNode)
    
    DISK-WRITE(leftNode)
    DISK-WRITE(parent)

Merge Example (t=3, min keys=2):

Before (need to delete 60 from right leaf):
        Parent: [50]
               /    \
        [30, 40]    [60]  ← Will underflow (has minimum)

Both siblings at minimum, cannot borrow.
Merge left + separator + right:

        Parent: []   ← Empty! Might collapse if root
              |
    [30, 40, 50, 60]

Now delete 60:
        Parent: []   
              |
    [30, 40, 50]  ← Valid leaf

Key insight: Merging reduces the parent's key count by 1 (the separator is consumed). If the parent was at minimum, this triggers cascading rebalancing.

Merge + Delete Ordering

B+-Tree Leaf Merge Specifics:

For B+-tree leaves, merging also requires updating the sibling linked list:

MERGE-LEAF-NODES(parent, leftIndex):
    // ... standard merge operations ...
    
    // Update leaf linked list
    leftNode.nextLeaf ← rightNode.nextLeaf
    IF rightNode.nextLeaf ≠ NULL:
        rightNode.nextLeaf.prevLeaf ← leftNode

Forgetting this update breaks range query traversal after the merge point.

Case 2: Deletion from Internal Nodes

Strategy: Replace and Delete

Instead of removing the key directly, replace it with its in-order predecessor or in-order successor (both are in leaves), then delete that replacement from the leaf.

    [key to delete: 50]
           /          \
    [..., 45]        [55, ...]
           ↑              ↑
    Predecessor    Successor
      (45)           (55)

Predecessor: The rightmost key in the left subtree Successor: The leftmost key in the right subtree

Both are guaranteed to be in leaf nodes, converting Case 2 to Case 1.

Algorithm for Case 2:

DELETE-FROM-INTERNAL(node, keyIndex):
    key ← node.keys[keyIndex]
    
    // Case 2a: Left child has enough keys
    IF node.children[keyIndex].numKeys >= t:
        predecessor ← GET-PREDECESSOR(node.children[keyIndex])
        node.keys[keyIndex] ← predecessor.key
        BTREE-DELETE(node.children[keyIndex], predecessor.key)
    
    // Case 2b: Right child has enough keys
    ELSE IF node.children[keyIndex + 1].numKeys >= t:
        successor ← GET-SUCCESSOR(node.children[keyIndex + 1])
        node.keys[keyIndex] ← successor.key
        BTREE-DELETE(node.children[keyIndex + 1], successor.key)
    
    // Case 2c: Both children have minimum keys—merge, then delete
    ELSE:
        MERGE-NODES(node, keyIndex)  // Merges children[keyIndex] and [keyIndex+1]
        BTREE-DELETE(node.children[keyIndex], key)  // Key now in merged node

GET-PREDECESSOR(node):
    // Navigate to rightmost leaf, return rightmost key
    WHILE NOT node.isLeaf:
        node ← node.children[node.numKeys]
    RETURN node.keys[node.numKeys - 1]

GET-SUCCESSOR(node):
    // Navigate to leftmost leaf, return leftmost key
    WHILE NOT node.isLeaf:
        node ← node.children[0]
    RETURN node.keys[0]

Internal Deletion Sub-cases
Sub-case	Condition	Action
2a	Left child has ≥ t keys	Replace with predecessor, delete predecessor from left subtree
2b	Right child has ≥ t keys	Replace with successor, delete successor from right subtree
2c	Both children have t-1 keys	Merge children with separator, then delete from merged node

Why t Keys, Not t-1?

Case 3: Proactive Rebalancing During Descent

When the key to delete is not in the current node, we must descend to a child. But what if that child has exactly t-1 keys? Deleting from it would cause underflow.

Proactive Rebalancing: Before descending, ensure the target child has at least t keys. This guarantees that deletion can proceed without upward propagation.

CASE-3-ENSURE-CHILD-HAS-ENOUGH(node, childIndex):
    child ← node.children[childIndex]
    
    IF child.numKeys >= t:
        RETURN  // Already has enough keys
    
    // Child has exactly t-1 keys; need to add one
    leftSibling ← childIndex > 0 ? node.children[childIndex - 1] : NULL
    rightSibling ← childIndex < node.numKeys ? node.children[childIndex + 1] : NULL
    
    // Case 3a: Borrow from left sibling
    IF leftSibling ≠ NULL AND leftSibling.numKeys >= t:
        BORROW-FROM-LEFT(node, childIndex)
    
    // Case 3b: Borrow from right sibling
    ELSE IF rightSibling ≠ NULL AND rightSibling.numKeys >= t:
        BORROW-FROM-RIGHT(node, childIndex)
    
    // Case 3c: Can't borrow; merge with a sibling
    ELSE:
        IF leftSibling ≠ NULL:
            MERGE-NODES(node, childIndex - 1)  // Merge child with left sibling
            childIndex ← childIndex - 1        // Merged node is at original left sibling's position
        ELSE:
            MERGE-NODES(node, childIndex)      // Merge child with right sibling
    
    // Now descend into the child (which now has enough keys)
    RETURN node.children[childIndex]

Borrowing in Detail:

BORROW-FROM-LEFT(parent, childIndex):
    child ← parent.children[childIndex]
    leftSibling ← parent.children[childIndex - 1]
    
    // Shift child's keys/children right to make room
    FOR i ← child.numKeys DOWNTO 0:
        child.keys[i + 1] ← child.keys[i]
    IF NOT child.isLeaf:
        FOR i ← child.numKeys + 1 DOWNTO 0:
            child.children[i + 1] ← child.children[i]
    
    // Parent's separator moves down to child
    child.keys[0] ← parent.keys[childIndex - 1]
    
    // Sibling's rightmost key moves up to parent
    parent.keys[childIndex - 1] ← leftSibling.keys[leftSibling.numKeys - 1]
    
    // If internal, also transfer sibling's rightmost child
    IF NOT child.isLeaf:
        child.children[0] ← leftSibling.children[leftSibling.numKeys]
    
    leftSibling.numKeys ← leftSibling.numKeys - 1
    child.numKeys ← child.numKeys + 1

The rotation brings the separator down and the sibling's key up, maintaining order.

Single-Pass Deletion

Root Collapsing: Reducing Tree Height

When deletion causes the root to become empty (0 keys), the tree height must decrease. This is the inverse of root splitting.

Condition: Root has 0 keys and 1 child (can only happen after a merge that consumed the root's last separator).

Action: The single child becomes the new root.

COLLAPSE-ROOT-IF-NEEDED(tree):
    IF tree.root.numKeys = 0 AND NOT tree.root.isLeaf:
        oldRoot ← tree.root
        tree.root ← tree.root.children[0]  // Sole child becomes root
        FREE-NODE(oldRoot)
        tree.height ← tree.height - 1

Example:

Before (delete 50, causes merge, root has 1 key consumed):
         [50]           ← Root with 1 key
        /    \
    [30]     [70]

After merge of root's children:
         []              ← Root empty!
         |
    [30, 50, 70]         ← Merged child

After collapse:
    [30, 50, 70]         ← This is now the root

Now delete 50:
    [30, 70]             ← Tree height reduced by 1

Edge Case: Deleting the Last Key

When the tree contains a single key in a single-node root:

Tree: [42]  ← Only key in tree

Delete 42:
  - Root becomes empty
  - Root is a leaf (no children)
  - Tree is empty

EMPTY-TREE(tree):
    IF tree.root.numKeys = 0 AND tree.root.isLeaf:
        tree.root ← NULL
        // Or keep empty root for future insertions

Some implementations maintain an empty root node rather than nullifying it, avoiding allocation on the next insert.

Height Reduction is Rare

The Complete Deletion Algorithm

Let's consolidate all cases into a complete, production-ready algorithm:

BTREE-DELETE(tree, key):
    // Handle root collapse after deletion
    DELETE-FROM-NODE(tree.root, key)
    
    // Check if root needs collapsing
    IF tree.root.numKeys = 0 AND NOT tree.root.isLeaf:
        tree.root ← tree.root.children[0]
        FREE-NODE(oldRoot)

DELETE-FROM-NODE(node, key):
    i ← FindKeyPosition(node, key)
    
    // Key found in this node
    IF i < node.numKeys AND node.keys[i] = key:
        IF node.isLeaf:
            // Case 1: Simple leaf deletion
            DeleteFromLeaf(node, i)
        ELSE:
            // Case 2: Internal node deletion
            DeleteFromInternal(node, i)
    
    // Key not in this node
    ELSE:
        IF node.isLeaf:
            RETURN  // Key not found in tree
        
        // Ensure child has enough keys before descending
        EnsureChildHasEnoughKeys(node, i)
        
        // Recurse into appropriate child
        DELETE-FROM-NODE(node.children[i], key)

btree-delete.ts
TypeScript
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
interface BTreeNode {
    keys: Key[];
    values?: Value[];  // Only for leaves
    children?: BTreeNode[];  // Only for internal nodes
    isLeaf: boolean;
    numKeys: number;
}
 
class BTree {
    root: BTreeNode;
    t: number;  // Minimum degree
 
    delete(key: Key): boolean {
        if (!this.root) return false;
        
        const deleted = this.deleteFromNode(this.root, key);
        
        // Handle root collapse
        if (this.root.numKeys === 0 && !this.root.isLeaf) {
            this.root = this.root.children![0];
        }
        
        return deleted;
    }
 
    private deleteFromNode(node: BTreeNode, key: Key): boolean {
        let i = this.findKeyPosition(node, key);
        
        // Key found in this node
        if (i < node.numKeys && node.keys[i] === key) {
            if (node.isLeaf) {
                return this.deleteFromLeaf(node, i);
            } else {
                return this.deleteFromInternal(node, i);
            }
        }
        
        // Key not in this node; must be in subtree
        if (node.isLeaf) {
            return false;  // Key not found
        }
        
        // Ensure child at position i has enough keys
        this.ensureChildHasEnoughKeys(node, i);
        
        // Recurse; note that child index might have changed after merge
        return this.deleteFromNode(node.children![i], key);
    }
 
    private deleteFromLeaf(leaf: BTreeNode, keyIndex: number): boolean {
        // Shift keys left
        for (let j = keyIndex; j < leaf.numKeys - 1; j++) {
            leaf.keys[j] = leaf.keys[j + 1];
            if (leaf.values) leaf.values[j] = leaf.values[j + 1];
        }
        leaf.numKeys--;
        return true;
    }
    
    // ... deleteFromInternal, ensureChildHasEnoughKeys implementations
}

Index Adjustment After Merge

Lazy Deletion and Tombstones

Full deletion (with rebalancing) is expensive. Many production systems use lazy deletion strategies that defer or avoid structural modifications.

Strategy 1: Tombstone Marking

Instead of removing keys, mark them as deleted:

LAZY-DELETE(tree, key):
    node ← FindLeafContaining(tree.root, key)
    i ← FindKeyInNode(node, key)
    
    IF key found:
        node.deleted[i] ← TRUE  // Tombstone marker
        tree.deletedCount ← tree.deletedCount + 1
        DISK-WRITE(node)

Advantages:

O(log n) deletion with single write
No structural changes
Easy to implement concurrent deletion

Disadvantages:

Wasted space for deleted keys
Searches must skip tombstones
Eventually need compaction

Strategy 2: Deferred Compaction

Combine tombstone marking with periodic compaction:

PERIODIC-COMPACTION(tree):
    IF tree.deletedCount / tree.totalKeys > THRESHOLD:
        FOR each node in tree:
            CompactNode(node)  // Remove tombstones, rebalance if needed
        RebuildIfNecessary(tree)
        tree.deletedCount ← 0

This amortizes structural modification cost across many deletions.

Strategy 3: MVCC (Multi-Version Concurrency Control)

Rather than deleting, create a new version of the tree without the key:

Old version remains for active transactions
Garbage collection removes unreachable versions
Common in PostgreSQL, CockroachDB, etc.

Strategy 4: Log-Structured Merge (LSM) Trees

An entirely different approach:

All writes go to an in-memory buffer
Deletions are "delete markers" in the buffer
Periodic merge compacts and applies deletions
Used by LevelDB, RocksDB, Cassandra

Deletion Strategy Comparison
Strategy	Delete Latency	Space Overhead	Complexity	Best For
Immediate (full)	O(log n)	None	High	Read-heavy, low delete rate
Tombstone	O(log n)	Growing	Low	Balanced workloads
Deferred compaction	O(log n) + batch	Temporary	Medium	Write-heavy with batch windows
MVCC	O(log n)	Per version	High	Concurrent transactions
LSM	O(1) amortized	Write buffers	High	Write-heavy workloads

Choosing a Strategy

Summary: Mastering B-Tree Deletion

B-tree deletion is the most complex core operation, requiring careful handling of underflow through borrowing and merging. Let's consolidate the essential knowledge:

Key Takeaways

•Three cases govern deletion — Leaf deletion (simple), internal node deletion (replace with predecessor/successor), and recursive descent with proactive rebalancing.
•Borrowing maintains minimum occupancy — When a node would underflow, try borrowing from siblings first; this is cheaper than merging.
•Merging is splitting's inverse — When borrowing isn't possible, merge the underflowing node with a sibling, consuming a separator from the parent.
•Proactive rebalancing enables single-pass deletion — By ensuring children have enough keys before descending, we avoid backtracking up the tree.
•Root collapsing reduces height — When the root becomes empty, its single child becomes the new root, decreasing tree height.
•Lazy deletion trades correctness for performance — Tombstones and deferred compaction reduce deletion cost for write-heavy workloads.

Ready for Node Merging Deep Dive

4 / 5