Imagine you're building a hospital emergency room triage system. Patients arrive continuously, each with a severity level. At any moment, you need to answer one question instantly: Who needs treatment most urgently? Not who arrived first—the most critical patient, regardless of arrival time.
A simple queue fails here. A sorted list works but costs O(n) to maintain after each arrival. What you need is a data structure optimized for a specific operation pattern: efficient insertion of elements and efficient extraction of the extreme element (maximum or minimum).
This is precisely what a binary heap provides—and it does so with remarkable elegance, requiring no pointers, no complex balancing logic, and just O(log n) time for both core operations.
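The triage scenario above maps directly onto a priority queue. Here is a minimal sketch using Python's `heapq` module (a binary min-heap); the patient data is illustrative, with severity 1 meaning most critical so that the minimum element is the most urgent patient.

```python
import heapq

# Min-heap of (severity, patient) pairs: severity 1 = most critical,
# so the heap's root is always the most urgent patient.
waiting = []
heapq.heappush(waiting, (3, "Alice"))   # moderate — arrived first
heapq.heappush(waiting, (1, "Bob"))     # critical — arrived second
heapq.heappush(waiting, (2, "Carol"))   # serious — arrived third

# O(log n) extraction of the most urgent patient, regardless of arrival order.
severity, patient = heapq.heappop(waiting)
print(patient)  # Bob
```

Each push and pop costs O(log n), and peeking at `waiting[0]` is O(1), which is exactly the operation pattern the triage system needs.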
By the end of this page, you will understand the precise definition of a binary heap, including its two fundamental requirements (the heap property and the shape property), how these requirements differ from other tree structures, and why this specific combination enables the efficiency that makes heaps indispensable in computer science.
Let's establish the precise, formal definition that distinguishes a binary heap from all other data structures:
Definition: A binary heap is a data structure that satisfies two invariant properties simultaneously:
The Heap Property (ordering invariant): For every node N in the tree, the value stored at N satisfies a consistent ordering relationship with all of N's descendants.
The Shape Property (structural invariant): The tree is a complete binary tree—every level is fully filled except possibly the last, which is filled from left to right.
These two properties are independent but both necessary. A tree satisfying only the heap property might be wildly unbalanced. A complete binary tree without the heap property is just a complete binary tree. The magic—and the efficiency—emerges from requiring both.
Think of a binary heap as having a 'personality' and a 'body'. The heap property (personality) determines what values go where—it's about ordering. The shape property (body) determines the physical structure—it's about completeness. Both must hold at all times for the structure to function as a heap.
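The shape property on its own can be verified with a level-order traversal: in a complete tree, once a missing child appears, no further nodes may follow it. A sketch on a hypothetical node-based tree (`Node` and `is_complete` are illustrative names, not part of any standard library):

```python
from collections import deque

class Node:
    def __init__(self, value, left=None, right=None):
        self.value = value
        self.left = left
        self.right = right

def is_complete(root):
    """Check the shape property: every level full except possibly the
    last, which is filled left to right. A level-order walk of a
    complete tree never encounters a real node after the first gap."""
    if root is None:
        return True
    queue = deque([root])
    seen_gap = False
    while queue:
        node = queue.popleft()
        if node is None:
            seen_gap = True
        else:
            if seen_gap:          # a node after a gap violates completeness
                return False
            queue.append(node.left)
            queue.append(node.right)
    return True
```

Note that this check says nothing about values: a tree can pass `is_complete` while violating the heap property, and vice versa, which is exactly why both invariants must be stated separately.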
Why the term 'binary'?
The word binary in 'binary heap' refers to the tree structure: each node has at most two children. This distinguishes it from d-ary heaps (where each node has up to d children) and from more elaborate variants such as binomial, Fibonacci, and pairing heaps, which we survey later on this page.
We focus on binary heaps because they're the most practical: simple to understand, simple to implement, and efficient for nearly all use cases. The principles extend naturally to the variants.
| Data Structure | Ordering Property | Shape Property | Use Case |
|---|---|---|---|
| Binary Heap | Parent ≤ or ≥ children | Complete binary tree | Priority queues, HeapSort |
| Binary Search Tree | Left < Parent < Right | No shape requirement | Sorted storage, range queries |
| Complete Binary Tree | None | Complete (left-to-right filling) | Array-based tree representation |
| Balanced BST (AVL/RB) | Left < Parent < Right | Height-balanced | Guaranteed O(log n) operations |
The heap property is often stated loosely as 'parent is larger (or smaller) than children.' Let's be more precise, because precision matters when reasoning about correctness.
Max-Heap Property:
For every node N with value V, and for every descendant D of N with value W: V ≥ W
Min-Heap Property:
For every node N with value V, and for every descendant D of N with value W: V ≤ W
Notice several critical aspects of these definitions: the relation is ≥ (or ≤) rather than strict inequality, so duplicate values are allowed; the property constrains every descendant, not just the direct children; and nothing at all is said about the relative order of siblings.
Students often confuse heaps and BSTs. In a BST, left child < parent < right child—siblings ARE ordered. In a heap, parent ≥ children (for max-heap)—siblings have NO ordering relationship. A heap is NOT a search structure; you cannot binary search a heap.
The Transitivity Principle:
Why can we verify the heap property by only checking parent-child relationships rather than every ancestor-descendant pair?
Consider a max-heap with nodes A (root), B (A's child), and C (B's child). The heap property directly guarantees A ≥ B and B ≥ C; because ≥ is transitive, A ≥ C follows automatically, even though we never compared A and C directly.
This transitivity means we only need to enforce (and verify) the n − 1 parent-child relationships; every ancestor-descendant ordering then holds for free.
The most powerful consequence of the heap property: the root always contains the extreme element. In a max-heap, the root is guaranteed to be ≥ every other element. In a min-heap, the root is guaranteed to be ≤ every other element. This enables O(1) peek at the extreme value—just look at the root.
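Thanks to transitivity, a heap-property checker only needs to examine parent-child edges. A sketch in Python, assuming the standard array layout where the children of index i sit at 2i+1 and 2i+2 (the array representation itself is covered in detail on a later page):

```python
def is_max_heap(a):
    """Verify the max-heap property by checking only the parent-child
    edges; transitivity extends the guarantee to every
    ancestor-descendant pair."""
    n = len(a)
    for i in range(n):
        left, right = 2 * i + 1, 2 * i + 2
        if left < n and a[i] < a[left]:
            return False
        if right < n and a[i] < a[right]:
            return False
    return True
```

The checker runs in O(n) despite the heap property nominally constraining all ancestor-descendant pairs, of which there can be Θ(n log n).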
Before diving deeper into heaps, let's address an obvious question: if we want quick access to the maximum (or minimum) element, why not just maintain a sorted array?
The Sorted Array Approach:
The fundamental problem: Maintaining full sorted order is expensive when elements change frequently. Every insertion requires potentially shifting n elements.
| Operation | Sorted Array | Binary Heap | Heap Advantage |
|---|---|---|---|
| Find Maximum | O(1) | O(1) | Same |
| Extract Maximum | O(n) — shifting | O(log n) | Heap wins |
| Insert Element | O(n) — shifting | O(log n) | Heap wins |
| Decrease Key | O(n) — find + shift | O(log n) | Heap wins |
| Build from n elements | O(n log n) — sorting | O(n) — heapify | Heap wins |
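The insert rows of the table can be seen side by side in Python: `bisect.insort` keeps an array fully sorted (O(log n) to find the slot, but O(n) to shift elements), while `heapq.heappush` maintains only the heap's partial order in O(log n) total. The data here is illustrative:

```python
import bisect
import heapq
import random

data = [random.randrange(10**6) for _ in range(1000)]

# Sorted array: every insort may shift up to n elements — O(n) per insert.
sorted_arr = []
for x in data:
    bisect.insort(sorted_arr, x)

# Binary heap: every push is O(log n).
heap = []
for x in data:
    heapq.heappush(heap, x)

# Both structures give O(1) access to their extreme element:
assert sorted_arr[-1] == max(data)   # maximum at the end of the sorted array
assert heap[0] == min(data)          # minimum at the root of the min-heap
```

The asymptotic difference only shows up in maintenance cost; for a one-time "find the extreme" query, the two are equivalent, just as the table's first row says.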
The Heap's Insight: Partial Order Is Enough
Here's the key insight that makes heaps brilliant: we don't need full sorted order to find the extreme element efficiently.
The heap property provides a partial order—not every pair of elements is comparable via the structure, but we always know two things: the root holds the extreme element, and every node dominates its entire subtree.
The heap trades complete ordering for structural simplicity. You can't iterate through a heap in sorted order efficiently (that would take O(n log n)), but you don't need to. If your primary operations are 'insert' and 'extract-max', partial order is exactly what you need.
Understanding where heaps came from illuminates why they were designed the way they were.
1964: J.W.J. Williams invents the heap
Williams introduced the binary heap specifically as a data structure for his new sorting algorithm: Heapsort. His goal was to create an in-place sorting algorithm with guaranteed O(n log n) worst-case performance—something Quicksort couldn't provide.
The key insight: if you can build a heap in O(n) time and extract n elements in O(n log n) time, you have an O(n log n) sorting algorithm that sorts in place, using no memory beyond the input array, and never degrades below O(n log n) even on adversarial inputs—guarantees Quicksort cannot make.
In the same year, Robert W. Floyd improved Williams' heap construction. Williams' original method built a heap in O(n log n) by inserting elements one at a time. Floyd showed that bottom-up heapify achieves O(n)—a significant theoretical improvement that we'll explore in later modules.
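Williams' and Floyd's combined insight can be sketched with Python's `heapq`, whose `heapify` uses Floyd's bottom-up O(n) construction. (Classic Heapsort works in place on a max-heap; this sketch returns a new list via the min-heap API for brevity.)

```python
import heapq

def heap_sort(items):
    """Heapsort in the spirit of Williams and Floyd: build a heap in
    O(n) with bottom-up heapify, then extract n elements at
    O(log n) each, for O(n log n) overall."""
    heap = list(items)
    heapq.heapify(heap)   # Floyd's bottom-up construction: O(n)
    return [heapq.heappop(heap) for _ in range(len(heap))]
```

Floyd's improvement matters for construction, not for the overall sort: n extractions still dominate at O(n log n), but the O(n) build is what makes heaps cheap to create from existing data.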
Why 'Heap'?
The term 'heap' predates computer science. In everyday language, a heap is an untidy pile of things—seemingly chaotic, yet with some structure. The data structure captures this essence: its elements are only loosely ordered internally, but the most important one always sits on top.
Note: The term 'heap' in 'heap memory' (dynamic memory allocation) is unrelated to the heap data structure. This is an unfortunate naming collision in computer science terminology. The memory heap refers to a pool of memory available for dynamic allocation; the name likely comes from the 'pile' or 'heap' of available memory blocks.
The Evolution of Heap Applications:
1964-1970s: Primarily for sorting (Heapsort)
1970s-1980s: Priority queues for operating systems (process scheduling)
1980s-Present: Graph algorithms (Dijkstra, Prim), compression (Huffman coding), event-driven simulation
Today: Heaps are fundamental infrastructure: they back the priority queues in standard libraries (C++ std::priority_queue, Java PriorityQueue, Python heapq), operating-system schedulers and timer queues, and the graph, compression, and simulation algorithms listed above.
The binary heap remains the workhorse implementation because of its simplicity and excellent cache locality—properties that matter enormously on modern hardware.
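As one example of the graph-algorithm use, here is a compact Dijkstra sketch with `heapq` as the priority queue. Since `heapq` offers no decrease-key, stale heap entries are simply skipped on pop ('lazy deletion'), the standard workaround; the adjacency-list format is an illustrative assumption.

```python
import heapq

def dijkstra(graph, source):
    """Shortest-path distances from source over non-negative edge
    weights, with a binary heap as the priority queue.
    graph: {node: [(neighbor, weight), ...]} — illustrative format."""
    dist = {source: 0}
    pq = [(0, source)]
    while pq:
        d, u = heapq.heappop(pq)
        if d > dist.get(u, float("inf")):
            continue                      # stale entry — skip it
        for v, w in graph.get(u, []):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(pq, (nd, v))  # new entry instead of decrease-key
    return dist
```

Lazy deletion lets a plain binary heap stand in for a structure with decrease-key at the cost of some extra heap entries—usually a good trade in practice, for the cache-locality reasons discussed below.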
While this module focuses on binary heaps, it's valuable to know that the heap concept has been extended and refined in many ways. Each variant optimizes for different operation patterns:
| Operation | Binary Heap | Fibonacci Heap | Binomial Heap | Pairing Heap |
|---|---|---|---|---|
| Insert | O(log n) | O(1) amortized | O(log n) | O(1) |
| Find Min/Max | O(1) | O(1) | O(log n) or O(1)* | O(1) |
| Extract Min/Max | O(log n) | O(log n) amortized | O(log n) | O(log n) amortized |
| Decrease Key | O(log n) | O(1) amortized | O(log n) | O(log n) amortized** |
| Merge Two Heaps | O(n) | O(1) | O(log n) | O(1) |
Despite Fibonacci heaps having better theoretical bounds for several operations, binary heaps are almost always preferred in practice. Why? Constant factors and cache locality. Binary heaps have tiny constant factors, fit in arrays with excellent cache behavior, and are trivially simple to implement correctly. Fibonacci heaps have huge constant factors and complex implementations. For real-world sizes, binary heaps are usually faster.
The 80/20 Rule of Heap Selection:
For 80% or more of practical applications, a binary heap is the right choice. Consider alternatives only when:
You need efficient decrease-key: Fibonacci heaps shine in algorithms like Dijkstra's where decrease-key is frequent and the graph is dense.
You need to merge heaps: Binomial or Fibonacci heaps support O(log n) or O(1) merge, versus O(n) for binary heaps.
You have proven that the heap is the bottleneck: Profile first, optimize second. A simpler data structure with good cache behavior often outperforms a theoretically superior structure.
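For binary heaps, the O(n) merge mentioned above amounts to concatenating the two underlying arrays and re-heapifying—a sketch:

```python
import heapq

def merge_binary_heaps(h1, h2):
    """Merge two array-based binary min-heaps by concatenating and
    re-heapifying. Heapify is O(n), so the merge is O(n) overall —
    the cost the variants table attributes to binary heaps."""
    merged = h1 + h2
    heapq.heapify(merged)
    return merged
```

If merging is a hot operation, this linear cost is exactly what motivates binomial and Fibonacci heaps; otherwise it is rarely worth the added complexity.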
For this course, we focus on binary heaps because they are simple to implement correctly, they have small constant factors and excellent cache behavior, and they are the right choice for the overwhelming majority of applications.
Let's synthesize everything into a complete mental model for binary heaps that you can carry forward:
A binary heap is a complete binary tree where each node dominates its descendants.
Unpack this sentence: 'complete binary tree' is the shape property; 'dominates' is the heap property (≥ for a max-heap, ≤ for a min-heap); and 'descendants'—not just children—is the guarantee that transitivity lets us enforce by checking only parent-child pairs.
To truly understand heaps, you need three things: (1) The definition—what properties must hold, (2) The representation—how we store it efficiently in an array, (3) The operations—how we maintain properties during insert/extract. This page covers the definition. The next pages cover representation and operations.
What Heaps Are NOT:
To solidify understanding, explicitly consider what heaps are not: not a search structure (you cannot binary search a heap), not fully sorted (only a partial order holds), not a BST (siblings are unordered), and not related to the 'heap' of dynamic memory allocation.
You now have a rigorous understanding of what a binary heap is. But definition is just the beginning. To use heaps effectively, you need to understand:
Coming up in this module:
The Heap Property (Max vs Min): Deep dive into the ordering invariant, with precise analysis of how it enables efficient operations.
Complete Binary Tree Structure: Why completeness is the key that unlocks array-based representation and O(log n) height guarantees.
Why Shape Matters: The connection between shape, height, and performance—and why heaps outperform arbitrary binary trees for priority queue operations.
Coming in later modules: the array-based representation of heaps, the core operations (insert, extract, heapify) that maintain both invariants, and Heapsort.
You've mastered the definition of a binary heap: a complete binary tree satisfying the heap property. You understand why this specific combination enables efficient extreme-element operations, how it differs from other tree structures, and where it fits in the broader landscape of heap variants. Next, we'll examine the heap property in depth—the ordering invariant that makes heaps useful for priority-based operations.