In the previous chapter, we mastered arrays—contiguous blocks of memory where elements sit side by side like books on a shelf. Arrays give us lightning-fast random access, but they come with a fundamental constraint: they require continuous memory, and inserting or removing elements in the middle demands shifting everything around.
What if we could break free from this constraint? What if elements didn't need to be neighbors in memory? What if we could insert and remove elements without touching anything else?
This is precisely what linked lists offer. They represent a fundamentally different approach to storing sequential data—one where elements can live anywhere in memory, connected not by physical proximity but by explicit references that form a chain.
By the end of this page, you will understand the formal definition of a linked list, why it represents a paradigm shift from array-based thinking, and how the concept of 'connected elements' opens new possibilities for how we organize and manipulate data in memory.
Let's begin with precision. A linked list is a linear data structure consisting of a sequence of elements, where each element (called a node) contains:
- The data itself (the value we want to store), and
- A reference (or pointer) to the next node in the sequence.
Unlike arrays, linked list elements are not stored contiguously in memory. Each node can exist at any memory address, and the list maintains its logical order through the chain of references that connect one node to the next.
Formally:
A linked list is an ordered collection of nodes where each node stores data and a reference to its successor, with the final node's reference pointing to nothing (null/None/nil), signifying the end of the list.
In an array, the order of elements is determined by their physical positions in memory. In a linked list, the order is determined by the references each node holds. Physical memory location becomes irrelevant—only the logical connections matter.
The simplest linked list:
Imagine three numbers: 10, 20, 30. In an array, they occupy consecutive memory cells:
Memory: | 10 | 20 | 30 |
Index:     0    1    2
In a linked list, each number lives in a node that could be anywhere in memory:
Node A (at address 0x1000): data=10, next→0x2500
Node B (at address 0x2500): data=20, next→0x1800
Node C (at address 0x1800): data=30, next→null
The sequence 10→20→30 is maintained not by where the nodes live, but by the chain of next pointers. Node A points to Node B regardless of where B is located. Node B points to Node C. Node C's next is null, indicating the end.
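The chain above can be sketched in code. Here is a minimal Python sketch (the class and variable names are illustrative, not a prescribed implementation):

```python
class Node:
    """One element of a singly linked list: data plus a reference to the next node."""
    def __init__(self, data, next=None):
        self.data = data
        self.next = next

# Build the chain 10 -> 20 -> 30; each Node object may live anywhere in memory.
c = Node(30)            # next defaults to None, marking the end of the list
b = Node(20, next=c)
a = Node(10, next=b)    # 'a' is the head of the list

# Follow the chain of references to recover the logical order.
values = []
node = a
while node is not None:
    values.append(node.data)
    node = node.next
print(values)  # [10, 20, 30]
```

Notice that the traversal never computes an address from an index; it only follows `next` references, exactly as the diagram shows.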
Understanding linked lists isn't just about learning another data structure—it's about recognizing that there's more than one way to organize sequential data. Arrays and linked lists represent two fundamentally different philosophies:
This distinction has profound implications for how we think about algorithms, memory management, and system design.
| Aspect | Array Philosophy | Linked List Philosophy |
|---|---|---|
| Order Maintained By | Physical memory position | Explicit references/pointers |
| Memory Requirement | Contiguous block required | Scattered allocation allowed |
| Size Flexibility | Fixed or expensive to resize | Grows/shrinks naturally |
| Space Overhead | None (data only) | Reference storage per node |
| Access Pattern | Direct (compute address) | Traversal (follow chain) |
| Insertion/Deletion | Shift elements (expensive) | Redirect pointers (cheap) |
Neither arrays nor linked lists are 'better.' Each represents a different trade-off between access speed and modification flexibility. Expert engineers recognize which trade-off suits their problem and choose accordingly.
To truly internalize linked lists, let's consider a powerful real-world analogy.
The Array Model: A Bookshelf
Imagine a bookshelf with numbered slots. To find the 5th book, you walk directly to slot 5. Simple, fast, direct. But if you want to insert a new book between slots 4 and 5? You must shift every book from slot 5 onward to make room.
The Linked List Model: A Treasure Hunt
Now imagine a treasure hunt. Each location has:
- A clue (the data), and
- Directions to the next location (the reference).
The hunt starts at the first clue. You read it, follow the instructions, find the next clue, and repeat. The clues can be hidden anywhere—under a rock, in a tree, across town—it doesn't matter because each one tells you exactly where to go next.
Want to add a new location between clue 2 and clue 3? Simply:
- Write a new clue for the added location.
- Give the new clue directions to location 3 (copying clue 2's old directions).
- Rewrite clue 2's directions to point to the new location.
No other clues need to move. The chain is maintained by changing a single instruction.
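In code, the treasure-hunt insertion is just two pointer updates. A sketch, reusing an illustrative `Node` class with `data` and `next` fields:

```python
class Node:
    def __init__(self, data, next=None):
        self.data = data
        self.next = next

def insert_after(node, data):
    """Insert a new node right after 'node' by redirecting two references."""
    new_node = Node(data, next=node.next)  # new clue points to the old successor
    node.next = new_node                   # predecessor now points to the new clue
    return new_node

# Chain: clue1 -> clue2 -> clue3; insert "clue2.5" between clue2 and clue3.
clue3 = Node("clue3")
clue2 = Node("clue2", clue3)
clue1 = Node("clue1", clue2)
insert_after(clue2, "clue2.5")

node, order = clue1, []
while node:
    order.append(node.data)
    node = node.next
print(order)  # ['clue1', 'clue2', 'clue2.5', 'clue3']
```

No matter how long the list is, the insertion itself touches only the new node and its predecessor.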
Computer scientists often define linked lists recursively, which reveals their elegant mathematical structure:
A linked list is either:
- Empty (null/None/nil), or
- A node containing data and a reference to another linked list
This definition is beautifully self-referential. A linked list is a node connected to... another linked list! And that linked list is itself either empty or a node connected to yet another linked list.
Let's trace through an example. The list 10→20→30 unfolds according to the definition as: a node (10) whose rest is a linked list, which is a node (20) whose rest is a linked list, which is a node (30) whose rest is the empty list.
This recursive definition has practical implications:
- Every suffix of a linked list is itself a valid linked list.
- Algorithms can be written recursively, with the empty list as the natural base case.
- An operation defined for one node and "the rest of the list" extends to the whole structure.
The recursive definition isn't just theoretical elegance—it directly influences how we write code. Traversal, search, insertion, and even reversal can be expressed recursively with remarkable clarity. When we implement linked list operations later, this perspective will prove invaluable.
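As a sketch of that recursive perspective (assuming a `Node` with `data` and `next` fields, and `None` playing the role of the empty list):

```python
class Node:
    def __init__(self, data, next=None):
        self.data = data
        self.next = next

def length(lst):
    """Length mirrors the recursive definition: the empty list has length 0;
    otherwise the list is one node plus the rest of the list."""
    if lst is None:                 # base case: the empty list
        return 0
    return 1 + length(lst.next)     # one node, then recurse on its successor

def to_list(lst):
    """Collect values in order, again following the definition."""
    return [] if lst is None else [lst.data] + to_list(lst.next)

chain = Node(10, Node(20, Node(30)))  # 10 -> 20 -> 30 -> None
print(length(chain))   # 3
print(to_list(chain))  # [10, 20, 30]
```

Each recursive call operates on a smaller linked list, and the `None` base case is exactly the "empty" branch of the definition.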
While we'll explore each variant in dedicated modules, it's valuable to know that linked lists come in several forms:
1. Singly Linked List
Each node has one pointer—to its successor. You can only move forward through the list.
2. Doubly Linked List
Each node has two pointers—one to its successor and one to its predecessor. You can move both forward and backward.
3. Circular Linked List
Instead of the last node pointing to null, it points back to the first node, forming a cycle. This can apply to both singly and doubly linked variants.
Each type has different trade-offs:
| Variant | Pointers per Node | Traversal | Memory Overhead | Common Use Cases |
|---|---|---|---|---|
| Singly Linked | 1 (next only) | Forward only | Lower | Stacks, simple sequences, undo systems |
| Doubly Linked | 2 (next + prev) | Both directions | Higher | Browsers (back/forward), LRU caches |
| Circular Singly | 1 (next) | Forward (cycles) | Lower | Round-robin, circular buffers |
| Circular Doubly | 2 (next + prev) | Both (cycles) | Higher | Music playlists, rotating schedules |
We'll begin with singly linked lists because they're the purest expression of the linked list concept. Master them, and doubly linked and circular variants become natural extensions.
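For a first taste of the difference, here is a hedged sketch of a doubly linked node (the `DNode` name and fields are illustrative); the only change from the singly linked case is one extra reference per node:

```python
class DNode:
    """Doubly linked node: data plus references to both neighbors."""
    def __init__(self, data):
        self.data = data
        self.next = None   # successor
        self.prev = None   # predecessor

# Link 10 <-> 20 manually; traversal now works in both directions.
a, b = DNode(10), DNode(20)
a.next, b.prev = b, a
print(a.next.data)  # 20 (forward)
print(b.prev.data)  # 10 (backward)
```

That single `prev` field is what buys backward traversal, at the cost of one more pointer to store and keep consistent on every insertion and deletion.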
Linked lists aren't a modern invention—they emerged at the very dawn of computer science.
The 1950s Origins
Linked lists were invented around 1955-1956 by researchers at the RAND Corporation, including Allen Newell, Cliff Shaw, and Herbert A. Simon. They developed linked lists for their Information Processing Language (IPL), used to create early artificial intelligence programs, including the Logic Theorist—often considered the first AI program.
Why Linked Lists Mattered Then
Early computers had extremely limited and expensive memory. The ability to allocate memory dynamically—adding nodes only when needed, freeing them when done—was revolutionary. Programmers could build structures that grew and shrank with their data, rather than guessing maximum sizes in advance.
Why Linked Lists Still Matter Now
Despite decades of hardware evolution, the fundamental trade-offs remain. Modern systems still need:
- Dynamic allocation that adapts to unpredictable workloads
- Constant-time insertion and deletion at known positions
- Structures that grow without large contiguous reallocations
Linked lists remain a cornerstone concept, foundational to understanding virtually every advanced data structure.
Trees are linked lists with multiple next pointers. Graphs are linked lists where nodes can connect to multiple others. Hash tables often use linked lists to handle collisions. Understanding linked lists unlocks understanding of the entire data structure landscape.
To fully appreciate linked lists, we must understand precisely how they differ from arrays—not just superficially, but at the memory and operational level.
Memory Allocation
When you create an array of size 100, the system must find 100 consecutive memory slots. If memory is fragmented (many small gaps but no large continuous block), allocation fails even if plenty of total memory exists.
A linked list of 100 nodes? Each node needs only enough space for itself. They can be scattered across memory, using whatever fragments are available. Memory fragmentation that would defeat an array poses no problem for a linked list.
Resizing Behavior
Need to grow an array beyond its capacity? The system must:
- Allocate a new, larger contiguous block of memory
- Copy every existing element into the new block
- Free the old block
This is expensive—O(n) for n elements. Even with amortized O(1) tricks (like doubling capacity), individual operations can be costly.
Growing a linked list? Simply allocate a new node and update one pointer. Constant time, every time.
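That constant-time growth can be sketched as a single allocation plus one pointer update (again using an illustrative `Node` class):

```python
class Node:
    def __init__(self, data, next=None):
        self.data = data
        self.next = next

def push_front(head, data):
    """Prepend in O(1): allocate one node whose next is the old head."""
    return Node(data, next=head)

head = None                      # start with an empty list
for value in [30, 20, 10]:
    head = push_front(head, value)

# head is now 10 -> 20 -> 30; no copying or reallocation ever happened.
node, values = head, []
while node:
    values.append(node.data)
    node = node.next
print(values)  # [10, 20, 30]
```

Each `push_front` does the same fixed amount of work regardless of how long the list already is, which is exactly the contrast with array resizing.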
Shrinking Behavior
Arrays don't naturally shrink. Deleting elements leaves gaps or requires explicit shrinking operations. Linked lists naturally release memory as nodes are removed—each deletion frees that node's memory immediately.
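Deletion is equally local: unlinking a node is one pointer redirection, after which (in a garbage-collected language like Python) the unreferenced node can be reclaimed. A sketch under those assumptions:

```python
class Node:
    def __init__(self, data, next=None):
        self.data = data
        self.next = next

def delete_after(node):
    """Unlink the node that follows 'node'. With no remaining references,
    the removed node becomes eligible for garbage collection."""
    if node.next is not None:
        node.next = node.next.next

head = Node(10, Node(20, Node(30)))  # 10 -> 20 -> 30
delete_after(head)                   # removes 20; no gaps, no shifting

node, values = head, []
while node:
    values.append(node.data)
    node = node.next
print(values)  # [10, 30]
```

Contrast this with an array, where removing the middle element would require shifting every later element left by one slot.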
Given the trade-offs we've discussed, linked lists are particularly well-suited to certain scenarios:
1. Frequent Insertions and Deletions
If your application constantly adds and removes elements—especially at arbitrary positions—linked lists shine. A text editor's undo system, a task queue where items are frequently added and removed, or a playlist where songs are constantly reordered all benefit from linked list efficiency.
2. Unknown or Highly Variable Size
When you can't predict how much data you'll hold, linked lists adapt naturally. You never over-allocate (wasting memory) or under-allocate (causing crashes or expensive resizes).
3. No Random Access Needed
If you always process elements in order—traversing from start to finish—the inability to jump directly to index n isn't a problem. First-in-first-out queues and last-in-first-out stacks fit this pattern perfectly.
4. Memory Fragmentation Environments
In embedded systems or long-running servers with fragmented memory, linked lists can succeed where large array allocations would fail.
5. Building Blocks for Complex Structures
Many advanced data structures use linked lists internally: adjacency lists for graphs, chaining for hash tables, and the spine of certain tree implementations.
If you need frequent random access (by index), want cache-efficient traversal, or are working with small, fixed-size datasets, arrays are usually the better choice. Linked lists' pointer overhead and poor cache locality make them slower than arrays for many common operations.
As we conclude this definitional exploration, let's solidify the mental model you should carry forward:
1. Think in Nodes, Not Indices
With arrays, you think: "element at index 3." With linked lists, you think: "the node that comes after the node that comes after the head." There's no index calculation—only chain-following.
2. Visualize the Chain
Always picture linked lists as nodes connected by arrows. The arrows represent memory addresses. If you can see the arrows in your mind, you can reason about pointer manipulation.
3. Remember: Only the Head is Known
Unlike arrays where every element is equally accessible, only the head (first node) of a linked list is directly accessible. Everything else requires traversal from the head. This shapes every algorithm we'll design.
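This head-only access means that "indexing" a linked list is really traversal in disguise. A hedged sketch of position lookup (the `nth` helper is illustrative):

```python
class Node:
    def __init__(self, data, next=None):
        self.data = data
        self.next = next

def nth(head, n):
    """Return the data at position n by walking n links from the head: O(n),
    unlike an array's O(1) address computation."""
    node = head
    for _ in range(n):
        if node is None:
            raise IndexError("linked list index out of range")
        node = node.next
    if node is None:
        raise IndexError("linked list index out of range")
    return node.data

head = Node(10, Node(20, Node(30)))  # 10 -> 20 -> 30
print(nth(head, 2))  # 30
```

Every lookup pays for its distance from the head, which is why algorithms on linked lists are designed around sequential traversal rather than random access.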
4. The Cost of Flexibility
Linked lists trade access speed for modification flexibility. Every time you work with linked lists, consciously acknowledge this trade-off. Is flexibility worth the traversal cost for your problem?
We've established the foundational understanding of what a linked list is:
Definition: A linked list is a linear data structure consisting of nodes, where each node contains data and a reference to the next node in the sequence. The final node's reference is null, indicating the end.
Key Properties:
- Each node stores data plus a reference to its successor
- Nodes may be scattered anywhere in memory; order lives in the references, not in physical position
- Only the head is directly accessible; reaching any other node requires traversal
- The final node's null reference marks the end of the list
Paradigm Shift: Linked lists represent a fundamentally different approach to sequential data—trusting explicit connections over physical arrangement.
What's Next:
Now that we understand what a linked list is conceptually, the next page will dissect the building block at its heart: the node. We'll explore how nodes store data and pointers, how pointers work at a deeper level, and how to think about memory references in linked structures.
You now understand the formal definition of a linked list, its relationship to arrays, and the paradigm shift it represents. Next, we'll dive deep into nodes, data, and the pointers that connect them—the atoms from which all linked structures are built.