In 1959, engineers at the University of Manchester began building the Atlas computer, a machine whose physical memory was minuscule by modern standards—yet each program could address far more memory than physically existed. This seeming paradox wasn't magic; it was the birth of virtual memory's most transformative capability: allowing programs to use more memory than the machine physically contains.
Today, this capability is so fundamental that we take it for granted. Your laptop with 16 GB of RAM routinely runs applications whose combined memory demands exceed 100 GB. Video editing software opens 8K footage files larger than physical memory. Databases hold indexes that dwarf available RAM. Scientific simulations work with datasets measured in terabytes.
How is this possible? How can software use memory that doesn't exist? This page answers that question, exploring the mechanisms that break the physical memory barrier and the profound implications for system design.
By the end of this page, you will understand how virtual address spaces can exceed physical memory size, the role of secondary storage in extending memory capacity, the key mechanisms that make this possible, and the fundamental tradeoffs involved.
At first glance, using more memory than exists seems impossible. Where can data go if there's no physical storage for it? The apparent contradiction dissolves when we realize that not all data needs to be in physical memory simultaneously.
The key insight: Locality of Reference
Programs don't access all their data uniformly. At any given moment, they're focused on a small subset of their total address space—the working set. This behavior, called locality of reference, has two forms:

- Temporal locality: data accessed recently is likely to be accessed again soon
- Spatial locality: data near recently accessed addresses is likely to be accessed next
Because of locality, we only need the currently active portions of a program in physical memory. Everything else can wait on disk until it's needed.
| Scenario | Total Data Size | Active Working Set | Locality Ratio |
|---|---|---|---|
| Text editor with large file | 500 MB file | ~1 MB visible + buffers | 1:500 |
| Web browser with many tabs | 2 GB total | 50 MB active tab | 1:40 |
| Database query execution | 10 TB database | 100 MB hot data | 1:100,000 |
| Compiling large project | 1 GB source | 10 MB current unit | 1:100 |
| Video editing timeline | 50 GB footage | 200 MB visible segment | 1:250 |
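To make the two forms of locality concrete, here is a minimal C sketch; the sizes are arbitrary, chosen only for illustration. The first loop exhibits spatial locality (a linear scan touches each page once, in order), the second temporal locality (a tiny window is re-touched thousands of times):

```c
#include <stdio.h>
#include <stdlib.h>

#define N (64 * 1024 * 1024)    /* 64 Mi ints, ~256 MB of data */

int main(void) {
    int *data = calloc(N, sizeof *data);
    if (!data) return 1;
    long sum = 0;

    /* Spatial locality: consecutive elements share pages (and cache
       lines), so a linear scan touches each page exactly once. */
    for (size_t i = 0; i < N; i++)
        sum += data[i];

    /* Temporal locality: re-touching the same 4 KB window keeps the
       working set tiny even though the array is ~256 MB, so these
       accesses stay resident in RAM (and cache) throughout. */
    for (int pass = 0; pass < 1000; pass++)
        for (size_t i = 0; i < 1024; i++)
            sum += data[i];

    printf("%ld\n", sum);
    free(data);
    return 0;
}
```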
The enabling abstraction:
Virtual memory exploits locality by creating a two-level memory hierarchy:

- Physical RAM: small and fast, holding the active working sets
- Secondary storage (SSD/HDD): large and slow, holding everything else
The operating system automatically migrates data between these levels, keeping hot data in RAM and cold data on disk. From the program's perspective, it appears to have unlimited fast memory—the abstraction hides the underlying reality.
RAM access takes ~100 nanoseconds. SSD access takes ~100 microseconds. HDD access takes ~10 milliseconds. That's a 1,000× gap between RAM and SSD, and a 100,000× gap between RAM and HDD. Virtual memory's success depends on minimizing trips to the slower levels.
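These numbers mean that even rare faults dominate average latency. A back-of-the-envelope calculation, using the figures above:

Effective Access Time = (1 − p) × RAM latency + p × fault service time

With 100 ns RAM and a 10 ms HDD fault, a fault rate of just p = 0.0001 (one access in ten thousand) gives roughly 0.9999 × 100 ns + 0.0001 × 10 ms ≈ 1.1 μs, about an 11× slowdown.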
The mechanism that enables virtual address spaces larger than physical memory involves several cooperating components.
The core concept: Partial Residency
At any moment, only a fraction of a process's virtual pages are 'resident' in physical memory. The rest exist only in the backing store (swap space on disk). When a process accesses a non-resident page, a page fault occurs, triggering the OS to load the page from disk.
Step-by-step mechanism:
```
Memory Access with Virtual Memory Larger Than Physical:
═══════════════════════════════════════════════════════

1. INITIAL STATE:
┌──────────────────────────────────────────────────────────┐
│ Process Virtual Address Space: 16 GB                     │
│ Physical RAM:                   4 GB                     │
│ Swap Space on Disk:            20 GB                     │
│                                                          │
│ Currently: 3 GB of pages in RAM, 13 GB on disk           │
└──────────────────────────────────────────────────────────┘

2. PROCESS ACCESSES VIRTUAL ADDRESS 0x7FF000001000:
┌──────────────────────────────────────────────────────────┐
│ CPU checks page table for this virtual page...           │
│                                                          │
│ Page Table Entry says:                                   │
│   Present bit   = 0  (not in physical memory!)           │
│   Disk location = Swap Block #4521                       │
└──────────────────────────────────────────────────────────┘

3. PAGE FAULT OCCURS:
┌──────────────────────────────────────────────────────────┐
│ CPU raises Page Fault Exception                          │
│ Control transfers to OS page fault handler               │
│                                                          │
│ The process is BLOCKED until fault is resolved           │
└──────────────────────────────────────────────────────────┘

4. OS HANDLING:
┌──────────────────────────────────────────────────────────┐
│ a) Find a free physical frame                            │
│    → If none free, evict a page (page replacement)       │
│                                                          │
│ b) Read page from Swap Block #4521 into frame            │
│    → This takes ~100 μs (SSD) or ~10 ms (HDD)            │
│                                                          │
│ c) Update page table entry:                              │
│    → Present bit  = 1                                    │
│    → Frame number = physical frame allocated             │
│                                                          │
│ d) Resume the faulting instruction                       │
└──────────────────────────────────────────────────────────┘

5. ACCESS COMPLETES:
┌──────────────────────────────────────────────────────────┐
│ The instruction that faulted re-executes                 │
│ This time, TLB/page table has valid mapping              │
│ Memory access succeeds!                                  │
│                                                          │
│ Now: 3 GB + 1 page in RAM, or if eviction occurred,      │
│ still ~3 GB but with different pages                     │
└──────────────────────────────────────────────────────────┘
```

The beauty of this mechanism is that the process doesn't know anything happened. It issued a memory access, waited (blocked), and received its data. From the program's perspective, it's just using memory—slowly for some accesses, quickly for others, but always successfully.
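You can watch these faults happen from user space. The following minimal sketch (assuming Linux) touches freshly mapped pages and reads the process's fault counters via getrusage(). First touches of anonymous memory show up as "minor" faults, since no disk I/O is needed; pages brought back from swap or a file would count as "major" faults (ru_majflt):

```c
/* Linux demo: first touches of new pages raise page faults, visible
   in the counters getrusage() reports. */
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>
#include <sys/resource.h>

static long minor_faults(void) {
    struct rusage ru;
    getrusage(RUSAGE_SELF, &ru);
    return ru.ru_minflt;
}

int main(void) {
    const size_t len = 16 * 1024 * 1024;            /* 16 MB */
    char *buf = mmap(NULL, len, PROT_READ | PROT_WRITE,
                     MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (buf == MAP_FAILED) { perror("mmap"); return 1; }

    long before = minor_faults();
    memset(buf, 1, len);                            /* touch every page */
    long after = minor_faults();

    printf("faults while touching 16 MB: %ld (~1 per 4 KB page)\n",
           after - before);
    munmap(buf, len);
    return 0;
}
```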
Let's examine the quantitative aspects of having virtual address spaces exceed physical memory. Understanding these numbers helps in system sizing and performance tuning.
Virtual address space vs. allocated memory vs. resident memory:
Several distinct measurements describe a process's memory usage:
| Metric | Definition | Example Values | Where It Lives |
|---|---|---|---|
| Virtual Size (VSZ) | Total address space mapped | 100 GB | Exists in page tables |
| Resident Set Size (RSS) | Pages currently in RAM | 500 MB | Physical memory |
| Swap Usage | Pages currently on disk | 200 MB | Swap partition/file |
| Private Memory | Not shared with other processes | 300 MB | RAM + Swap |
| Shared Memory | Mapped libraries, shared segments | 400 MB | RAM (counted once) |
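A short C program (Linux-specific, since it reads /proc/self/status) shows VSZ and RSS diverging: it allocates a gigabyte of address space but touches only one megabyte of it.

```c
/* VSZ vs. RSS in practice: allocate 1 GB of address space but touch
   only 1 MB, then read the kernel's accounting from /proc/self/status. */
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

static void show(const char *label) {
    FILE *f = fopen("/proc/self/status", "r");
    char line[256];
    printf("--- %s ---\n", label);
    while (f && fgets(line, sizeof line, f))
        if (!strncmp(line, "VmSize", 6) || !strncmp(line, "VmRSS", 5))
            fputs(line, stdout);
    if (f) fclose(f);
}

int main(void) {
    show("before");
    char *big = malloc(1UL << 30);        /* 1 GB of virtual space */
    if (!big) return 1;
    memset(big, 1, 1UL << 20);            /* but touch only 1 MB */
    show("after");  /* expect: VmSize up ~1 GB, VmRSS up only ~1 MB */
    free(big);
    return 0;
}
```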
Over-commitment ratios:
Systems routinely over-commit memory—promising more virtual memory than physical memory exists:
Over-commitment Ratio = Total Virtual Memory Promised / Physical RAM Available
Example:
- 50 processes, each with 8 GB virtual space = 400 GB
- Physical RAM = 32 GB
- Over-commitment ratio = 400 / 32 = 12.5x
This works because:

- Most processes never touch large parts of their virtual allocations (unused library code, over-sized buffers, reserved-but-unused mappings)
- Active working sets are far smaller than total allocations, thanks to locality
- Pages that are touched but go cold can be migrated to swap
The OOM (Out of Memory) risk:
Over-commitment has a failure mode: if processes collectively try to use more memory than RAM + swap, the system runs out. This triggers the OOM Killer on Linux, which terminates processes to free memory—a traumatic but necessary response to over-commitment failure.
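Over-commitment is easy to observe directly. The sketch below (Linux, assuming the default heuristic over-commit policy; the 64 GB figure is arbitrary) reserves far more virtual memory than most machines have, and the reservation succeeds because no physical frames are assigned until pages are touched:

```c
/* Over-commitment demo: reserving 64 GB of virtual address space
   usually succeeds even on a machine with far less RAM, because
   frames are only allocated on first touch. MAP_NORESERVE asks the
   kernel not to reserve swap for the range up front. */
#include <stdio.h>
#include <sys/mman.h>

int main(void) {
    const size_t len = 64UL * 1024 * 1024 * 1024;   /* 64 GB */
    void *p = mmap(NULL, len, PROT_READ | PROT_WRITE,
                   MAP_PRIVATE | MAP_ANONYMOUS | MAP_NORESERVE, -1, 0);
    if (p == MAP_FAILED) { perror("mmap"); return 1; }

    printf("reserved 64 GB of virtual space at %p\n", p);
    /* VSZ is now ~64 GB while RSS is nearly unchanged. Touching all
       of it would consume RAM + swap and could invoke the OOM killer. */
    munmap(p, len);
    return 0;
}
```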
```
# View system-wide memory statistics
$ free -h
               total   used   free   shared  buff/cache  available
Mem:            31Gi  8.5Gi  5.2Gi    1.2Gi        17Gi       21Gi
Swap:           16Gi  0.5Gi   16Gi

# Check over-commit settings (Linux-specific)
$ cat /proc/sys/vm/overcommit_memory
0    # 0=heuristic, 1=always, 2=never over-commit

$ cat /proc/sys/vm/overcommit_ratio
50   # When mode=2, commit limit = swap + 50% of RAM

# View a process's memory breakdown
$ cat /proc/self/status | grep -E "(VmSize|VmRSS|VmSwap)"
VmSize:   123456 kB   # Virtual address space size
VmRSS:     45678 kB   # Resident set (in RAM)
VmSwap:     1234 kB   # Currently swapped out
```

Memory over-commitment works when locality holds—when processes don't simultaneously access their full allocations. Workloads with poor locality (random access patterns, in-memory databases) may defeat this assumption and require careful memory sizing.
The backing store is the secondary storage that holds pages not currently in physical memory. Without it, virtual memory larger than physical memory would be impossible.
Types of backing store:
| Type | Contents | Writeback Required? | Examples |
|---|---|---|---|
| Swap Space | Anonymous pages (heap, stack) | Yes, if modified | swap partition, pagefile.sys |
| Executable Files | Code pages (text segment) | No (read-only) | ELF binary, PE executable |
| Shared Libraries | Library code and data | No (read-only code) | libc.so, kernel32.dll |
| Memory-Mapped Files | File contents mapped to memory | Yes, if mmap'd writeable | Database files, config files |
Swap space design:
Swap space is dedicated storage for pages that have no other backing file (anonymous pages). Design considerations include:
1. Swap partition vs. swap file: a dedicated partition avoids filesystem overhead and fragmentation, while a swap file is easier to create and resize after installation; on modern kernels the performance difference is small.

2. Sizing guidance: the old '2× RAM' rule of thumb is obsolete; appropriate sizing depends on workload, and systems that hibernate need enough swap to hold all of RAM.

3. SSD considerations: SSDs shrink the fault penalty by roughly 100× versus HDDs, making swap far less painful, though sustained heavy swapping adds write wear.
```
Where Pages Come From When Faulted:
════════════════════════════════════

Page Type                → Backed By              → Example
─────────────────────────────────────────────────────────────────
Code page                → Executable file        → main() from /usr/bin/prog
Library code page        → Shared library file    → printf() from /lib/libc.so
Initialized data         → Executable file        → const char* msg = "Hello"
Heap page (new alloc)    → Zero-filled on demand  → malloc(1000)
Heap page (swapped out)  → Swap space             → Previously used memory
Stack page (new)         → Zero-filled on demand  → Growing stack frame
Stack page (swapped)     → Swap space             → Cold stack frames
mmap'd file page         → Original file          → Database page
mmap'd anonymous         → Swap space             → Large allocation

KEY INSIGHT:
─────────────
Pages backed by files don't need swap space—they can be
re-read from their original file. Only "anonymous" pages
(heap, stack, private data modified after loading) require
swap space for storage.
```

A clean (unmodified) page backed by a file can be discarded without writing anywhere—if needed again, it's re-read from the file. This is why read-only code and data segments are 'cheap' in memory terms: they can be evicted and restored at will.
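In C, the distinction in the breakdown above shows up directly in how memory is mapped. A brief sketch (the file path is just a placeholder): a read-only file-backed mapping needs no swap, while an anonymous mapping, once dirtied, can only be evicted to swap.

```c
/* File-backed vs. anonymous pages (Linux; the path is a placeholder).
   Clean file-backed pages can be dropped and re-read from the file;
   dirtied anonymous pages can only be evicted to swap. */
#include <stdio.h>
#include <fcntl.h>
#include <unistd.h>
#include <sys/mman.h>

int main(void) {
    /* File-backed, read-only mapping: needs no swap space at all. */
    int fd = open("/etc/hostname", O_RDONLY);     /* placeholder file */
    if (fd < 0) { perror("open"); return 1; }
    char *file_pg = mmap(NULL, 4096, PROT_READ, MAP_PRIVATE, fd, 0);

    /* Anonymous mapping: no backing file, so modified pages must go
       to swap if they are ever evicted. */
    char *anon_pg = mmap(NULL, 4096, PROT_READ | PROT_WRITE,
                         MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (file_pg == MAP_FAILED || anon_pg == MAP_FAILED) return 1;

    anon_pg[0] = 42;                 /* dirty an anonymous page */
    printf("file page begins: %.8s\n", file_pg);

    munmap(file_pg, 4096);
    munmap(anon_pg, 4096);
    close(fd);
    return 0;
}
```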
Demand paging is the mechanism that enables virtual spaces larger than physical memory. Instead of loading an entire program into memory at startup, the OS loads pages only when they're accessed—'on demand.'
Why demand paging is essential:
Without demand paging, a program would need all its pages loaded before execution. Consider:

- A multi-gigabyte executable would spend seconds on disk I/O before running its first instruction
- Rarely executed code (error handlers, obscure features) would occupy RAM for no benefit
- The combined size of all running programs could never exceed physical memory

With demand paging:

- Startup loads only the handful of pages needed to begin executing
- RAM holds only pages that have actually been touched
- The combined virtual memory of all processes can far exceed physical RAM
The page fault as the loading trigger:
When a program references a page not yet loaded:

1. The MMU finds the page table entry's present bit clear and raises a page fault
2. The OS locates the page in its backing store (executable, mapped file, or swap)
3. The OS reads it into a free frame, evicting another page if none is free
4. The OS updates the page table and resumes the faulting instruction
This is invisible to the program. The load instruction sees a brief delay, then gets its data. The program doesn't explicitly request page loading—it just uses addresses, and the system provides the data.
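Partial residency under demand paging can be observed with mincore() (Linux/BSD), which reports which pages of a mapping are currently resident. In this sketch, only the two pages that were touched should show up as resident:

```c
/* Demand paging made visible: map 8 pages, touch only pages 0 and 5,
   then ask the kernel which pages are resident. */
#include <stdio.h>
#include <unistd.h>
#include <sys/mman.h>

int main(void) {
    long psz = sysconf(_SC_PAGESIZE);
    size_t len = 8 * psz;
    char *buf = mmap(NULL, len, PROT_READ | PROT_WRITE,
                     MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (buf == MAP_FAILED) { perror("mmap"); return 1; }

    buf[0 * psz] = 1;              /* fault in page 0 */
    buf[5 * psz] = 1;              /* fault in page 5 */

    unsigned char resident[8];
    if (mincore(buf, len, resident) != 0) { perror("mincore"); return 1; }

    for (int i = 0; i < 8; i++)    /* expect: pages 0 and 5 resident */
        printf("page %d: %s\n", i,
               (resident[i] & 1) ? "resident" : "not resident");

    munmap(buf, len);
    return 0;
}
```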
Pure demand paging loads zero pages at startup—even the first instruction faults. Most real systems use 'prepaging' to load a few initial pages, reducing startup faults. Some also speculatively load pages adjacent to faulted ones (clustered paging), exploiting spatial locality.
Having virtual address spaces exceed physical memory isn't just a trick—it fundamentally changes what's possible in computing. Let's enumerate the benefits:

- Programs larger than physical RAM can run at all
- More processes can run concurrently, since each needs only its working set resident
- Programmers are freed from manual overlay management and explicit memory budgeting
- Physical RAM is spent only on pages that are actually used
The economic argument:
Virtual memory larger than physical has significant economic implications:
| Approach | Cost | Experience |
|---|---|---|
| Buy RAM to match peak usage | Very expensive | Always fast |
| Use virtual memory | Moderate | Usually fast, occasionally slow |
| Refuse to run large programs | Cheap | Frustrating |
Virtual memory finds the sweet spot: it enables capabilities that would otherwise require expensive hardware, accepting occasional slowdowns when working sets exceed RAM.
The software architecture impact:
Programmers can design software as if memory were unlimited:

- Map files far larger than RAM and index into them directly
- Build sparse data structures whose untouched regions consume no physical memory
- Allocate generously up front and let the OS keep only the hot pages resident
While virtual memory enables using 'more' memory, heavy reliance on swap devastates performance. The disk-to-RAM speed gap means a swapping-heavy workload might run 1000× slower than one that fits in RAM. Virtual memory is a safety net and capability enabler, not a substitute for adequate RAM.
The ability to exceed physical memory isn't free—it comes with significant tradeoffs that system designers and performance engineers must understand.
The fundamental tradeoff: Space vs. Time
| Storage Level | Latency | Relative Latency (vs. L1) | Impact on Page Fault |
|---|---|---|---|
| L1 Cache | ~1 ns | 1× | N/A |
| L3 Cache | ~20 ns | 20× | N/A |
| RAM | ~100 ns | 100× | No page fault |
| NVMe SSD | ~100 μs | 100,000× | Causes ~0.1ms fault |
| SATA SSD | ~200 μs | 200,000× | Causes ~0.2ms fault |
| HDD | ~10 ms | 10,000,000× | Causes ~10ms fault |
When virtual > physical fails:
Thrashing — When the working set exceeds physical memory, the system spends most of its time swapping pages in and out rather than doing useful work. CPU utilization drops even as the system is 100% busy doing I/O.
OOM conditions — If swap space is exhausted and memory is still needed, the OS must kill processes. This is non-deterministic and can kill the wrong process.
Latency-sensitive workloads — Real-time systems, low-latency trading, interactive applications—page faults introduce unacceptable jitter.
Why locality can fail:

- Random access over huge datasets (large hash tables, graph traversals) gives every page an equal chance of being touched next
- Sequential scans larger than RAM evict pages just before they would have been reused
- Too many competing processes shrink each one's resident share below its working set

The symptoms are visible with standard tools, as shown below.
```
Signs of Thrashing:
════════════════════

# High page fault rate
$ vmstat 1
procs -----------memory---------- ---swap-- -----io----
 r  b    swpd   free  buff  cache   si   so    bi    bo
 1 15  102400  50000  1000  10000  500  600 50000 60000
 1 14  103000  48000  1000  10000  550  700 55000 70000
    ▲                               ▲    ▲   ▲▲▲▲▲
    │                               │    │   High I/O = thrashing
    │                               │    └─ swap out
    │                               └─ swap in
    └─ blocked processes (waiting for I/O)

Key indicators:
  • High 'si' (swap in) and 'so' (swap out)
  • Many blocked processes ('b' column)
  • Low CPU utilization in 'top' despite high load
  • System feels slow/unresponsive
```

Thrashing is self-reinforcing: as the system swaps pages out, those pages soon need to be swapped back in, causing more evictions. The only escapes are reducing memory pressure (killing processes), adding RAM, or using techniques like working set management that we'll cover in later chapters.
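For programmatic monitoring, the same swap counters vmstat shows can be sampled from /proc/vmstat. A minimal probe in C (Linux-specific; the 1000 pages/s alert threshold is an arbitrary illustration):

```c
/* Minimal thrashing probe: samples the kernel's swap-in/swap-out
   counters from /proc/vmstat once per second, mirroring vmstat's
   'si'/'so' columns. Counters are in pages. */
#include <stdio.h>
#include <string.h>
#include <unistd.h>

static long read_counter(const char *name) {
    FILE *f = fopen("/proc/vmstat", "r");
    char key[64];
    long v, val = -1;
    if (!f) return -1;
    while (fscanf(f, "%63s %ld", key, &v) == 2)
        if (strcmp(key, name) == 0) { val = v; break; }
    fclose(f);
    return val;
}

int main(void) {
    long in0 = read_counter("pswpin"), out0 = read_counter("pswpout");
    for (int t = 0; t < 10; t++) {          /* sample for 10 seconds */
        sleep(1);
        long in1 = read_counter("pswpin"), out1 = read_counter("pswpout");
        printf("swap-in: %4ld pg/s  swap-out: %4ld pg/s%s\n",
               in1 - in0, out1 - out0,
               (in1 - in0) + (out1 - out0) > 1000
                   ? "  <- heavy swapping" : "");
        in0 = in1; out0 = out1;
    }
    return 0;
}
```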
The idea that virtual memory could exceed physical memory was not obvious—it was a breakthrough that took decades to develop and refine.
Timeline of virtual memory development:
| Year | System | Contribution |
|---|---|---|
| 1959 | Atlas (Manchester) | First working virtual memory system; pioneered paging |
| 1961 | Burroughs B5000 | Hardware-supported virtual memory, segmentation |
| 1965 | MIT Multics | Sophisticated virtual memory with demand paging and segments |
| 1972 | IBM System/370 | Virtual memory becomes mainstream commercial feature |
| 1977 | VAX VMS | Advanced virtual memory with extensive tuning options |
| 1983 | 4.2BSD Unix | Demand paging widespread in university/research Unix |
| 1991 | Linux 0.01 | Virtual memory from the start; eventually mmap, swappiness |
| 2000s | Modern SSDs | Fast swap storage changes virtual memory economics |
The Atlas computer's insight:
The Atlas computer at the University of Manchester first demonstrated that programmers could use more addresses than fit in physical memory. The key insight of Tom Kilburn and colleagues was the 'one-level store': core memory and drum could be managed automatically as a single memory, because program locality made demand paging viable, an observation later formalized as the working set principle.
Resistance to virtual memory:
Not everyone was convinced. Some concerns were valid, others less so:

- Performance becomes unpredictable, since any memory access might hide a disk access (valid, and still a concern for latency-sensitive systems)
- The hardware cost of address translation wasn't worth it (moot once MMUs became standard equipment)
- Programmers would stop economizing on memory (partly true, but the productivity gains dominated)
Despite early skepticism, virtual memory won because the programmer productivity gains outweighed the complexity and occasional performance variability.
Every modern desktop, server, laptop, and smartphone uses virtual memory with the capability to exceed physical RAM. Even embedded systems increasingly adopt it. The only holdouts are hard real-time systems where deterministic timing trumps flexibility.
The ability for virtual address spaces to exceed physical memory is one of computing's most significant abstractions. We've explored how this works and why it matters:

- Locality of reference means only the working set must be resident at any moment
- Page faults and the backing store let the OS migrate pages between RAM and disk transparently
- Demand paging loads pages lazily, on first access
- Over-commitment lets systems promise more memory than exists, at the cost of OOM risk
- The space-for-time tradeoff breaks down into thrashing when working sets exceed RAM
What's next:
We've seen that virtual memory can exceed physical memory through demand paging. The next page examines the demand paging mechanism in detail—how pages are brought in lazily, what happens during page faults, and the distinction between demand paging and demand segmentation.
You now understand how virtual address spaces can exceed physical memory, the mechanisms that enable this, and the tradeoffs involved. This capability is what transforms virtual memory from mere address translation into a powerful resource virtualization mechanism.