Before paging became the dominant memory management paradigm, operating systems faced a stark choice when memory ran low: swap out an entire process. This technique—known as standard swapping or whole-process swapping—was the original meaning of the term "swap."
In standard swapping, when memory pressure occurs, the operating system selects an entire process and writes all of its memory to disk. The freed memory can then be used by other processes. When the swapped-out process needs to run again, the system reads its entire memory image back from disk.
Understanding standard swapping is crucial for several reasons: it illuminates the historical evolution of memory management, explains terminology that persists in modern systems, and reveals why paging-based approaches became dominant. Moreover, some modern systems still employ process-level swapping as a last-resort mechanism under extreme memory pressure.
By the end of this page, you will understand how standard swapping works, its historical context, the role of the swapper (the medium-term scheduler), the mechanics of process swap-out and swap-in, and why modern systems prefer page-based swapping while occasionally falling back to process-level mechanisms.
To appreciate standard swapping, we must travel back to the computing era of the 1960s and 1970s, when the constraints were radically different from today.
The time-sharing challenge:
Imagine a university mainframe in 1975. Twenty students are logged in via terminals, each running a text editor or compiler. The total memory demand might be 20 × 200KB = 4MB. But the machine has only 1MB of RAM.
The solution: multiprogramming with swapping. At any moment, only a subset of processes are in memory. When a user pauses (e.g., to read output or think), their process is swapped out, making room for another user's process to swap in. When the first user types a command, their process is swapped back.
This approach worked because of a favorable timing asymmetry between users and the machine.
The math of early swapping:
A small process image could be written to or read back from the swap device in a second or two, while a user's pause between commands typically lasted many seconds. Since user think time vastly exceeded swap time, the system remained responsive despite frequent swapping.
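To make this concrete, here is the arithmetic with some assumed figures (the image size matches the example above; the disk throughput and think time are illustrative guesses, not measurements):

```python
# Back-of-the-envelope swap timing for a 1970s time-sharing system.
# IMAGE_KB comes from the example above; the other figures are assumptions.

def swap_time_seconds(image_kb, disk_kb_per_s):
    """Time to transfer one whole process image to or from disk."""
    return image_kb / disk_kb_per_s

IMAGE_KB = 200        # one user process (text editor, compiler)
DISK_KB_PER_S = 100   # assumed swap-device throughput
THINK_TIME_S = 10     # assumed pause while a user reads output

t_swap = swap_time_seconds(IMAGE_KB, DISK_KB_PER_S)   # 2.0 seconds
round_trip = 2 * t_swap                               # swap out + swap in

# The round trip fits comfortably inside a think-time pause, so another
# user's process can run in the freed memory meanwhile.
assert round_trip < THINK_TIME_S
```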
Early Unix systems on the PDP-11 (developed by Ken Thompson and Dennis Ritchie in the 1970s) used standard swapping extensively. The entire address space of a process would be written to a swap area. The Unix term 'swap' derives from this whole-process swapping model and has persisted, even though modern implementations swap pages rather than processes.
In standard swapping, the unit of swapping is the entire process memory image. When a process is swapped out, all of its allocated memory is written to disk. When swapped in, the entire image is read back.
The contiguity requirement:
A critical aspect of early standard swapping was the requirement for contiguous memory allocation. When a process is swapped in, it needs a single block of memory large enough to hold its entire image. This requirement had significant implications:
External fragmentation — As processes swap in and out, gaps form between allocated regions. Eventually, total free memory might be sufficient, but no single contiguous block is large enough.
Compaction — To solve fragmentation, the OS might compact memory, sliding all processes to one end. This meant copying much of memory's contents, which was expensive but sometimes necessary.
Fixed partitions — Some systems used fixed memory partitions. Each process was assigned to a partition of appropriate size, simplifying allocation but wasting memory if processes didn't fill their partitions.
These limitations were primary motivators for the development of paging, which we'll compare in a later section.
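The interaction between fragmentation and compaction can be seen in a toy model (the hole addresses and sizes below are invented for illustration):

```python
# Toy model of external fragmentation under contiguous allocation.
# Free memory is a list of (start_kb, size_kb) holes between processes.

def can_allocate(holes, size_kb):
    """Contiguous allocation needs a single hole at least size_kb large."""
    return any(size >= size_kb for _start, size in holes)

def compact(holes):
    """Compaction slides processes together, merging all holes into one."""
    total_free = sum(size for _start, size in holes)
    return [(0, total_free)]

# After several swap-ins and swap-outs, free memory is scattered:
holes = [(0, 100), (300, 150), (700, 120)]      # 370 KB free in total

assert sum(size for _s, size in holes) >= 200   # enough memory overall...
assert not can_allocate(holes, 200)             # ...but no 200 KB hole
assert can_allocate(compact(holes), 200)        # compaction makes it fit
```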
In systems using standard swapping, a specialized component called the swapper (also known as the medium-term scheduler) makes swapping decisions. This component bridges the gap between short-term scheduling (which process runs next on the CPU) and long-term scheduling (which jobs are admitted to the system).
| Scheduler | Time Scale | Decision | Frequency |
|---|---|---|---|
| Long-term (Job Scheduler) | Minutes to hours | Which jobs are admitted to the system? | Rarely (batch systems) |
| Medium-term (Swapper) | Seconds to minutes | Which processes are in memory vs. swapped? | Periodically |
| Short-term (CPU Scheduler) | Milliseconds | Which ready process runs next? | Constantly |
The swapper's role:
The swapper daemon (often implemented as a kernel thread or privileged process) continuously monitors system memory state and makes swap decisions based on several factors:
Memory availability — When free memory drops below a threshold, the swapper identifies processes to swap out
Process state — Processes that have been waiting (blocked) for a long time are prime candidates for swap-out. Sleeping processes don't need their memory in RAM.
Priority — Lower-priority processes are preferred victims for swap-out. High-priority or real-time processes may be protected.
Residency time — Processes that have been in memory for a long time without running may be swapped out. Conversely, recently swapped-in processes get grace time before being swapped out again.
Memory size — Swapping a large process frees more memory but takes longer. The swapper balances these factors.
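One way these factors might be folded into a single numeric score is sketched below. The weights are invented for illustration, not taken from any real kernel:

```python
# Illustrative victim selection: combine the factors above into one score.
# The weights are arbitrary; a real swapper would tune them carefully.

def compute_swap_score(proc, now):
    """Higher score = more attractive swap-out victim."""
    score = 0.0
    if proc["state"] == "BLOCKED":
        score += 100.0                     # sleeping processes first
    score -= 10.0 * proc["priority"]       # protect high-priority work
    score += now - proc["last_active"]     # long-idle residents next
    score -= proc["size_mb"] / 10.0        # large images swap slowly
    return score

def select_swap_out_candidate(processes, now):
    """Pick the highest-scoring unlocked process, or None if none qualify."""
    eligible = [p for p in processes if not p["locked"]]
    if not eligible:
        return None
    return max(eligible, key=lambda p: compute_swap_score(p, now))
```

Under these weights, a blocked, long-idle, low-priority process outranks a ready, recently active one even when the latter is smaller.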
Swap-in decisions:
The swapper also decides when to bring swapped-out processes back:
Event occurrence — When a blocked process's event occurs (I/O completes, semaphore signaled), it should be swapped in to resume.
Time quantum — Some systems periodically swap in waiting processes to ensure fairness.
Memory availability — If memory becomes plentiful, swapped processes may be brought in proactively.
```
// Conceptual swapper algorithm (medium-term scheduler)

procedure SWAPPER:
    loop forever:
        sleep(SWAPPER_INTERVAL)   // e.g., 1 second

        // SWAP-OUT PHASE: Free memory if needed
        while free_memory < LOW_THRESHOLD:
            victim = select_swap_out_candidate()
            if victim == NULL:
                break             // No suitable candidates
            swap_out(victim)
            add_to_free_list(victim.frames)
            victim.state = SWAPPED_OUT

        // SWAP-IN PHASE: Bring back waiting processes
        for each process P in swapped_out_queue:
            if should_swap_in(P):
                if free_memory >= P.memory_size:
                    swap_in(P)
                    P.state = READY
                    add_to_ready_queue(P)
                else:
                    break         // Not enough memory

function select_swap_out_candidate():
    // Selection criteria:
    //   1. Prefer blocked processes over ready/running
    //   2. Prefer lower priority
    //   3. Prefer longer time since last active
    //   4. Prefer smaller processes (faster swap)
    best_candidate = NULL
    best_score = -INFINITY
    for each process P in memory:
        if P.is_locked or P.is_kernel:
            continue
        score = compute_swap_score(P)
        if score > best_score:
            best_score = score
            best_candidate = P
    return best_candidate

function should_swap_in(P):
    // Swap in if:
    //   1. P is now runnable (event occurred)
    //   2. P has been swapped out too long (fairness)
    //   3. P has high priority and we have memory
    return P.is_runnable and
           (time_since_swap_out(P) > MAX_SWAP_TIME or
            P.priority > PRIORITY_THRESHOLD)
```

If the swapper is too aggressive, it can cause thrashing: processes are swapped out shortly before their events occur and swapped back in, only to be swapped out again. This wastes enormous I/O bandwidth. The swapper must use hysteresis—processes that were recently swapped in should be protected from immediate swap-out.
In standard swapping, when a process is swapped out, its entire memory image must be written to swap space. The operating system must therefore manage swap space allocation at the process level, reserving and tracking regions large enough to hold complete process images.
Contiguous vs. scattered swap allocation:
Early standard swapping often allocated contiguous regions in swap space for each process, mirroring the contiguous memory allocation requirement. This simplified I/O (one large sequential read/write) but caused the same fragmentation problems as memory allocation.
Later systems allowed scattered swap allocation: a process's image could be split across multiple non-contiguous blocks of swap space, with a per-process map recording where each piece lives.
This reduced fragmentation but increased management complexity and sometimes I/O overhead.
Modern relevance:
Although page-based swapping has largely replaced process-level swap allocation, the concept survives in:
Linux OOM handling — Under extreme pressure, Linux's OOM killer terminates processes entirely, effectively "swapping out" their memory (though not to disk in recoverable form)
Container memory limits — Checkpoint/restore tools such as CRIU let container runtimes save a container's entire state to disk and restore it later, suspending it wholesale rather than page by page
Hibernation — The entire system state (all processes) is written to swap, resembling mass standard swapping
| Strategy | Advantages | Disadvantages |
|---|---|---|
| Pre-allocation | Swap-out never fails; simple accounting | Wastes swap space; limits process count |
| On-demand allocation | Efficient space usage | Swap-out may fail; OOM risk |
| Contiguous regions | Fast sequential I/O | Swap fragmentation; allocation challenges |
| Scattered allocation | No fragmentation issues | Complex management; potential seek overhead |
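A scattered allocator can be sketched as a pool of fixed-size swap slots tracked by a free list, with each process keeping a map of the slots holding its image. The slot size and capacity below are arbitrary choices for illustration:

```python
# Sketch of scattered swap allocation: swap space is divided into
# fixed-size slots, and a process's image may occupy any free slots.

SLOT_KB = 64

class SwapSpace:
    def __init__(self, total_kb):
        self.free = [True] * (total_kb // SLOT_KB)

    def allocate(self, image_kb):
        """Reserve enough slots for an image; they need not be adjacent."""
        needed = -(-image_kb // SLOT_KB)        # ceiling division
        slots = [i for i, ok in enumerate(self.free) if ok][:needed]
        if len(slots) < needed:
            return None                         # swap space exhausted
        for i in slots:
            self.free[i] = False
        return slots    # the process's swap map records these slot numbers

    def release(self, slots):
        """Return a swapped-in process's slots to the free pool."""
        for i in slots:
            self.free[i] = True
```

Because any combination of free slots works, swap space itself never suffers external fragmentation; the cost is the per-process map and, on spinning disks, scattered seeks.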
While standard swapping solved the fundamental problem of limited memory, it had significant limitations that became increasingly problematic as systems evolved.
The scaling problem:
Consider a modern workstation with tens of gigabytes of RAM, running processes whose memory images range from hundreds of megabytes to several gigabytes.
With standard swapping, suspending or resuming one such process means transferring its entire image. Even at SSD throughput of a few gigabytes per second, a multi-gigabyte image takes seconds to move; on a spinning disk it would take minutes.
This is clearly unacceptable for modern interactive systems. The solution—demand paging—allows portions of processes to reside in memory while the rest remains on disk, swapping individual pages rather than entire process images.
The working set insight:
The key realization driving paged memory was the working set concept: at any moment, a process actively uses only a small subset of its pages. If we keep the working set in memory and swap only cold pages, we can support many large processes simultaneously.
A 4GB process might have a 100MB working set. Five such processes need only 500MB of RAM (for their working sets), not 20GB (for complete images). The remaining pages can stay on disk until needed.
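The arithmetic from this example, spelled out (all sizes taken from the text above):

```python
# Working-set arithmetic: RAM needed under each scheme, in MB.

process_size_mb = 4 * 1024    # each process's full 4 GB image
working_set_mb = 100          # pages actually in active use
n_processes = 5

ram_standard_swapping = n_processes * process_size_mb  # whole images resident
ram_demand_paging = n_processes * working_set_mb       # working sets only

assert ram_standard_swapping == 20 * 1024   # 20 GB
assert ram_demand_paging == 500             # 500 MB
```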
Standard swapping doesn't exploit locality of reference. Programs don't access their entire address space uniformly—they spend 90% of time in 10% of code. Paging exploits this by keeping hot pages in RAM while cold pages can remain on disk indefinitely.
Despite its limitations, elements of standard swapping persist in modern operating systems, typically as emergency mechanisms when page-level swapping proves insufficient.
Linux swappiness and per-process behavior:
Linux's /proc/sys/vm/swappiness parameter (0-200, default 60) influences how aggressively the kernel swaps. At low values, the kernel strongly prefers evicting file-backed pages; at high values, it more readily swaps anonymous pages.
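A minimal helper for inspecting this setting from a script; the procfs path below is the standard Linux location, so this only works on Linux (the `path` parameter exists mainly so the function can be exercised elsewhere):

```python
from pathlib import Path

def read_swappiness(path="/proc/sys/vm/swappiness"):
    """Return the current vm.swappiness value as an integer."""
    return int(Path(path).read_text().strip())
```

Changing the value requires root, e.g. `sysctl -w vm.swappiness=10` or writing to the same procfs file.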
For individual processes, Linux supports:
```shell
# View a process's memory status
cat /proc/<pid>/status | grep -E "VmSwap|VmRSS"
# VmRSS:   12345 kB   (resident in RAM)
# VmSwap:   6789 kB   (in swap)
```

To lock a process's memory in RAM and prevent it from being swapped, use the `mlock()` system call, or `mlockall(MCL_CURRENT | MCL_FUTURE)` to lock the entire address space including future allocations.
When complete swap-out makes sense:
In certain scenarios, swapping entire processes remains logical:
Modern mobile OSes often kill background processes rather than swap them, prioritizing startup speed over preservation. Relaunching an app takes seconds; swapping it back in might take longer and still require initialization. This is a pragmatic trade-off enabled by fast flash storage and app architectures designed for quick restart.
The transition from standard swapping to demand paging was one of the most significant advances in operating system design. Let's compare these approaches systematically.
| Aspect | Standard Swapping | Demand Paging |
|---|---|---|
| Unit of swap | Entire process | Individual page (e.g., 4KB) |
| Memory residency | All-or-nothing | Partial (working set in RAM, rest on disk) |
| Swap granularity | Process size (MB-GB) | Page size (KB) |
| I/O efficiency | Large sequential transfers | Many small transfers (with read-ahead) |
| Fragmentation | External (in memory and swap) | Internal only (last page of allocation) |
| Sharing support | Limited or none | Shared pages map to same frames |
| Start-up cost | Full swap-in before execution | Pages loaded on first access |
| Locality exploitation | None | Working set pages stay in RAM |
| Memory utilization | Poor (cold pages in RAM) | Excellent (only hot pages in RAM) |
| Implementation complexity | Relatively simple | Complex (page tables, TLB, fault handlers) |
Why demand paging won:
The advantages of demand paging are overwhelming for general-purpose systems:
Better multiprogramming — With demand paging, 50 processes with 1GB address spaces but 50MB working sets need only ~2.5GB RAM, not 50GB. Memory utilization improves dramatically.
Faster startup — A process can begin executing immediately; code pages load on first access. No waiting for full swap-in.
Efficient sharing — Libraries like libc are loaded once and shared across all processes via page table mappings. Standard swapping would replicate them.
Graceful degradation — As memory pressure increases, only cold pages are evicted. The system slows but doesn't stop. With standard swapping, processes alternate between full-speed and completely stalled.
Fine-grained policies — Page replacement algorithms can make nuanced decisions: which specific pages to evict, based on access patterns. Standard swapping has only coarse process-level choices.
Trade-offs:
Demand paging isn't universally superior. Page faults introduce unpredictable latency on first access to a page, and the supporting machinery (page tables, TLBs, fault handlers) adds hardware and kernel complexity.
For soft real-time or latency-sensitive workloads, these trade-offs may favor memory locking over either swapping approach.
Standard swapping was the original solution to the memory scarcity problem, enabling time-sharing and multiprogramming when RAM was prohibitively expensive. While demand paging has largely superseded it, understanding standard swapping illuminates operating system evolution and remains relevant for specialized scenarios.
What's next:
We've seen standard (process-level) swapping and understand its limitations. The next page explores swapping with paging—how modern systems combine these concepts, using page-level granularity while still supporting process suspension and hibernation when necessary.
You now understand standard swapping: its historical role, mechanics, limitations, and modern appearances. Next, we'll see how paging transforms swapping into a fine-grained, efficient memory management technique.