Defragmentation is the systematic process of reorganizing file data to restore contiguous storage layouts and consolidate free space. While the concept seems simple—move file pieces together—the implementation involves sophisticated algorithms, careful data safety mechanisms, and complex tradeoffs between thoroughness and system impact.
Consider the complexity: a file system may contain millions of files, billions of blocks, and intricate dependencies. A defragmenter must reorganize this data while the system remains operational, files remain accessible, and no data is ever lost—even if power fails mid-operation. This is one of the most demanding data management operations an operating system performs.
This page provides comprehensive coverage of defragmentation mechanics. You'll understand the algorithms that select and move data, safety mechanisms that protect against data loss, optimization strategies that maximize performance impact, and the complete lifecycle of a defragmentation operation from analysis to completion.
Defragmentation (or defragging) is the process of reducing fragmentation by reorganizing file data into contiguous regions. The core operation is simple to state: find a large enough run of contiguous free space, copy a file's scattered blocks into it, and update the file's metadata to point at the new location.
The Basic Algorithm:
At its simplest, defragmentation follows this pattern:
for each fragmented file:
1. Find contiguous free space large enough for entire file
2. Copy all file blocks to contiguous destination
3. Update file metadata to point to new locations
4. Mark old locations as free
5. Repeat until all files defragmented
However, this naive approach breaks down when no single free region is large enough to hold a file, when files are open or locked by running programs, or when the disk is so full that there is little room to maneuver.
Optimal defragmentation is essentially a bin-packing problem—NP-hard in the general case. Real defragmenters use heuristics to find 'good enough' solutions in reasonable time, not globally optimal arrangements.
Goals of Defragmentation:
Defragmentation has multiple, sometimes competing objectives: restoring per-file contiguity, consolidating free space to prevent future fragmentation, placing important files where they are fastest to access, and achieving all of this with minimal data movement and system disruption.
Various algorithms approach defragmentation with different priorities and tradeoffs. Understanding these helps select appropriate tools and strategies.
Algorithm 1: Simple Sequential Compaction
The most straightforward approach—move all files to the beginning of the disk, leaving all free space at the end.
Disk before: [A][-][B][B][-][-][C][-][D][D][-]
Process:
1. A is already at start → skip
2. B has gap before it → move B after A
3. C has gaps before it → move C after B
4. D has gaps before it → move D after C
Disk after: [A][B][B][C][D][D][-][-][-][-][-]
^all free space^
Advantages: Simple; achieves perfect free space consolidation.
Disadvantages: Moves even non-fragmented files; high data movement; slow.
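To make the idea concrete, here is a minimal sketch of sequential compaction over an in-memory allocation map. The map array, move_block helper, and FREE_BLOCK marker are illustrative assumptions; a real defragmenter operates on on-disk structures with the journaling protections described later on this page.
#include <stdint.h>
#include <stddef.h>
#define FREE_BLOCK UINT64_MAX   /* hypothetical marker for an unallocated block */
/* Stand-in for the journaled block move described later: copy the data of
 * block `from` to block `to` and update the owning file's mapping. Here we
 * only model the allocation map (each entry holds the owning file's id). */
static void move_block(uint64_t *map, size_t from, size_t to)
{
    map[to]   = map[from];      /* destination now belongs to the same file */
    map[from] = FREE_BLOCK;     /* source becomes free space */
}
/* Sequential compaction: slide every allocated block toward the start of the
 * disk, leaving one contiguous free region at the end. */
static void compact(uint64_t *map, size_t nblocks)
{
    size_t dest = 0;            /* next position to fill */
    for (size_t src = 0; src < nblocks; src++) {
        if (map[src] == FREE_BLOCK)
            continue;           /* skip holes */
        if (src != dest)
            move_block(map, src, dest);
        dest++;                 /* everything before `dest` is now packed */
    }
}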
| Algorithm | Data Movement | Free Space Result | Speed | Best For |
|---|---|---|---|---|
| Sequential Compaction | Very High | Optimal | Slow | Deep cleanup, offline use |
| Fragment-Only | Low-Medium | Moderate | Fast | Quick optimization |
| Free Space Priority | Medium | Good | Medium | Preventing future fragmentation |
| File Priority | Medium-High | Varies | Medium | Performance-critical files |
| Incremental | Low per pass | Gradual | Fastest | Background operation |
Algorithm 2: Fragment-Only Defragmentation
Only moves files that are actually fragmented, leaving contiguous files in place:
1. Scan file system for fragmented files
2. For each fragmented file:
a. Find smallest contiguous region that fits file
b. Move all fragments to that region
c. Update metadata
3. Skip files that are already contiguous
This minimizes data movement while addressing the primary performance issue.
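Step 2a is a classic best-fit search. The sketch below runs it over a hypothetical list of free extents; real file systems typically keep this information in free-space bitmaps or trees, but the selection logic is the same idea.
#include <stdint.h>
#include <stddef.h>
/* Hypothetical description of one run of contiguous free blocks. */
typedef struct {
    uint64_t start;    /* first free block of the run */
    uint64_t length;   /* number of contiguous free blocks */
} free_extent_t;
/* Best-fit search: return the index of the smallest free extent that can hold
 * `needed` blocks, or -1 if no single extent is large enough (in which case
 * the file cannot be made contiguous until free space is consolidated). */
static int find_best_fit(const free_extent_t *extents, size_t count, uint64_t needed)
{
    int best = -1;
    uint64_t best_len = UINT64_MAX;
    for (size_t i = 0; i < count; i++) {
        if (extents[i].length >= needed && extents[i].length < best_len) {
            best = (int)i;
            best_len = extents[i].length;
        }
    }
    return best;
}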
Algorithm 3: Free Space Consolidation Priority
Focuses on creating large contiguous free regions:
1. Identify 'holes' in the free space map
2. Move files adjacent to holes to consolidate
3. Prioritize moves that combine multiple holes
4. Continue until target free space contiguity achieved
This prevents future fragmentation more effectively than file-focused approaches.
Algorithm 4: Intelligent Placement
Considers file access patterns for optimal placement:
1. Analyze file access frequency and patterns
2. Place frequently-read files on faster disk regions
- Outer tracks on HDDs (higher linear velocity)
- (SSDs show little speed variation by logical address; see the SSD strategy below)
3. Group related files together (same directory, same app)
4. Place rarely-accessed files in slower regions
Real-world defragmenters typically combine multiple algorithms. They may do quick fragment-only passes for routine maintenance, intelligent placement for boot files, and full compaction during scheduled deep cleanups.
The core of defragmentation is moving data blocks safely and efficiently. This involves careful coordination between reading, writing, and metadata updates.
The Block Move Operation:
Moving a single block involves multiple steps:
1. Read source block into memory buffer
2. Write buffer contents to destination block
3. Verify write completed successfully (compare or checksum)
4. Update file system metadata to point to new location
5. Sync metadata to disk
6. Mark source block as free
Each step must complete before the next begins to maintain consistency.
Batching for Efficiency:
Moving blocks one at a time is inefficient due to I/O overhead. Practical implementations batch operations:
// Instead of moving 1 block at a time:
for each block:
read() → write() → sync()
// Batch reads, batch writes:
buffer = read_multiple_blocks(source_blocks, count=64)
write_multiple_blocks(dest_blocks, buffer, count=64)
barrier() // ensure writes complete
update_all_metadata()
sync()
mark_sources_free()
Larger batches improve throughput but require more memory and increase crash vulnerability windows.
/**
 * Safe block move operation with crash recovery support
 */
typedef struct {
    uint64_t file_id;
    uint64_t logical_offset;
    uint64_t old_physical;
    uint64_t new_physical;
    uint8_t  state;   // PENDING, COPIED, COMMITTED, DONE
} move_record_t;

int safe_move_blocks(move_record_t *moves, int count,
                     void *buffer, size_t buf_size)
{
    // Phase 1: Write move intent to journal
    for (int i = 0; i < count; i++) {
        moves[i].state = PENDING;
        write_journal_record(&moves[i]);
    }
    sync_journal();

    // Phase 2: Copy data to new locations
    for (int i = 0; i < count; i++) {
        if (read_block(moves[i].old_physical, buffer) < 0)
            return handle_read_error(i);
        if (write_block(moves[i].new_physical, buffer) < 0)
            return handle_write_error(i);
        moves[i].state = COPIED;
        update_journal_record(&moves[i]);
    }
    sync_device();  // Ensure all writes are persistent

    // Phase 3: Commit metadata changes
    begin_metadata_transaction();
    for (int i = 0; i < count; i++) {
        update_block_mapping(moves[i].file_id,
                             moves[i].logical_offset,
                             moves[i].new_physical);
        moves[i].state = COMMITTED;
    }
    commit_metadata_transaction();

    // Phase 4: Free old blocks and complete
    for (int i = 0; i < count; i++) {
        mark_block_free(moves[i].old_physical);
        moves[i].state = DONE;
        remove_journal_record(&moves[i]);
    }

    return count;  // Successfully moved count blocks
}

/**
 * Recovery function called on mount after crash
 */
void recover_defrag_state(void)
{
    move_record_t record;

    while (read_next_journal_record(&record)) {
        switch (record.state) {
        case PENDING:
            // Move never started - just remove record
            remove_journal_record(&record);
            break;
        case COPIED:
            // Data copied but metadata not updated
            // Could go either way - complete the move
            update_block_mapping(record.file_id,
                                 record.logical_offset,
                                 record.new_physical);
            mark_block_free(record.old_physical);
            remove_journal_record(&record);
            break;
        case COMMITTED:
            // Metadata updated, just need to free old block
            mark_block_free(record.old_physical);
            remove_journal_record(&record);
            break;
        case DONE:
            // Should not exist - remove stale record
            remove_journal_record(&record);
            break;
        }
    }
}
Memory Buffer Management:
Defragmenters must balance memory usage against operation efficiency: larger buffers allow fewer, larger I/O operations, but they consume RAM and widen the crash-vulnerability window described above.
Typical implementations use 1-32MB buffers, adjusting based on system memory pressure.
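As a rough sketch of how that adjustment might work, the function below sizes the move buffer from the amount of currently free RAM reported by Linux's sysinfo() and clamps it to the 1-32 MB range; the fraction and bounds are illustrative choices, not a specific tool's policy.
#include <stddef.h>
#include <sys/sysinfo.h>   /* Linux-specific */
#define MIN_BUF (1UL  * 1024 * 1024)   /* 1 MB floor */
#define MAX_BUF (32UL * 1024 * 1024)   /* 32 MB ceiling */
/* Choose a move buffer size: a small fraction of currently free RAM,
 * clamped to the 1-32 MB range mentioned above. */
static size_t choose_buffer_size(void)
{
    struct sysinfo si;
    if (sysinfo(&si) != 0)
        return MIN_BUF;                               /* be conservative on error */

    unsigned long long free_bytes =
        (unsigned long long)si.freeram * si.mem_unit;
    unsigned long long candidate = free_bytes / 64;   /* roughly 1.5% of free RAM */

    if (candidate < MIN_BUF) return MIN_BUF;
    if (candidate > MAX_BUF) return MAX_BUF;
    return (size_t)candidate;
}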
Defragmentation is inherently dangerous—it moves data while maintaining system consistency. A crash at the wrong moment could corrupt files or lose data. Ensuring safety requires careful protocols.
The Danger Window:
During a block move, there's a period where data exists in two locations (old and new). The critical question: which location is authoritative?
Move timeline with crash windows:
1. Copy block A from location 100 to location 200
2. [CRASH HERE] - Both copies exist, metadata points to 100
3. Update metadata: file now points to 200
4. [CRASH HERE] - Metadata points to 200, which is correct
5. Mark location 100 as free
6. [CRASH HERE] - Safe, old location freed, new location valid
A crash at step 2 is safe (old data still valid). A crash between steps 3 and 5 is also safe (new data valid). The system must never reach a state where neither location is clearly authoritative.
Never free the source block until the destination is confirmed written AND metadata is updated to point to the destination. This ensures that at every moment, either the old or new location contains valid data with valid metadata pointers.
Journaling for Defragmentation:
Most defragmenters use a journal (write-ahead log) to ensure crash recovery:
1. Write to journal: "Will move block X from A to B"
2. Perform the copy operation
3. Write to journal: "Block copied, updating metadata"
4. Update file system metadata
5. Write to journal: "Move complete"
6. Free old location
7. Remove journal entry
On restart after a crash, the system reads the journal and finishes or discards each in-flight move based on its last recorded state: moves that never copied data are simply dropped, while moves whose copy completed are carried through to the metadata update and the release of the old block.
Copy-on-Write Simplification:
Modern copy-on-write file systems (btrfs, ZFS) simplify defragmentation safety: relocated data is always written to previously unused space, and the metadata tree is updated atomically to reference the new copy, so the old version remains valid until the switch commits.
No explicit journal is needed; COW semantics guarantee consistency.
Not all files benefit equally from defragmentation. Intelligent selection maximizes impact while minimizing unnecessary work.
Factors Affecting Priority:
1. Fragment Count: Highly fragmented files benefit most. A file with 100 fragments sees dramatic improvement; a file with 2 fragments sees minimal change.
2. File Size: Large files have more to gain from contiguity—reading a 1GB contiguous file is massively faster than 1GB scattered across 10,000 fragments.
3. Access Patterns: Sequentially-read files (videos, images, archives) benefit greatly. Randomly-accessed files (databases) benefit less since they don't read sequentially anyway.
4. Access Frequency: Frequently-accessed files provide more total value when defragmented. A file read once per year isn't worth prioritizing.
5. File Importance: Boot files, system libraries, and application executables should be prioritized—they affect perceived system responsiveness.
| File Type | Typical Fragmentation | Access Pattern | Priority |
|---|---|---|---|
| Boot files (ntldr, kernel) | Medium | Sequential, frequent | Highest |
| System libraries (.dll, .so) | Low-Medium | Random, very frequent | High |
| Executables | Low | Sequential initial, then cached | High |
| Documents | High (frequent edits) | Sequential, moderate | Medium |
| Database files | Medium | Random access | Low |
| Log files | Very High (append-only) | Rare read | Low |
| Archive downloads | Minimal (write-once) | Rare read | Lowest |
| Temporary files | High | Temporary existence | Skip entirely |
Exclusion Lists:
Certain files should be excluded from defragmentation: page and swap files, hibernation files, temporary files, and anything the file system marks as unmovable (such as reserved metadata regions).
Smart Scheduling:
Modern defragmenters schedule based on value:
priority_score = (fragment_count × file_size × access_frequency) / move_cost
Process files in priority_score descending order
Stop when:
- Time budget exhausted, OR
- Remaining files below minimum priority threshold, OR
- Free space insufficient for further moves
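A sketch of how such scoring might look in code follows; the file_stat_t fields and the qsort-based ordering are illustrative assumptions rather than any particular defragmenter's implementation.
#include <stdint.h>
#include <stdlib.h>
/* Hypothetical per-file statistics gathered during the analysis phase. */
typedef struct {
    uint64_t fragment_count;    /* extents the file currently occupies */
    uint64_t size_blocks;       /* file size in blocks */
    double   access_frequency;  /* e.g. reads per day from access tracking */
    double   move_cost;         /* estimated I/O cost of relocating the file */
    double   priority;          /* computed below */
} file_stat_t;
static void score_file(file_stat_t *f)
{
    /* priority = (fragments x size x access frequency) / move cost */
    f->priority = ((double)f->fragment_count * (double)f->size_blocks *
                   f->access_frequency) / (f->move_cost > 0 ? f->move_cost : 1.0);
}
/* qsort comparator: highest priority first. */
static int by_priority_desc(const void *a, const void *b)
{
    double pa = ((const file_stat_t *)a)->priority;
    double pb = ((const file_stat_t *)b)->priority;
    return (pa < pb) - (pa > pb);
}
static void order_work_queue(file_stat_t *files, size_t count)
{
    for (size_t i = 0; i < count; i++)
        score_file(&files[i]);
    qsort(files, count, sizeof(files[0]), by_priority_desc);
    /* Files are then processed in order until the time budget, the minimum
     * priority threshold, or the free-space limit above is reached. */
}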
Typically, 20% of files cause 80% of fragmentation-related slowdowns. Defragmenting just the high-priority files delivers most of the performance benefit in a fraction of the time.
Beyond contiguity, where files are placed affects performance. Intelligent placement optimizes for access patterns and storage characteristics.
HDD Placement Strategy:
Hard drives have measurable performance differences across the platter: outer tracks pass under the head at higher linear velocity, so they deliver substantially higher sequential throughput than inner tracks.
Optimal HDD placement:
[BOOT][OS][APPS][FREQUENT DATA]...[RARELY ACCESSED][PAGE FILE][FREE]
^outer edge inner edge^
^fastest slowest^
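One plausible way to turn that layout into target regions is sketched below. It assumes that lower logical block addresses map to the faster outer tracks, which holds for most conventional drives, and the tier boundaries are illustrative.
#include <stdint.h>
/* Hypothetical placement tiers, hottest first, matching the layout above. */
typedef enum {
    TIER_BOOT = 0,        /* boot files, kernel */
    TIER_OS_APPS,         /* system libraries, executables */
    TIER_FREQUENT_DATA,   /* frequently accessed user data */
    TIER_COLD,            /* rarely accessed files, page file */
    TIER_COUNT
} placement_tier_t;
/* Fraction of the device (by LBA) where each tier's region ends.
 * Lower LBAs usually correspond to the faster outer tracks on HDDs. */
static const double tier_end_fraction[TIER_COUNT] = { 0.05, 0.25, 0.60, 1.00 };
/* Return the first LBA of the region reserved for a given tier. */
static uint64_t tier_region_start(placement_tier_t tier, uint64_t total_blocks)
{
    double start_fraction = (tier == TIER_BOOT) ? 0.0
                                                : tier_end_fraction[tier - 1];
    return (uint64_t)(start_fraction * (double)total_blocks);
}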
SSD Placement Strategy:
SSDs don't have seek time, but placement still matters:
Windows Boot-Time Defragmentation:
Windows performs specialized boot-time optimization:
# Windows boot optimization command
Defrag C: /B # Boot optimization
Defrag C: /O # Perform the proper optimization for each media type
Linux e4defrag Placement:
Linux's ext4 defragmenter focuses on extent contiguity:
# Defragment with optimal placement
e4defrag -c /path/to/check # Check fragmentation
e4defrag /path/to/defrag # Defragment
e4defrag -v /path # Verbose output
The ext4 allocator itself uses placement heuristics (the Orlov directory allocator, delayed allocation, and multi-block allocation) to minimize fragmentation from the start.
Defragmentation itself consumes system resources. Managing this impact is essential for practical use.
Resource Consumption:
1. I/O Bandwidth: Defragmentation is inherently I/O intensive—reading and rewriting potentially the entire disk. This competes with normal system I/O.
2. CPU Utilization: While not CPU-bound, defragmentation requires CPU for scanning metadata, computing checksums during verification, and planning and tracking moves.
3. Memory Usage: Buffers for data movement, data structures for tracking operations, and file system metadata caching consume RAM.
4. Wear (SSD Concern): Every block move writes data. On SSDs, this consumes write endurance.
| Mode | I/O Impact | Duration | System Usability |
|---|---|---|---|
| Aggressive/Full | Very High (near 100%) | Hours | System nearly unusable |
| Normal | High (50-80%) | Hours | Noticeably slower |
| Background | Low (5-20%) | Days to weeks | Minimal impact |
| Idle-only | High when active | Variable | No impact during use |
| Scheduled night | High during window | Window duration | Zero impact during day |
Throttling Mechanisms:
Modern defragmenters incorporate throttling to limit system impact:
// I/O throttling algorithm
while (more_work_to_do) {
batch = select_next_batch()
// Check system state
if (user_io_pending() || system_load_high()) {
sleep(BACKOFF_INTERVAL)
continue
}
// Perform work with rate limiting
perform_move(batch)
// Self-imposed delay between operations
if (throttle_mode == BACKGROUND) {
sleep(IO_INTERVAL * throttle_factor)
}
}
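The user_io_pending() and system_load_high() checks above are placeholders. One plausible implementation of the load check on Linux compares the 1-minute load average with the number of online CPUs, as sketched here; the 0.75 threshold is an illustrative choice.
#include <stdbool.h>
#include <stdlib.h>    /* getloadavg() on glibc/BSD */
#include <unistd.h>    /* sysconf() */
/* Consider the system "busy" when the 1-minute load average exceeds a
 * fraction of the available CPUs; the defragmenter then backs off. */
static bool system_load_high(void)
{
    double load[1];
    if (getloadavg(load, 1) != 1)
        return true;   /* can't tell - err on the side of backing off */

    long ncpus = sysconf(_SC_NPROCESSORS_ONLN);
    if (ncpus < 1)
        ncpus = 1;

    return load[0] > 0.75 * (double)ncpus;
}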
Windows Automatic Maintenance:
Windows uses sophisticated scheduling: drive optimization runs as part of Automatic Maintenance (weekly by default), waits for the system to be idle, and pauses when the user becomes active.
Defragmentation temporarily makes the system slower to eventually make it faster. The key is scheduling: aggressive defragmentation during active use hurts more than fragmentation itself. Background and scheduled approaches provide benefit without pain.
A complete defragmentation operation follows a well-defined lifecycle from initiation to completion.
Phase 1: Analysis
1. Scan entire file system structure
2. Build block allocation map
3. Identify fragmented files (extent count per file)
4. Calculate free space fragmentation
5. Estimate work required
6. Generate analysis report
This phase is read-only and typically fast (minutes on most systems).
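On Linux, the per-file extent count can be gathered read-only with the FIEMAP ioctl (the same interface the filefrag tool uses); a minimal sketch:
#include <fcntl.h>
#include <stdio.h>
#include <string.h>
#include <sys/ioctl.h>
#include <unistd.h>
#include <linux/fs.h>       /* FS_IOC_FIEMAP */
#include <linux/fiemap.h>   /* struct fiemap, FIEMAP_MAX_OFFSET */
/* Count the extents of a file. With fm_extent_count set to zero, the kernel
 * only reports how many extents the file has in fm_mapped_extents. */
static int count_extents(const char *path)
{
    int fd = open(path, O_RDONLY);
    if (fd < 0)
        return -1;

    struct fiemap fm;
    memset(&fm, 0, sizeof(fm));
    fm.fm_start        = 0;
    fm.fm_length       = FIEMAP_MAX_OFFSET;  /* whole file */
    fm.fm_extent_count = 0;                  /* ask only for the count */

    int ret = ioctl(fd, FS_IOC_FIEMAP, &fm);
    close(fd);
    return ret < 0 ? -1 : (int)fm.fm_mapped_extents;
}
int main(int argc, char **argv)
{
    for (int i = 1; i < argc; i++) {
        int extents = count_extents(argv[i]);
        if (extents < 0)
            perror(argv[i]);
        else
            printf("%s: %d extent(s)%s\n", argv[i], extents,
                   extents > 1 ? "  <- fragmented" : "");
    }
    return 0;
}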
Phase 2: Planning
1. Prioritize files based on fragmentation severity and importance
2. Identify destination regions for each file
3. Detect interdependencies (moving X enables moving Y)
4. Create move schedule respecting dependencies
5. Estimate duration and I/O requirements
Phase 3: Execution
1. Initialize journal for crash recovery
2. For each planned move:
a. Acquire file lock if possible
b. Execute move operation with journaling
c. Release lock
d. Update progress
e. Check for abort request
f. Apply throttling delays
3. Handle locked/unmovable files (skip or queue for later)
Phase 4: Completion
1. Clear journal entries
2. Verify file system integrity (optional but recommended)
3. Generate completion report with:
- Files defragmented
- Data moved (bytes)
- Time elapsed
- Before/after fragmentation metrics
4. Schedule next run if automatic
Handling Interruptions:
Well-designed defragmenters handle interruption gracefully: because every move is journaled, a cancel request, shutdown, or crash leaves the file system consistent, and the next run simply re-analyzes and resumes where useful work remains.
We've explored the complete mechanics of defragmentation—from algorithm selection to crash recovery to optimal file placement. This knowledge enables informed decisions about when and how to defragment.
Looking Ahead:
The next page examines the critical distinction between online and offline defragmentation—understanding when to defragment live file systems versus unmounted volumes, and the tradeoffs each approach entails.
You now understand the complete defragmentation process—from identifying fragmented files through safe data movement to optimal placement. This knowledge enables you to select appropriate tools, configure them effectively, and understand the tradeoffs involved in maintaining storage performance.