In 1988, three researchers at the University of California, Berkeley—David Patterson, Garth Gibson, and Randy Katz—published a paper that would fundamentally transform enterprise storage: "A Case for Redundant Arrays of Inexpensive Disks (RAID)." Their insight was revolutionary yet elegantly simple: instead of relying on a single, expensive, highly-reliable disk, why not combine multiple inexpensive disks to achieve both higher performance and better reliability than any single disk could provide?
This concept, now known as RAID (Redundant Array of Independent Disks), has become the cornerstone of virtually every storage system in data centers worldwide. From the database servers powering global financial transactions to the storage arrays hosting your favorite streaming service's content library, RAID technology ensures that your data survives hardware failures while maintaining the performance demands of modern applications.
By the end of this page, you will deeply understand the five most important RAID levels (0, 1, 5, 6, and 10), their internal architectures, how data is distributed across disks, the mathematical principles behind their reliability calculations, and the precise trade-offs that determine when each level is appropriate. You'll gain the knowledge to make informed storage architecture decisions in real-world systems.
Before diving into RAID levels, we must understand why RAID exists. The core challenge is deceptively simple: individual disk drives fail.
A modern enterprise hard drive might have a Mean Time Between Failures (MTBF) of 1.2 million hours—approximately 137 years. This sounds reassuring until you realize that MTBF represents the average time between failures across a large population of drives running continuously. For a single drive, the probability of failure in any given year is roughly:
$$P_{\text{annual failure}} = \frac{8760\ \text{hours/year}}{\text{MTBF}} \approx 0.73\% \text{ per year}$$
Now consider a data center with 10,000 drives. The expected number of drive failures per year becomes:
$$\text{Expected failures} = 10{,}000 \times 0.0073 = 73\ \text{failures/year}$$
That's more than one drive failing every five days. At this scale, disk failure isn't an exceptional event—it's a routine operational fact.
| Number of Disks | Expected Annual Failures | Average Days Between Failures |
|---|---|---|
| 10 | 0.073 | 5,000 days (13.7 years) |
| 100 | 0.73 | 500 days (1.37 years) |
| 1,000 | 7.3 | 50 days |
| 10,000 | 73 | 5 days |
| 100,000 | 730 | 12 hours |
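The table values follow directly from the per-drive annual failure probability derived above. As a quick sketch of the arithmetic (a minimal illustration, not a sizing tool):

```python
# Reproducing the table above from the per-drive annual failure probability.
ANNUAL_FAILURE_PROB = 8760 / 1_200_000   # ~0.73% for a 1.2M-hour MTBF drive

for disks in (10, 100, 1_000, 10_000, 100_000):
    expected_failures = disks * ANNUAL_FAILURE_PROB
    days_between = 365 / expected_failures
    print(f"{disks:>7} disks: {expected_failures:7.2f} failures/year, "
          f"~{days_between:.1f} days between failures")
```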
The implications are profound: any storage architecture that cannot tolerate disk failures will experience data loss. This is not a question of "if" but "when" and "how often." RAID addresses this reality by combining multiple disks into a single logical array that keeps serving data when individual drives fail, while also improving performance and aggregate capacity.
RAID is not a backup. RAID protects against hardware failure of disk drives. It does not protect against accidental deletion, file corruption, ransomware, software bugs, or catastrophic events affecting the entire array. A comprehensive data protection strategy requires both RAID for availability AND separate backups for durability.
RAID 0 is the simplest RAID level and, somewhat ironically given the name, the only one that provides zero fault tolerance. Despite being called a "RAID" level, it violates the fundamental premise of redundancy. Its sole purpose is performance optimization through parallelism.
Architecture and Operation:
In RAID 0, data is divided into fixed-size blocks (chunks) and distributed sequentially across all disks in the array; one row of chunks across every disk is called a stripe. If you have n disks and write a file consisting of blocks B₀, B₁, B₂, ..., Bₘ, block Bᵢ is placed on disk (i mod n): B₀ goes to disk 0, B₁ to disk 1, and so on, wrapping back to disk 0 after disk n−1.
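As a concrete illustration, here is a minimal Python sketch of that round-robin mapping. The function name and layout are illustrative assumptions, not any specific controller's scheme:

```python
# A minimal sketch of RAID 0 block placement, assuming simple round-robin
# striping across the member disks.

def raid0_locate(block_number: int, num_disks: int) -> tuple[int, int]:
    """Map a logical block number to (disk index, block offset on that disk)."""
    disk = block_number % num_disks        # which disk holds the block
    offset = block_number // num_disks     # which stripe row on that disk
    return disk, offset

# With 4 disks, blocks B0..B7 land on disks 0,1,2,3,0,1,2,3
for b in range(8):
    disk, offset = raid0_locate(b, num_disks=4)
    print(f"B{b} -> disk {disk}, row {offset}")
```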
The Stripe Size Decision:
The stripe size (also called chunk size) is a critical tuning parameter that significantly impacts performance:
Small stripe sizes (4KB-16KB): Optimize for large sequential I/O. A single large read/write operation will span multiple disks, maximizing parallelism and throughput.
Large stripe sizes (64KB-256KB): Optimize for small random I/O. Each small operation is more likely to be satisfied by a single disk, reducing coordination overhead but sacrificing parallelism for large operations.
The optimal stripe size depends on your workload's I/O pattern. Video editing systems with large sequential reads/writes benefit from small stripes, while database systems with small random transactions might benefit from larger stripes.
Reliability Analysis:
The reliability of a RAID 0 array is the probability that all disks survive. If each disk has a survival probability R over a given time period, the array survival probability is:
$$R_{RAID0} = R^n$$
For example, with 4 disks each having a 99% annual survival probability: $$R_{RAID0} = 0.99^4 = 0.9606 \approx 96\%$$
The 4-disk array is roughly four times more likely to fail in a given year than a single disk (about 3.9% versus 1%). With 8 disks: $$R_{RAID0} = 0.99^8 = 0.9227 \approx 92\%$$
This inverse relationship between capacity/performance and reliability makes RAID 0 suitable only for temporary data, caches, or scenarios where data can be completely regenerated.
RAID 0 should ONLY be used when all of the following are true: (1) Performance is critical, (2) The data is completely replaceable or already exists elsewhere, (3) Downtime for array reconstruction is acceptable. Common examples include video editing scratch disks, game installation drives, and swap space. NEVER use RAID 0 for production data, databases, or any data you cannot afford to lose.
RAID 1 represents the conceptually simplest approach to fault tolerance: maintain an exact duplicate of every block of data. Every write operation is simultaneously written to two or more disks, creating perfect mirror copies.
Architecture and Operation:
In a basic RAID 1 configuration with two disks, every block exists identically on both disks: writing block A places one copy on Disk 0 and an identical copy on Disk 1.
When a read operation is requested, the RAID controller can satisfy it from either disk, potentially load-balancing read operations across both mirrors. When a write operation is requested, the controller must update all mirror copies before acknowledging completion, ensuring consistency.
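This behavior can be modeled with a small toy class. It is purely illustrative; a real controller operates at the block-device layer and handles failures, rebuilds, and caching:

```python
# A toy RAID 1 mirror: every write goes to all member disks, and reads are
# load-balanced across members. Illustrative only, not a controller model.

class Raid1Mirror:
    def __init__(self, num_disks: int = 2):
        self.disks = [dict() for _ in range(num_disks)]  # block -> data
        self._next_read = 0

    def write(self, block: int, data: bytes) -> None:
        # The write completes only once every mirror copy is updated.
        for disk in self.disks:
            disk[block] = data

    def read(self, block: int) -> bytes:
        # Round-robin across mirrors to spread read traffic.
        disk = self.disks[self._next_read]
        self._next_read = (self._next_read + 1) % len(self.disks)
        return disk[block]

mirror = Raid1Mirror()
mirror.write(0, b"hello")
print(mirror.read(0), mirror.read(0))  # both reads return b'hello'
```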
Write Semantics and Consistency:
RAID 1 controllers must carefully manage write ordering to maintain consistency. There are two primary approaches:
Synchronous Mirroring: The controller waits for all mirror disks to acknowledge the write before returning success to the application. This ensures perfect consistency but adds latency equal to the slowest disk's write time.
Write Intent Logging: Some controllers maintain a small non-volatile log of in-progress writes. If power fails mid-write, the controller can use this log during recovery to identify and fix any inconsistencies.
The write penalty in RAID 1 is conceptually 2x for a two-disk mirror: every application write becomes two physical writes. However, modern controllers can often issue these writes simultaneously, so real-world write latency is typically close to a single-disk write latency (though I/O per second capacity is halved).
Reliability Analysis:
For a 2-way mirror, the array fails only if both disks fail. The probability of array failure equals the probability of the first disk failing AND the second disk failing before the first is replaced.
If the per-disk annual failure probability is p and the mean time to repair (replace and resynchronize) a failed disk is T_repair hours:
$$P_{\text{data loss}} \approx p \times \left(p \times \frac{T_{\text{repair}}}{8760\ \text{hours/year}}\right)$$
that is, the first disk fails at some point during the year, and its partner fails within the repair window before redundancy is restored. For a disk with a 1% annual failure rate and a 24-hour rebuild time: $$P_{\text{data loss}} \approx 0.01 \times \left(0.01 \times \frac{24}{8760}\right) \approx 2.7 \times 10^{-7}$$
This represents an improvement of more than four orders of magnitude (roughly 36,000x) over a single disk's 1% annual failure probability. For critical data requiring even higher reliability, 3-way mirrors reduce this further by requiring three simultaneous failures.
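The arithmetic behind these figures, as a quick sketch using the same assumptions (1% annual failure probability, 24-hour rebuild window):

```python
# Quick check of the two-way mirror data-loss estimate above.
p_annual = 0.01          # per-disk annual failure probability (1%)
t_repair_hours = 24      # time to replace and resynchronize a failed mirror

# First disk fails sometime in the year, and its partner fails
# within the repair window before redundancy is restored.
p_partner_during_rebuild = p_annual * (t_repair_hours / 8760)
p_data_loss = p_annual * p_partner_during_rebuild

print(f"P(data loss per year) ~ {p_data_loss:.2e}")           # ~2.7e-07
print(f"Improvement vs single disk: ~{p_annual / p_data_loss:,.0f}x")
```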
RAID 1 excels in scenarios requiring: (1) Maximum reliability with simplest implementation, (2) Fast rebuild times to minimize vulnerability windows, (3) High random read performance (transaction processing), (4) Operating system and boot drives where quick recovery is critical. It's less optimal when storage efficiency is paramount due to its 50% capacity overhead.
RAID 5 represents a sophisticated compromise between the performance of RAID 0 and the fault tolerance of RAID 1. It achieves redundancy through parity calculation rather than complete duplication, significantly improving storage efficiency while maintaining single-disk fault tolerance.
The Parity Principle:
Parity is based on the XOR (exclusive or) operation. For any set of data blocks, a parity block can be computed such that XORing all data blocks with the parity block equals zero. Mathematically:
$$P = D_1 \oplus D_2 \oplus D_3 \oplus ... \oplus D_n$$
The critical property is that any single missing element can be reconstructed by XORing all other elements:
$$D_k = D_1 \oplus D_2 \oplus ... \oplus D_{k-1} \oplus P \oplus D_{k+1} \oplus ... \oplus D_n$$
This means if one disk fails, its contents can be completely reconstructed from the remaining disks.
```python
# Demonstrating RAID 5 parity calculation
# Assume 3 data disks + 1 parity disk (conceptually)

# Example data blocks (8-bit values for simplicity)
disk1_data = 0b10110011  # Data block from disk 1
disk2_data = 0b01101100  # Data block from disk 2
disk3_data = 0b11001010  # Data block from disk 3

# Calculate parity (XOR of all data blocks)
parity = disk1_data ^ disk2_data ^ disk3_data
print(f"Disk 1: {bin(disk1_data)} ({disk1_data})")
print(f"Disk 2: {bin(disk2_data)} ({disk2_data})")
print(f"Disk 3: {bin(disk3_data)} ({disk3_data})")
print(f"Parity: {bin(parity)} ({parity})")

# Simulate disk 2 failure - reconstruct from the surviving disks
# D2 = D1 XOR D3 XOR P
reconstructed_disk2 = disk1_data ^ disk3_data ^ parity
print("\nDisk 2 failed! Reconstructing...")
print(f"Reconstructed Disk 2: {bin(reconstructed_disk2)} ({reconstructed_disk2})")
print(f"Matches original: {reconstructed_disk2 == disk2_data}")

# Output:
# Disk 1: 0b10110011 (179)
# Disk 2: 0b1101100 (108)
# Disk 3: 0b11001010 (202)
# Parity: 0b10101 (21)
#
# Disk 2 failed! Reconstructing...
# Reconstructed Disk 2: 0b1101100 (108)
# Matches original: True
```

Distributed Parity Architecture:
A key innovation in RAID 5 is distributed parity. Rather than dedicating a single disk to parity (which creates a write bottleneck as every write must update the parity disk), RAID 5 rotates parity blocks across all disks:
| Stripe | Disk 0 | Disk 1 | Disk 2 | Disk 3 |
|---|---|---|---|---|
| 0 | D₀ | D₁ | D₂ | P₀ |
| 1 | D₃ | D₄ | P₁ | D₅ |
| 2 | D₆ | P₂ | D₇ | D₈ |
| 3 | P₃ | D₉ | D₁₀ | D₁₁ |
| 4 | D₁₂ | D₁₃ | D₁₄ | P₄ |
This distribution ensures that write operations are spread evenly across all disks, eliminating single-disk bottlenecks and maximizing parallel write throughput.
The RAID 5 Write Penalty:
RAID 5 has a significant write performance consideration known as the write penalty or small write problem. When updating a single data block, the controller must:
1. Read the old data block
2. Read the old parity block
3. Write the new data block
4. Write the new parity block (computed as old parity XOR old data XOR new data)
This means every small write requires 4 I/O operations (2 reads + 2 writes), giving RAID 5 a write penalty of 4x for random small writes. For large sequential writes that span entire stripes, new parity can be calculated from all new data blocks without reading old data, avoiding this penalty.
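A minimal sketch of that incremental parity update (illustrative names, not any particular controller's code):

```python
# RAID 5 read-modify-write parity update for a small write to one block.

def raid5_small_write(old_data: int, new_data: int, old_parity: int) -> int:
    """Return the new parity after updating one data block.

    new_parity = old_parity XOR old_data XOR new_data
    (remove the old block's contribution, add the new block's).
    """
    return old_parity ^ old_data ^ new_data

# Example: a stripe of three data blocks plus parity
d1, d2, d3 = 0b10110011, 0b01101100, 0b11001010
parity = d1 ^ d2 ^ d3

# Overwrite d2 with a new value: 2 reads (old d2, old parity) + 2 writes
new_d2 = 0b00001111
parity = raid5_small_write(d2, new_d2, parity)

# The incrementally updated parity matches a full recomputation
assert parity == d1 ^ new_d2 ^ d3
```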
With modern high-capacity drives (8TB+), RAID 5 rebuild times can exceed 24 hours. During rebuild, the array is vulnerable to a second failure, and the intense read activity during rebuild can trigger latent errors in surviving disks. For large drives, RAID 6 or other solutions are increasingly mandatory.
RAID 6 extends RAID 5's concept by adding a second, independent parity calculation, enabling the array to survive the failure of any two disks simultaneously. This is critically important for large arrays with high-capacity drives where the probability of a second failure during rebuild becomes significant.
Dual Parity Mathematics:
The first parity (P) uses the same XOR calculation as RAID 5: $$P = D_1 \oplus D_2 \oplus D_3 \oplus ... \oplus D_n$$
The second parity (Q) uses a different mathematical operation based on Galois Field (GF) arithmetic, specifically GF(2⁸). This is computed as: $$Q = g^1 \cdot D_1 \oplus g^2 \cdot D_2 \oplus g^3 \cdot D_3 \oplus ... \oplus g^n \cdot D_n$$
where g is a generator of the Galois Field. The multiplication and addition operations follow GF(2⁸) rules, not standard arithmetic.
The critical property is that P and Q together allow reconstruction of any two missing data blocks by solving a system of two equations with two unknowns in GF(2⁸).
Why Two Different Parity Schemes?
Using two XOR-based parity calculations would not add any protection, because the second parity would be identical to the first. With two unknown blocks A and B, a single XOR equation only tells you the value of A ⊕ B, which has many possible solutions:
$$P = A \oplus B \oplus (\text{surviving data blocks})$$
The Galois Field approach produces a second, genuinely independent equation:
$$Q = g^a \cdot A \oplus g^b \cdot B \oplus (\text{surviving terms})$$
Because the coefficients g^a and g^b differ, these two independent equations uniquely determine both A and B.
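To make the dual-erasure recovery concrete, here is a small Python sketch. It assumes the generator g = 2 and the reducing polynomial 0x11D (x⁸ + x⁴ + x³ + x² + 1) commonly used by RAID 6 implementations; the helper names and the tiny three-disk example are illustrative:

```python
# A minimal sketch of RAID 6 dual-parity recovery in GF(2^8).

def gf_mul(a: int, b: int) -> int:
    """Multiply two bytes in GF(2^8) modulo the polynomial 0x11D."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11D
        b >>= 1
    return result

def gf_inv(a: int) -> int:
    """Multiplicative inverse in GF(2^8), by brute-force search (fine for a demo)."""
    return next(x for x in range(1, 256) if gf_mul(a, x) == 1)

def gf_pow(g: int, n: int) -> int:
    r = 1
    for _ in range(n):
        r = gf_mul(r, g)
    return r

# Three data bytes on disks 1..3, with coefficients g^1, g^2, g^3
data = {1: 0b10110011, 2: 0b01101100, 3: 0b11001010}
g = 2
P = data[1] ^ data[2] ^ data[3]
Q = 0
for i, d in data.items():
    Q ^= gf_mul(gf_pow(g, i), d)

# Disks 1 and 3 fail; reconstruct both from disk 2, P and Q
px = P ^ data[2]                        # = D1 ^ D3
qx = Q ^ gf_mul(gf_pow(g, 2), data[2])  # = g^1*D1 ^ g^3*D3
g1, g3 = gf_pow(g, 1), gf_pow(g, 3)
d1 = gf_mul(qx ^ gf_mul(g3, px), gf_inv(g1 ^ g3))
d3 = px ^ d1
print(d1 == data[1], d3 == data[3])     # True True
```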
Distribution Pattern:
| Stripe | Disk 0 | Disk 1 | Disk 2 | Disk 3 | Disk 4 |
|---|---|---|---|---|---|
| 0 | D₀ | D₁ | D₂ | P₀ | Q₀ |
| 1 | D₃ | D₄ | P₁ | Q₁ | D₅ |
| 2 | D₆ | P₂ | Q₂ | D₇ | D₈ |
| 3 | P₃ | Q₃ | D₉ | D₁₀ | D₁₁ |
| 4 | Q₄ | D₁₂ | D₁₃ | D₁₄ | P₄ |
When is RAID 6 Essential?
The probability of data loss during a RAID 5 rebuild increases with:
- Drive capacity (larger drives mean longer rebuild windows)
- The number of drives in the array (more drives means more chances of a second failure)
- The likelihood of hitting an unrecoverable read error (URE) on a surviving disk during the full-array read that rebuild requires
For arrays with drives 4TB or larger, or arrays with 8+ drives, RAID 6 becomes the prudent choice. The performance penalty is offset by dramatically improved reliability.
RAID 6 Variants:
Several algorithms implement RAID 6's dual parity, including Reed-Solomon-style P+Q coding (as described above), EVENODD, and Row-Diagonal Parity (RDP).
Some systems implement RAID 7 (proprietary triple-parity solutions) or RAID-Z3 (ZFS triple parity) for even higher reliability. These can survive three simultaneous failures, which becomes relevant for very large arrays with very high-capacity drives where triple failures during rebuild become statistically non-negligible.
RAID 10 (also written as RAID 1+0) combines RAID 1's mirroring with RAID 0's striping to achieve both excellent performance AND fault tolerance. It creates striped arrays of mirrored pairs, offering the best of both worlds at the cost of storage efficiency.
Architecture:
RAID 10 first creates mirrored pairs (RAID 1), then stripes data across those pairs (RAID 0):
```
                RAID 10 Array
                      |
          ┌───────────┴───────────┐
    Mirror Pair 0           Mirror Pair 1      (RAID 0 striping across pairs)
          |                       |
      ┌───┴───┐               ┌───┴───┐
   Disk 0   Disk 1         Disk 2   Disk 3     (RAID 1 mirrors within pairs)
      A       A               B       B
      C       C               D       D
      E       E               F       F
```
Data is written in stripes across the mirror pairs: block A goes to the first mirror pair (Disk 0 and Disk 1), block B goes to the second mirror pair (Disk 2 and Disk 3), and so on.
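A sketch of that placement, under the same round-robin assumption as the RAID 0 example (hypothetical helper, not a real controller's layout):

```python
# RAID 10 block placement: stripe across mirror pairs, then store both
# copies within the chosen pair. Illustrative only.

def raid10_locate(block_number: int, num_pairs: int) -> tuple[list[int], int]:
    """Return ([disk indices holding the block], row offset within the pair)."""
    pair = block_number % num_pairs
    offset = block_number // num_pairs
    disks = [2 * pair, 2 * pair + 1]   # both members of the mirror pair
    return disks, offset

# Four disks = two mirror pairs: A -> disks 0 and 1, B -> disks 2 and 3, ...
for b, name in enumerate("ABCDEF"):
    disks, row = raid10_locate(b, num_pairs=2)
    print(f"Block {name} -> disks {disks}, row {row}")
```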
Fault Tolerance Analysis:
RAID 10's fault tolerance is nuanced. The array can survive:
- Any single disk failure
- Up to n/2 simultaneous failures, provided each failed disk belongs to a different mirror pair
However, RAID 10 fails if both disks in any single mirror pair fail. This makes RAID 10's failure probability dependent on which disks fail, not just how many.
For a 4-disk RAID 10 array, any single failure is survivable, but of the six possible two-disk failure combinations, only the two that take out a complete mirror pair are fatal; roughly 67% of double failures are survivable (verified in the sketch below).
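The two-failure figure can be checked by enumerating every pair of failed disks (assuming, as in the diagram above, that disks 0-1 and 2-3 form the mirror pairs):

```python
# Enumerate two-disk failures in a 4-disk RAID 10 (pairs: {0,1} and {2,3})
# and count how many combinations the array survives.
from itertools import combinations

mirror_pairs = [{0, 1}, {2, 3}]

def survives(failed: set) -> bool:
    # The array survives as long as no mirror pair has lost both members.
    return all(not pair <= failed for pair in mirror_pairs)

combos = list(combinations(range(4), 2))
ok = sum(survives(set(c)) for c in combos)
print(f"{ok} of {len(combos)} two-disk failure combinations survivable")  # 4 of 6
```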
For workloads requiring high random write performance (databases, virtualization hosts, mail servers), RAID 10's lack of a parity write penalty often outweighs its storage efficiency disadvantage. The TCO (Total Cost of Ownership) may favor RAID 10 when factoring in fewer performance bottlenecks, faster rebuilds, and simpler management.
Selecting the appropriate RAID level requires balancing multiple factors: performance requirements, reliability needs, storage efficiency, budget, and workload characteristics. The following comprehensive comparison helps inform this decision:
| Characteristic | RAID 0 | RAID 1 | RAID 5 | RAID 6 | RAID 10 |
|---|---|---|---|---|---|
| Min Disks | 2 | 2 | 3 | 4 | 4 |
| Capacity Efficiency | 100% | 50% | 67-94% | 50-88% | 50% |
| Disk Failures Survived | 0 | n-1 | 1 | 2 | 1 to n/2 |
| Read Performance | Excellent | Good-Excellent | Excellent | Excellent | Excellent |
| Random Write Perf | Excellent | Moderate | Poor | Very Poor | Excellent |
| Sequential Write Perf | Excellent | Moderate | Good | Good | Excellent |
| Rebuild Performance | N/A | Fast | Slow | Very Slow | Fast |
| Write Penalty | 1x | 2x | 4x | 6x | 2x |
| Best For | Temp/Cache | OS/Boot | Read-heavy | Archive/Large | Databases |
Decision Framework:
- Temporary or fully reproducible data where raw speed matters most: RAID 0
- OS/boot volumes and small critical datasets needing the simplest, fastest recovery: RAID 1
- Read-heavy workloads where capacity efficiency matters and drive sizes are modest: RAID 5
- Large-capacity drives (4TB+) or larger arrays where a second failure during rebuild is a real risk: RAID 6
- Random-write-intensive workloads such as databases and virtualization hosts: RAID 10
There is no universally "best" RAID level. Each level embodies different trade-offs between performance, reliability, and cost. Understanding these trade-offs enables you to match RAID configuration to workload requirements—a skill essential for any systems architect or storage administrator.
We've covered the foundational RAID levels that form the backbone of enterprise storage. To consolidate: RAID 0 stripes for pure performance with no redundancy; RAID 1 mirrors for simplicity and fast recovery; RAID 5 trades a small-write penalty for efficient single-disk fault tolerance; RAID 6 adds a second, Galois Field-based parity to survive double failures; and RAID 10 combines mirroring with striping for write-intensive workloads at 50% capacity efficiency.
What's Next:
In the following pages, we'll explore the detailed mechanics of striping and mirroring, dive deep into parity calculations and algorithms, analyze RAID performance characteristics under various workloads, and examine the reliability mathematics that govern array survival probability.
You now understand the five primary RAID levels, their architectures, trade-offs, and appropriate use cases. This foundation prepares you for the deeper exploration of RAID mechanics in subsequent pages, where we'll examine exactly how striping, mirroring, and parity work at the block level.