Solving Transaction Problems in DBMS Interviews

Level: Advanced

Duration: 90 mins

Topic: Transaction Problems

Recovery Scenarios

Ensuring Durability Through Recovery

Systems fail. Power goes out. Disks crash. Software bugs cause segmentation faults. The D in ACID—Durability—promises that committed transactions survive any such failure.

Recovery scenarios test your understanding of how database systems use logging to survive failures and restore consistency. In interviews, you'll analyze log sequences, determine which operations to redo and undo after a crash, and reason about checkpoint optimization.

This page gives you complete mastery of recovery concepts, from fundamental logging principles to the industry-standard ARIES algorithm.

What You Will Master

By completing this page, you will be able to: (1) Understand Write-Ahead Logging (WAL) principles; (2) Analyze log records and determine transaction states after crash; (3) Apply the ARIES recovery algorithm (Analysis, Redo, Undo); (4) Reason about checkpoint mechanisms and their impact on recovery; (5) Solve recovery-focused interview problems systematically.

Recovery Fundamentals

Why Recovery Is Necessary

Database systems use volatile memory (RAM) for performance—buffer pools, lock tables, and transaction state all reside in memory. A crash loses this volatile state. Recovery must:

Ensure durability: Committed transaction effects must persist
Ensure atomicity: Uncommitted transaction effects must be removed
Restore consistency: Database must be in a consistent state after recovery

Types of Failures

Transaction Failure: A single transaction aborts (due to error, deadlock, etc.)

Recovery: Undo this transaction's changes
Other transactions unaffected

System Failure (Crash): Entire system fails (power outage, OS crash)

All buffer pool contents lost
Non-volatile storage (disk) intact
Recovery: Redo committed transactions, undo uncommitted

Media Failure: Disk failure loses data

Requires backup restoration
Most expensive; beyond scope of standard recovery

Storage and Failure Impact
Storage Type	Survives Crash?	Survives Disk Failure?	Examples
Volatile (RAM)	No	No	Buffer pool, lock table
Non-volatile (Disk)	Yes	No	Database files, log files
Stable (Replicated)	Yes	Yes	Replicated logs, backups

The Fundamental Trade-off

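Database systems face two independent choices: when a committed transaction's changes must reach disk (Force vs. No-Force), and whether uncommitted changes may reach disk (Steal vs. No-Steal).

Force policy: Write all of a transaction's changes to disk before commit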

Pro: Recovery is simple (committed = on disk)
Con: Terrible performance (random I/O for every commit)

No-Force policy: Allow commit before all changes are on disk

Pro: Fast commits (just write log sequentially)
Con: Must redo committed transactions during recovery

Steal policy: Allow uncommitted changes to be written to disk

Pro: Can evict dirty pages from buffer pool anytime
Con: Must undo uncommitted transactions during recovery

No-Steal policy: Never write uncommitted changes to disk

Pro: No undo needed during recovery
Con: Limits buffer pool flexibility, may run out of memory

Most systems use Steal/No-Force for performance, accepting complex recovery.

ARIES Uses Steal/No-Force

The ARIES recovery algorithm, which underlies recovery in many widely used DBMSs, is designed for Steal/No-Force policies. Recovery must therefore handle both redo (because of No-Force) and undo (because of Steal).
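The link between buffer policy and recovery work can be written as a two-line rule. The sketch below is only an illustration; the function name recovery_needs is hypothetical and not part of any DBMS.

```python
def recovery_needs(steal: bool, force: bool) -> dict:
    """Return which recovery passes a buffer policy requires."""
    return {
        "undo": steal,       # Steal: uncommitted changes may already be on disk
        "redo": not force,   # No-Force: committed changes may be missing from disk
    }


for steal in (False, True):
    for force in (True, False):
        name = f"{'Steal' if steal else 'No-Steal'}/{'Force' if force else 'No-Force'}"
        print(name, recovery_needs(steal, force))
```

Steal/No-Force is the only combination that needs both passes, which is why ARIES has both a Redo and an Undo phase.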

Write-Ahead Logging (WAL)

Write-Ahead Logging (WAL) is the fundamental mechanism enabling recovery. The principle is simple but critical:

The WAL Protocol

Rule 1 (Undo Logging): Before a page is flushed to disk (stolen), all log records for updates to that page must be flushed to stable storage.

Rule 2 (Redo Logging): Before a transaction commits, all log records for that transaction, including its COMMIT record, must be flushed to stable storage.

In essence: Always write the log BEFORE the data. If we crash:

The log tells us everything we did
We can undo or redo as needed
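The two rules can be enforced at exactly two points: when a page is written out and when a transaction commits. The following is a minimal in-memory sketch, assuming a hypothetical WalBufferManager class that tracks a flushed_lsn and a per-page pageLSN; real systems do the same bookkeeping with actual disk writes.

```python
class WalBufferManager:
    """Toy buffer manager that enforces the two WAL rules (illustrative only)."""

    def __init__(self):
        self.log = []            # log tail: list of (lsn, txn, text)
        self.flushed_lsn = 0     # highest LSN known to be on stable storage
        self.page_lsn = {}       # page id -> LSN of the last update to that page
        self.disk_pages = set()  # pages that have been written to disk

    def append(self, txn, text, page=None):
        lsn = len(self.log) + 1
        self.log.append((lsn, txn, text))
        if page is not None:
            self.page_lsn[page] = lsn
        return lsn

    def flush_log(self, up_to_lsn):
        # The log is sequential, so flushing up to an LSN covers all earlier records.
        self.flushed_lsn = max(self.flushed_lsn, up_to_lsn)

    def flush_page(self, page):
        # Rule 1: the page's log records reach stable storage before the page does.
        self.flush_log(self.page_lsn.get(page, 0))
        self.disk_pages.add(page)

    def commit(self, txn):
        # Rule 2: the transaction's records are flushed before commit returns.
        lsn = self.append(txn, "COMMIT")
        self.flush_log(lsn)
        return lsn


bm = WalBufferManager()
bm.append("T1", "UPDATE P1", page="P1")   # LSN 1
bm.append("T2", "UPDATE P2", page="P2")   # LSN 2
bm.flush_page("P1")                       # steal: forces the log up to LSN 1
lsn_after_steal = bm.flushed_lsn
commit_lsn = bm.commit("T2")              # forces the log up to the COMMIT record
```

Note that commit never touches data pages. Only the log is forced, which is what makes No-Force commits cheap.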

Why Sequential Log Writes Help Performance

Log writes are sequential (append-only), which is much faster than random I/O:

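Spinning-disk sequential write: roughly 100+ MB/s
Spinning-disk random write: roughly 1-5 MB/s (seek time dominates)

By logging changes sequentially and writing data pages lazily, commits pay only for a fast sequential log write while the slower random page writes happen in the background.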

Log Record Structure

A typical log record contains:

Field	Description
LSN	Log Sequence Number (unique, increasing)
TransID	Transaction that performed the operation
Type	Type of record (UPDATE, COMMIT, ABORT, etc.)
PageID	Page modified (for UPDATE records)
Offset	Position within page
Before Image	Value before the update (for undo)
After Image	Value after the update (for redo)
PrevLSN	LSN of previous log record for this transaction

Log Record Types

UPDATE: Records a data modification
COMMIT: Marks transaction as committed
ABORT: Marks transaction as aborted
END: Marks transaction as completely finished (after undo if needed)
CLR (Compensation Log Record): Records an undo operation (prevents re-undoing)
CHECKPOINT: Marks a recovery checkpoint
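The fields and record types above map naturally onto a small record type. This is a sketch of one possible in-memory representation, not a storage format; the names LogRecord and RecordType are illustrative, and the offset value 0 is a placeholder because the examples on this page do not give offsets.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class RecordType(Enum):
    UPDATE = "UPDATE"
    COMMIT = "COMMIT"
    ABORT = "ABORT"
    END = "END"
    CLR = "CLR"
    CHECKPOINT = "CHECKPOINT"


@dataclass(frozen=True)
class LogRecord:
    lsn: int                              # unique, increasing
    type: RecordType
    trans_id: Optional[str] = None        # None for CHECKPOINT records
    page_id: Optional[str] = None         # UPDATE and CLR records only
    offset: Optional[int] = None
    before: Optional[object] = None       # before image, used for undo
    after: Optional[object] = None        # after image, used for redo
    prev_lsn: Optional[int] = None        # previous record of the same transaction
    undo_next_lsn: Optional[int] = None   # CLR records only


# Two of T1's records, linked backwards through prev_lsn.
first = LogRecord(1, RecordType.UPDATE, "T1", "P1", 0, 10, 20, None)
third = LogRecord(3, RecordType.UPDATE, "T1", "P3", 0, 100, 120, prev_lsn=1)
```

The prev_lsn chain is what lets undo walk one transaction's records backwards without scanning the whole log.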

log_example.txt

Log Records

LSN   TransID   Type      PageID  Before  After   PrevLSN
---   -------   ----      ------  ------  -----   -------
001   T1        UPDATE    P1      10      20      null
002   T2        UPDATE    P2      50      55      null
003   T1        UPDATE    P3      100     120     001
004   T2        UPDATE    P1      20      25      002
005   T1        COMMIT    -       -       -       003
006   T2        UPDATE    P4      30      35      004
007   CHECKPOINT          (active: T2)
008   T2        UPDATE    P2      55      60      006
009   T3        UPDATE    P5      0       10      null
010   T2        COMMIT    -       -       -       008
---   CRASH    ---       ---     ---     ---     ---
 
Analysis:
- T1: Committed at LSN 005 → REDO all T1 updates (001, 003)
- T2: Committed at LSN 010 → REDO all T2 updates (002, 004, 006, 008)
- T3: No commit → UNDO T3 changes (009)
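The same classification can be done mechanically. The sketch below encodes the example log as (LSN, transaction, type) tuples and applies the simple rule used in the analysis: committed transactions are redone, all others are undone in reverse order. The function name classify is illustrative.

```python
LOG = [
    (1, "T1", "UPDATE"), (2, "T2", "UPDATE"), (3, "T1", "UPDATE"),
    (4, "T2", "UPDATE"), (5, "T1", "COMMIT"), (6, "T2", "UPDATE"),
    (7, None, "CHECKPOINT"), (8, "T2", "UPDATE"), (9, "T3", "UPDATE"),
    (10, "T2", "COMMIT"),
]


def classify(log):
    """Split transactions into redo (committed) and undo (uncommitted) sets."""
    seen, committed, updates = set(), set(), {}
    for lsn, txn, kind in log:
        if txn is None:          # checkpoint records belong to no transaction
            continue
        seen.add(txn)
        if kind == "COMMIT":
            committed.add(txn)
        elif kind == "UPDATE":
            updates.setdefault(txn, []).append(lsn)
    redo = {t: updates.get(t, []) for t in sorted(committed)}
    # Undo walks each loser's updates from newest to oldest.
    undo = {t: updates.get(t, [])[::-1] for t in sorted(seen - committed)}
    return redo, undo


redo, undo = classify(LOG)
print(redo)
print(undo)
```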

Before vs. After Images

Before images are used for UNDO (restoring old values). After images are used for REDO (reapplying changes). Some interview problems give a log with only one of the two images (an undo-only or redo-only log), so check which operations are actually possible with the data you are given.
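As a minimal sketch, redo and undo of a physical update are just assignments from the matching image. The dictionary layout and the function names here are illustrative; the record is T3's update at LSN 009 from the example log.

```python
def redo(pages, record):
    """Reapply an update using its after image."""
    pages[record["page"]] = record["after"]


def undo(pages, record):
    """Reverse an update using its before image."""
    pages[record["page"]] = record["before"]


pages = {"P5": 0}
rec = {"lsn": 9, "txn": "T3", "page": "P5", "before": 0, "after": 10}

redo(pages, rec)
after_redo = pages["P5"]
undo(pages, rec)
after_undo = pages["P5"]
```

If a record carried no "before" key, undo would fail, which is the practical meaning of a redo-only log.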

The ARIES Recovery Algorithm

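ARIES (Algorithms for Recovery and Isolation Exploiting Semantics) is the best-known recovery algorithm for Steal/No-Force systems. It originated at IBM and is used by IBM DB2, and Microsoft SQL Server's recovery follows the same design. Other engines, such as PostgreSQL, also rely on write-ahead logging but handle undo differently.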

ARIES Principles

Write-Ahead Logging: All changes logged before writing to disk
Repeat History: During recovery, replay ALL actions (even for aborted transactions)
Logging Changes During Undo: Undo operations are logged as CLRs to prevent re-undoing on repeated crashes

The Three Phases of ARIES Recovery

ARIES Three-Phase Recovery

•Analysis Phase: Scan the log forward from the last checkpoint. Determine: (a) which transactions were active at the crash (losers), (b) which pages might need redo (dirty page table), (c) where to start the redo phase.
•Redo Phase: Scan the log forward from the smallest RecLSN in the dirty page table. Redo ALL updates, including those of uncommitted transactions, to recreate the exact state at the crash. This 'repeats history.'
•Undo Phase: Scan the log backward from the end. Undo all updates by 'loser' transactions, writing a CLR for each undo. Continue until every loser transaction is completely undone.

ARIES Data Structures
Structure	Purpose	Contents
Transaction Table	Track active transactions	TransID, Status, LastLSN, UndoNextLSN
Dirty Page Table (DPT)	Track dirty pages in buffer	PageID, RecLSN (first LSN that dirtied it)
Log	Record all changes	Sequence of log records (UPDATE, COMMIT, CLR, etc.)

Compensation Log Records (CLRs)

When undoing an operation, ARIES writes a CLR to the log. A CLR contains:

The undo action taken, recorded as a redo-only change (a CLR is never itself undone)
UndoNextLSN: Points to the next record to undo for that transaction (the PrevLSN of the record just undone)

Why CLRs matter: If the system crashes during recovery (during the undo phase), we don't want to undo the same operation again. CLRs ensure idempotence—we can crash and restart recovery any number of times and get the same result.
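The effect of UndoNextLSN is easiest to see by simulating a crash in the middle of a rollback. The sketch below is a simplification under stated assumptions: the log is a list of dictionaries, the pages dictionary stands in for the state that Redo would rebuild after a restart, and the max_steps parameter is a hypothetical way to stop the rollback early.

```python
def rollback(log, pages, txn, max_steps=None):
    """Undo txn's updates, appending one CLR per undone record.

    max_steps simulates a crash part-way through the rollback.
    Returns True once the transaction is fully undone.
    """
    last = [r for r in log if r["txn"] == txn][-1]
    # Resume point: a CLR says where to continue, an UPDATE is undone itself.
    next_lsn = last["undo_next"] if last["type"] == "CLR" else last["lsn"]
    by_lsn = {r["lsn"]: r for r in log}
    steps = 0
    while next_lsn is not None and (max_steps is None or steps < max_steps):
        rec = by_lsn[next_lsn]
        pages[rec["page"]] = rec["before"]
        log.append({"lsn": log[-1]["lsn"] + 1, "txn": txn, "type": "CLR",
                    "page": rec["page"], "after": rec["before"],
                    "undo_next": rec["prev"]})
        next_lsn = rec["prev"]
        steps += 1
    return next_lsn is None


log = [
    {"lsn": 1, "txn": "T1", "type": "UPDATE", "page": "P1",
     "before": "A", "after": "B", "prev": None},
    {"lsn": 2, "txn": "T1", "type": "UPDATE", "page": "P2",
     "before": "C", "after": "D", "prev": 1},
]
pages = {"P1": "B", "P2": "D"}

done_first = rollback(log, pages, "T1", max_steps=1)   # "crash" after one undo
done_second = rollback(log, pages, "T1")               # restart and finish
clr_lsns = [r["lsn"] for r in log if r["type"] == "CLR"]
```

On restart, the newest record for T1 is the CLR at LSN 3, whose UndoNextLSN is 1. The rollback therefore resumes at LSN 1 and never touches LSN 2 a second time.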

Repeating History

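A common interview question: 'Why redo uncommitted transactions in ARIES?' Answer: Repeating history recreates the exact database state at the moment of the crash, so every page matches what the log says happened to it. Undo can then run against a known state, and CLRs written by an earlier interrupted recovery are replayed like any other record. This is simpler and safer than trying to redo selectively.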

Checkpoint Mechanisms

Checkpoints reduce recovery time by limiting how much log must be scanned. Without checkpoints, recovery must scan the entire log from the beginning—potentially hours of work for a long-running system.

Checkpoint Types

Naive Checkpoint (System Halt):

Stop accepting new transactions
Wait for all active transactions to complete
Flush all dirty pages to disk
Write checkpoint record
Resume operation

Pro: Simple, recovery starts from checkpoint
Con: System unavailable during checkpoint (unacceptable for production)

Fuzzy Checkpoint (ARIES-style):

Write BEGIN_CHECKPOINT record
Record current Transaction Table and Dirty Page Table
Write END_CHECKPOINT record
Continue normal operation throughout

Pro: No system pause
Con: Recovery must process some log before checkpoint

Fuzzy Checkpoint Contents

An ARIES fuzzy checkpoint records:

Active Transaction Table: List of transactions in progress at checkpoint time, with their LastLSN
Dirty Page Table: List of dirty pages and their RecLSN (first LSN that dirtied the page)
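A fuzzy checkpoint is little more than a snapshot of the two tables written between a pair of marker records. The sketch below assumes an in-memory log list and a hypothetical fuzzy_checkpoint function. It also simplifies by writing the two markers back to back, whereas in a running system other records may land between them.

```python
import copy


def fuzzy_checkpoint(log, txn_table, dirty_pages):
    """Append BEGIN/END checkpoint records without flushing any data pages."""
    begin_lsn = len(log) + 1
    log.append({"lsn": begin_lsn, "type": "BEGIN_CHECKPOINT"})
    # Snapshot the tables so that later activity cannot change what was logged.
    snapshot = {
        "txn_table": copy.deepcopy(txn_table),
        "dirty_pages": dict(dirty_pages),
    }
    log.append({"lsn": len(log) + 1, "type": "END_CHECKPOINT", **snapshot})
    return begin_lsn   # the LSN a master record would point to


log = [{"lsn": 1, "type": "UPDATE"}, {"lsn": 2, "type": "UPDATE"}]
txn_table = {"T1": {"status": "active", "last_lsn": 2}}
dirty_pages = {"P1": 1}

begin = fuzzy_checkpoint(log, txn_table, dirty_pages)
txn_table["T1"]["last_lsn"] = 5   # later activity leaves the snapshot untouched
```

Because no pages are flushed, the checkpoint's Dirty Page Table may list pages dirtied long before the checkpoint, which is why redo can start earlier than the checkpoint itself.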

Using Checkpoints in Recovery

Analysis Phase starts at: Last checkpoint's BEGIN_CHECKPOINT record

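Redo Phase starts at: The smallest RecLSN in the Dirty Page Table as it stands at the end of Analysis (the checkpoint's DPT plus any pages dirtied afterwards).

This is called the RedoLSN, the earliest point from which an update might not yet be on disk.

Undo Phase reaches back to: The oldest log record of any loser transaction, which may lie well before the checkpoint. This bounds how far back undo reads, not where redo starts.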

Checkpoint Frequency Trade-off

Frequent checkpoints:

Pro: Faster recovery (less log to process)
Con: More I/O overhead during normal operation

Infrequent checkpoints:

Pro: Less overhead during normal operation
Con: Slower recovery after crash

Checkpoint Analysis in Recovery

Determine recovery start points from checkpoint information.

Input

Checkpoint at LSN 100:
- Active Transactions: {T1: LastLSN=95, T2: LastLSN=80}
- Dirty Page Table: {P1: RecLSN=60, P2: RecLSN=90}

Subsequent log:
LSN 101: T1 UPDATE P3
LSN 102: T3 UPDATE P4
LSN 103: T2 COMMIT
LSN 104: T1 COMMIT
--- CRASH ---

Output

Analysis Phase:
- Start from LSN 100 (checkpoint)
- At LSN 101: Update T1 (LastLSN=101), add P3 to DPT (RecLSN=101)
- At LSN 102: Add T3 to active set (LastLSN=102), add P4 to DPT (RecLSN=102)
- At LSN 103: Remove T2 from active set (committed)
- At LSN 104: Remove T1 from active set (committed)
- Losers at crash: {T3}

Redo Phase starts at:
- RedoLSN = min(RecLSN) = min(60, 90, 101, 102) = 60

Redo from LSN 60 forward, reapplying every update whose effect is not already on the page.

Undo Phase:
- Undo T3's changes (LSN 102)
- Write CLR for the undo
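This example is small enough to check in a few lines. The sketch below starts from the checkpoint's tables, scans the log tail, and reports the losers and the redo start point. The tuple layout and the function name analyze are illustrative.

```python
def analyze(checkpoint, tail):
    """Rebuild the active set and dirty page table from a checkpoint and log tail."""
    active = dict(checkpoint["active"])   # txn -> LastLSN
    dpt = dict(checkpoint["dirty"])       # page -> RecLSN
    for lsn, txn, kind, page in tail:
        if kind == "UPDATE":
            active[txn] = lsn
            dpt.setdefault(page, lsn)     # keep the first LSN that dirtied the page
        elif kind == "COMMIT":
            active.pop(txn, None)
    # Assumes at least one dirty page; an empty table would mean nothing to redo.
    return active, dpt, min(dpt.values())


checkpoint = {"active": {"T1": 95, "T2": 80}, "dirty": {"P1": 60, "P2": 90}}
tail = [
    (101, "T1", "UPDATE", "P3"),
    (102, "T3", "UPDATE", "P4"),
    (103, "T2", "COMMIT", None),
    (104, "T1", "COMMIT", None),
]

losers, dpt, redo_lsn = analyze(checkpoint, tail)
print(losers, dpt, redo_lsn)
```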

Master Record

ARIES maintains a 'master record' on disk pointing to the last checkpoint. On recovery, the system reads this to find where to start. Without it, you'd need to scan the entire log to find checkpoints.

Complete Recovery Walk-through

Let's work through a complete ARIES-style recovery scenario step by step.

Problem Setup

recovery_scenario.txt

Log Sequence

Log before crash:
 
LSN   Trans  Type       Page  Before  After  PrevLSN
---   -----  ----       ----  ------  -----  -------
10    T1     UPDATE     P1    A       B      null
20    T2     UPDATE     P2    C       D      null
30    T1     UPDATE     P3    E       F      10
40    CHECKPOINT: ActiveTxns={T1:30, T2:20}, DirtyPages={P1:10, P2:20, P3:30}
50    T3     UPDATE     P4    G       H      null
60    T1     UPDATE     P2    D       I      30
70    T2     COMMIT     -     -       -      20
80    T1     UPDATE     P1    B       J      60
90    T3     UPDATE     P5    K       L      50
---   CRASH ---
 
Post-crash disk state (due to steal policy, some pages flushed):
- P1 contains 'B' (T1's first update at LSN 10, not second at 80)
- P2 contains 'D' (T2's update at LSN 20, not T1's at 60)
- P3 contains 'F' (T1's update at LSN 30)
- P4, P5 in original state (no flushes)

Phase 1: Analysis

Start at checkpoint (LSN 40)

Initial state from checkpoint:

Transaction Table: {T1: status=active, lastLSN=30}, {T2: status=active, lastLSN=20}
Dirty Page Table: {P1: recLSN=10}, {P2: recLSN=20}, {P3: recLSN=30}

Scan forward from LSN 50:

LSN 50: T3 UPDATE P4

Add T3 to Transaction Table: {T3: lastLSN=50}
Add P4 to DPT: {P4: recLSN=50}

LSN 60: T1 UPDATE P2

Update T1: lastLSN=60
P2 already in DPT (recLSN stays 20)

LSN 70: T2 COMMIT

Mark T2 as committed, remove from loser set

LSN 80: T1 UPDATE P1

Update T1: lastLSN=80
P1 already in DPT

LSN 90: T3 UPDATE P5

Update T3: lastLSN=90
Add P5 to DPT: {P5: recLSN=90}

End of Analysis:

Winners (committed): {T2}
Losers (uncommitted at crash): {T1, T3}
DPT: {P1:10, P2:20, P3:30, P4:50, P5:90}
RedoLSN = min(RecLSNs) = 10
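The Analysis pass above can be reproduced with a short script. This is a sketch over the walk-through's log, with the checkpoint contents passed in separately. The tuple layout and the function name analysis are illustrative.

```python
LOG = [
    # (lsn, txn, type, page, before, after, prev_lsn)
    (10, "T1", "UPDATE", "P1", "A", "B", None),
    (20, "T2", "UPDATE", "P2", "C", "D", None),
    (30, "T1", "UPDATE", "P3", "E", "F", 10),
    (40, None, "CHECKPOINT", None, None, None, None),
    (50, "T3", "UPDATE", "P4", "G", "H", None),
    (60, "T1", "UPDATE", "P2", "D", "I", 30),
    (70, "T2", "COMMIT", None, None, None, 20),
    (80, "T1", "UPDATE", "P1", "B", "J", 60),
    (90, "T3", "UPDATE", "P5", "K", "L", 50),
]
CHECKPOINT = {"txns": {"T1": 30, "T2": 20},
              "dpt": {"P1": 10, "P2": 20, "P3": 30}}


def analysis(log, checkpoint_lsn, checkpoint):
    """Return (losers with lastLSN, dirty page table, RedoLSN)."""
    txns = {t: {"status": "active", "last_lsn": lsn}
            for t, lsn in checkpoint["txns"].items()}
    dpt = dict(checkpoint["dpt"])
    for lsn, txn, kind, page, *_ in log:
        if lsn <= checkpoint_lsn:
            continue                      # only scan records after the checkpoint
        if kind == "UPDATE":
            txns.setdefault(txn, {"status": "active"})["last_lsn"] = lsn
            dpt.setdefault(page, lsn)     # recLSN stays at the first dirtying LSN
        elif kind == "COMMIT":
            txns.setdefault(txn, {})["status"] = "committed"
    losers = {t: s["last_lsn"] for t, s in txns.items() if s["status"] == "active"}
    return losers, dpt, min(dpt.values())


losers, dpt, redo_lsn = analysis(LOG, 40, CHECKPOINT)
print(losers, dpt, redo_lsn)
```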

Phase 2: Redo

Scan forward from RedoLSN (LSN 10):

For each UPDATE, check if redo is needed:

Redo if page's pageLSN < record's LSN
(If page was flushed after this update, pageLSN ≥ LSN, skip redo)

LSN 10: T1 UPDATE P1 (A→B)

Disk copy of P1 already holds 'B', so the flushed page carries pageLSN = 10
pageLSN ≥ 10: skip redo

LSN 20: T2 UPDATE P2 (C→D)

Disk copy of P2 already holds 'D' (pageLSN = 20): skip redo

LSN 30: T1 UPDATE P3 (E→F)

Disk copy of P3 already holds 'F' (pageLSN = 30): skip redo

LSN 50: T3 UPDATE P4 (G→H)

P4 never flushed, pageLSN < 50
Redo: Set P4 to 'H', set P4.pageLSN = 50

LSN 60: T1 UPDATE P2 (D→I)

Redo: Set P2 to 'I', set P2.pageLSN = 60

LSN 70: COMMIT (no redo for commits)

LSN 80: T1 UPDATE P1 (B→J)

Redo: Set P1 to 'J', set P1.pageLSN = 80

LSN 90: T3 UPDATE P5 (K→L)

Redo: Set P5 to 'L', set P5.pageLSN = 90

After Redo: Database reflects exact state at crash (including uncommitted changes)
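The pageLSN test can be applied mechanically to this scenario. The sketch below makes one explicit assumption: the on-disk pageLSNs match the post-crash disk state in the problem setup (P1 = 10, P2 = 20, P3 = 30, and 0 for the never-flushed P4 and P5). Under that assumption the records at LSN 10, 20, and 30 are skipped because their effects are already on disk, and the final page contents are the same as if every record had been reapplied.

```python
UPDATES = [  # (lsn, page, after image) for every UPDATE in the log
    (10, "P1", "B"), (20, "P2", "D"), (30, "P3", "F"), (50, "P4", "H"),
    (60, "P2", "I"), (80, "P1", "J"), (90, "P5", "L"),
]
DPT = {"P1": 10, "P2": 20, "P3": 30, "P4": 50, "P5": 90}
# On-disk pages as (value, pageLSN). The pageLSN values are assumptions.
disk = {"P1": ("B", 10), "P2": ("D", 20), "P3": ("F", 30),
        "P4": ("G", 0), "P5": ("K", 0)}


def redo_phase(updates, dpt, pages, redo_lsn):
    """Reapply every update whose effect is not yet on its page."""
    redone = []
    for lsn, page, after in updates:
        # Cheap checks first: nothing to do if the page is not dirty
        # or was first dirtied after this record.
        if lsn < redo_lsn or page not in dpt or dpt[page] > lsn:
            continue
        _, page_lsn = pages[page]
        if page_lsn >= lsn:               # update already reflected on the page
            continue
        pages[page] = (after, lsn)        # apply the after image, stamp pageLSN
        redone.append(lsn)
    return redone


redone = redo_phase(UPDATES, DPT, disk, redo_lsn=10)
values = {page: value for page, (value, _) in disk.items()}
print(redone, values)
```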

Phase 3: Undo

Losers: {T1, T3}

Build ToUndo list from lastLSN of each loser:

T1: lastLSN = 80
T3: lastLSN = 90

ToUndo = {90, 80} (always process the largest LSN first)

Undo LSN 90: T3 UPDATE P5 (K→L)

Write CLR: "Undo LSN 90, set P5 back to 'K'"
CLR.UndoNextLSN = 90.PrevLSN = 50
Add new LSN (91) to log
Update P5 to 'K'
ToUndo = {80, 50} (added T3's previous: 50)

Undo LSN 80: T1 UPDATE P1 (B→J)

Write CLR: "Undo LSN 80, set P1 back to 'B'"
CLR.UndoNextLSN = 80.PrevLSN = 60
Add new LSN (92) to log
Update P1 to 'B'
ToUndo = {60, 50}

Undo LSN 60: T1 UPDATE P2 (D→I)

Write CLR, set P2 back to 'D'
UndoNextLSN = 30
ToUndo = {50, 30}

Undo LSN 50: T3 UPDATE P4 (G→H)

Write CLR, set P4 back to 'G'
UndoNextLSN = null (first record for T3)
T3 fully undone, write END record for T3
ToUndo = {30}

Undo LSN 30: T1 UPDATE P3 (E→F)

Write CLR, set P3 back to 'E'
UndoNextLSN = 10
ToUndo = {10}

Undo LSN 10: T1 UPDATE P1 (A→B)

Write CLR, set P1 back to 'A'
UndoNextLSN = null
T1 fully undone, write END record for T1
ToUndo = {} (empty)

Recovery Complete!
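The Undo pass can be replayed in code to confirm the order of operations and the final page contents. This sketch keeps ToUndo as a max-heap and starts from the page state left by Redo. The LSNs assigned to records after 92 are an assumption, since the walk-through does not number them.

```python
import heapq

UPDATES = {  # lsn -> (txn, page, before image, prev_lsn)
    10: ("T1", "P1", "A", None), 30: ("T1", "P3", "E", 10),
    50: ("T3", "P4", "G", None), 60: ("T1", "P2", "D", 30),
    80: ("T1", "P1", "B", 60),   90: ("T3", "P5", "K", 50),
}
pages = {"P1": "J", "P2": "I", "P3": "F", "P4": "H", "P5": "L"}  # state after Redo


def undo_phase(updates, pages, losers, next_lsn):
    """Undo all loser updates, largest LSN first, returning the appended records."""
    to_undo = [-lsn for lsn in losers.values()]   # max-heap via negation
    heapq.heapify(to_undo)
    appended = []
    while to_undo:
        lsn = -heapq.heappop(to_undo)
        txn, page, before, prev = updates[lsn]
        pages[page] = before                       # restore the before image
        appended.append((next_lsn, txn, "CLR", lsn, prev))   # prev is UndoNextLSN
        next_lsn += 1
        if prev is None:                           # first record of the transaction
            appended.append((next_lsn, txn, "END", None, None))
            next_lsn += 1
        else:
            heapq.heappush(to_undo, -prev)
    return appended


appended = undo_phase(UPDATES, pages, {"T1": 80, "T3": 90}, next_lsn=91)
undo_order = [rec[3] for rec in appended if rec[2] == "CLR"]
ended = [rec[1] for rec in appended if rec[2] == "END"]
print(undo_order, pages)
```

P2 ends at 'D', not 'C': T2 committed, so its update at LSN 20 survives while T1's later overwrite is rolled back.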

Interview Problem Patterns

Recovery problems in interviews follow predictable patterns. Here's how to recognize and solve each type.

Pattern: Given a log, identify which transactions committed (winners) and which didn't (losers).

Method:

Scan log from beginning (or checkpoint)
Track all transaction IDs seen
For each COMMIT record, mark that transaction as winner
At the crash point, every transaction seen without a COMMIT record is a loser

Quick Check: Transactions with COMMIT record = winners. All others = losers.

Tricky case: A transaction with an ABORT record is not a winner. If its END record is missing, the crash interrupted its rollback, and recovery must finish the undo.

Common Interview Questions

'Why does ARIES redo uncommitted transactions?' Answer: To repeat history and recreate the exact crash state.
'Why write CLRs during undo?' Answer: To prevent re-undoing if we crash during recovery.
'Why are checkpoints needed?' Answer: To limit recovery time by providing a known starting point for the log scan.

Transaction State Diagrams

Understanding transaction state transitions helps with recovery analysis.

Transaction States

ACTIVE: Transaction is executing
PARTIALLY COMMITTED: Transaction has finished its final statement, but commit not yet logged
COMMITTED: Commit record written to log (durable)
FAILED: Error occurred, transaction will abort
ABORTED: Abort complete, changes undone

State Transition Diagram

ACTIVE → PARTIALLY COMMITTED → COMMITTED
ACTIVE → FAILED → ABORTED
PARTIALLY COMMITTED → FAILED
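State transitions can be checked mechanically with a small lookup table. The sketch below assumes the standard textbook transitions between the five states listed above: ACTIVE may move to PARTIALLY COMMITTED or FAILED, PARTIALLY COMMITTED to COMMITTED or FAILED, and FAILED to ABORTED. The names TRANSITIONS and is_valid_path are illustrative.

```python
TRANSITIONS = {
    "ACTIVE": {"PARTIALLY_COMMITTED", "FAILED"},
    "PARTIALLY_COMMITTED": {"COMMITTED", "FAILED"},
    "FAILED": {"ABORTED"},
    "COMMITTED": set(),   # terminal
    "ABORTED": set(),     # terminal
}


def is_valid_path(states):
    """True if each consecutive pair of states is an allowed transition."""
    return all(b in TRANSITIONS[a] for a, b in zip(states, states[1:]))


print(is_valid_path(["ACTIVE", "PARTIALLY_COMMITTED", "COMMITTED"]))
print(is_valid_path(["COMMITTED", "ABORTED"]))
```

The second path is rejected: once the commit record is durable, a transaction can no longer abort.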

Recovery and Transaction States

On recovery, determine each transaction's state:

Last Record Seen	Recovery Action
COMMIT	No undo; Redo reapplies any of its changes missing from disk (winner)
ABORT followed by END	No action (already undone)
ABORT without END	Complete the undo
UPDATE (no COMMIT/ABORT)	Full undo needed (loser)
END	Transaction fully complete
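The table translates directly into a decision function. This is a sketch that looks only at which record types a transaction has in the log. The function name and the returned strings are illustrative.

```python
def recovery_action(record_types):
    """Map one transaction's log record types to the action recovery takes."""
    kinds = set(record_types)
    if "END" in kinds:
        return "no action"            # commit or rollback fully finished
    if "COMMIT" in kinds:
        return "winner"               # never undone
    if "ABORT" in kinds:
        return "complete the undo"    # crash interrupted the rollback
    return "full undo"                # loser: no COMMIT or ABORT seen


print(recovery_action(["UPDATE", "UPDATE", "COMMIT"]))
print(recovery_action(["UPDATE", "ABORT", "CLR"]))
```

Checking END first matters: an aborted transaction that reached END needs nothing more, while one that stopped at ABORT still has undo work left.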

The END Record

The END record marks complete transaction termination:

Written after COMMIT when all changes are durable
Written after ABORT when all undo is complete
Key insight: If ABORT exists but no END, undo was incomplete (crash during abort)

Partially Committed vs Committed

A transaction is 'partially committed' after finishing execution but BEFORE the commit record is durable. If system crashes in this state, the transaction is a loser—it must be undone. Only after the COMMIT record is on stable storage is the transaction truly committed.

Summary: Mastering Recovery Scenarios

Recovery is the mechanism that makes the ACID durability guarantee possible. Let's consolidate the essential skills:

Key Takeaways

•WAL is fundamental: Write log before data; log enables both redo and undo.
•ARIES uses three phases: Analysis (determine what to do), Redo (repeat history), Undo (remove uncommitted changes).
•Repeat history: ARIES redoes ALL operations including losers' to restore exact crash state before undoing.
•CLRs prevent re-undo: Compensation records during undo ensure idempotent recovery.
•Checkpoints limit recovery time: Fuzzy checkpoints allow concurrent operation while providing recovery start points.

Module Complete!

You have now completed the Transaction Problems module. You've mastered:

Schedule Analysis: Reading and interpreting transaction schedules
Serializability Testing: Precedence graphs and conflict detection
Lock Analysis: 2PL protocols and lock compatibility
Deadlock Detection: Wait-for graphs and prevention strategies
Recovery Scenarios: ARIES algorithm and checkpoint analysis

These skills prepare you for the most challenging transaction-related interview questions at any level.

Congratulations! You now have comprehensive mastery of transaction problem-solving for DBMS interviews. From schedule analysis through serializability, locking, deadlocks, and recovery—you can tackle any transaction-related interview question with confidence and precision.
