When a transaction fails and its rollback completes, it enters the Aborted state. This is the terminal state for unsuccessful transactions—the point at which the database has undone every effect the transaction had on the data, as if it had never run.
The Aborted state is the mirror image of the Committed state:
Both are terminal states—once reached, the transaction's lifecycle is complete. But while Committed transactions leave their mark on the database forever, Aborted transactions leave no trace whatsoever in the final database state.
Understanding the Aborted state completes our picture of the transaction state machine and provides crucial insight into how databases maintain atomicity—the 'A' in ACID.
By the end of this page, you will understand the formal definition of the Aborted state, what 'cleaned up' means at a technical level, how abort affects concurrency and locks, when and how to retry after an abort, and the complete transaction state diagram.
Let's establish a precise understanding of what it means for a transaction to be in the Aborted state.
Formal Definition:
A transaction T is in the Aborted state if and only if all of the following hold:

- T was previously in the Failed state
- The rollback procedure has run to completion
- Every change T made to the database has been undone
- All locks held by T have been released
- All resources allocated to T have been freed

Using formal notation:
T ∈ Aborted ⟺
(previous_state(T) = Failed) ∧
(rollback_complete(T) = true) ∧
(changes_undone(T) = true) ∧
(locks_released(T) = true) ∧
(resources_freed(T) = true)
Key Characteristics of the Aborted State:
| Property | Value | Implication |
|---|---|---|
| State Type | Terminal (final) | No more transitions possible |
| Data Effect | None | Database unchanged from before transaction |
| Visible Changes | None | Other transactions see pre-transaction state |
| Locks | Released | No resources blocked |
| Log Records | Remain | ABORT record in log for recovery information |
| Transaction Descriptor | Deallocated | Memory freed |
The Aborted state is the embodiment of the Atomicity property. Atomicity guarantees 'all or nothing'—either a transaction's changes are fully applied (Committed), or they are fully reversed (Aborted). There is no in-between state where some changes persist and others don't.
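This all-or-nothing behavior is easy to observe directly. Below is a minimal sketch using Python's built-in sqlite3 module (any transactional database behaves the same way; the table and values are illustrative):

```python
import sqlite3

# In-memory database with a single account row
conn = sqlite3.connect(":memory:")
conn.isolation_level = None  # autocommit mode; we manage transactions explicitly
conn.execute("CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance INTEGER)")
conn.execute("INSERT INTO accounts VALUES (100, 1000)")

# Begin a transaction, make several changes, then abort it
conn.execute("BEGIN")
conn.execute("UPDATE accounts SET balance = 0 WHERE id = 100")
conn.execute("INSERT INTO accounts VALUES (200, 500)")
conn.execute("ROLLBACK")  # Failed → Aborted: every change is reversed together

# The pre-transaction state is fully restored: no partial effects remain
row = conn.execute("SELECT balance FROM accounts WHERE id = 100").fetchone()
count = conn.execute("SELECT COUNT(*) FROM accounts").fetchone()[0]
print(row, count)  # (1000,) 1
```

Note that both the UPDATE and the INSERT are reversed as one unit; there is no outcome in which only one of them survives.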
The Complete Transaction State Diagram:
With the Aborted state, we can now present the complete transaction state machine:
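The complete machine can be captured compactly in code. The sketch below encodes the standard five states of the textbook model and their legal transitions (the `Transaction` class and its method names are purely illustrative, not a real database API):

```python
from enum import Enum

class TxState(Enum):
    ACTIVE = "active"
    PARTIALLY_COMMITTED = "partially_committed"
    COMMITTED = "committed"    # terminal: changes are durable
    FAILED = "failed"
    ABORTED = "aborted"        # terminal: changes fully undone

# Legal transitions in the classic five-state model
TRANSITIONS = {
    TxState.ACTIVE: {TxState.PARTIALLY_COMMITTED, TxState.FAILED},
    TxState.PARTIALLY_COMMITTED: {TxState.COMMITTED, TxState.FAILED},
    TxState.FAILED: {TxState.ABORTED},  # rollback completes
    TxState.COMMITTED: set(),           # terminal: no way out
    TxState.ABORTED: set(),             # terminal: no way out
}

class Transaction:
    def __init__(self):
        self.state = TxState.ACTIVE

    def transition(self, new_state: TxState):
        if new_state not in TRANSITIONS[self.state]:
            raise ValueError(f"illegal transition {self.state.name} -> {new_state.name}")
        self.state = new_state

# A failed transaction has exactly one possible fate: Aborted
t = Transaction()
t.transition(TxState.FAILED)
t.transition(TxState.ABORTED)
print(t.state.name)  # ABORTED
```

The empty transition sets for Committed and Aborted are the code-level expression of 'terminal state': once either is reached, no further transition is legal.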
While we say an aborted transaction leaves 'no trace' in the database, this is true from the perspective of data but not from the perspective of system internals. Let's examine what remains and what is removed.
What Is Removed (The User Perspective):
From the application's viewpoint, an aborted transaction might as well have never happened:

- Every row modification is reversed; the data matches its pre-transaction state
- No other transaction ever observes any of its changes
- Its locks are released and its resources are freed
What Remains (The System Perspective):
While user-visible effects are erased, the system retains some information:

- Log records, including the ABORT record (needed for recovery)
- Consumed sequence and auto-increment values (see below)
- Dead row versions in MVCC systems, pending cleanup
- Statistics counters, such as the rollback count
```sql
-- Sequence values are NOT rolled back
-- This is a common source of confusion

CREATE SEQUENCE order_id_seq;

-- Transaction 1:
BEGIN;
INSERT INTO orders (id, ...) VALUES (nextval('order_id_seq'), ...);
-- Gets order_id = 1
ROLLBACK;  -- Transaction aborted

-- Transaction 2:
BEGIN;
INSERT INTO orders (id, ...) VALUES (nextval('order_id_seq'), ...);
-- Gets order_id = 2, NOT 1!
COMMIT;

-- Result: order_id 1 is "missing" from the orders table
-- This is a feature, not a bug - sequences are designed for
-- high concurrency and don't participate in transaction rollback

-- Same behavior applies to:
--   Serial columns (use sequences internally)
--   Identity columns
--   Some auto-increment implementations
```

Not everything can be rolled back: (1) sequence values are consumed and not returned, (2) DDL in some databases auto-commits and can't be rolled back, (3) external API calls made during the transaction remain, (4) files written to disk by procedures aren't removed, (5) notifications and alerts sent are not recalled. Design your transactions with these limitations in mind.
When a transaction reaches the Aborted state, all its locks are released. This has important implications for other transactions that may have been waiting.
Lock Release Sequence:

1. The rollback completes, restoring every piece of data the transaction modified.
2. The ABORT record is written to the log.
3. All locks held by the transaction are released (under strict two-phase locking, this is the first moment they can be).
4. Transactions waiting on those locks are woken and compete to acquire them.
Impact on Waiting Transactions:
```sql
-- Demonstrating how abort releases locks and unblocks waiters

-- Session 1: Acquire exclusive lock
BEGIN;
UPDATE accounts SET balance = 1000 WHERE id = 100;
-- Holds X lock on row id=100

-- Session 2: Tries to access same row, BLOCKS
BEGIN;
UPDATE accounts SET balance = 2000 WHERE id = 100;
-- Waiting... (blocked by Session 1's X lock)

-- Session 3: Also waiting for same row
BEGIN;
SELECT * FROM accounts WHERE id = 100 FOR UPDATE;
-- Also waiting...

-- Session 1: Encounters error or decides to rollback
ROLLBACK;
-- Transitions: Failed → (rollback) → Aborted
-- All locks released!

-- Session 2 and 3: Now unblocked!
-- One of them acquires the lock and proceeds
-- The other waits for the new lock holder

-- After Session 2 completes, Session 3 can proceed
```

The Cascading Effect:
When a long-running transaction aborts, it may trigger a 'cascade' of activity:

- The rollback itself must undo a large volume of changes, which takes time.
- Every transaction queued behind its locks wakes at once and competes for them.
- A burst of dead row versions is left behind for cleanup.
This is why long-running transactions are problematic—their abort can cause as much disruption as their execution.
Read Consistency After Abort:
In MVCC systems, abort affects version visibility:
```sql
-- MVCC visibility after abort

-- Initial state: balance = 1000

-- Transaction T1:
BEGIN;
UPDATE accounts SET balance = 5000 WHERE id = 100;
-- Creates new row version with T1's transaction ID
-- Concurrent readers may or may not see this depending on snapshots

-- Transaction T2 (started after T1's update):
BEGIN ISOLATION LEVEL READ COMMITTED;
SELECT balance FROM accounts WHERE id = 100;
-- Would see 5000 if T1 had committed; sees 1000 while T1 is uncommitted/aborted
-- Visibility check asks: "Is T1 committed?" No → show old version

-- T1 aborts:
ROLLBACK;
-- T1 is now marked as aborted

-- T2's next read:
SELECT balance FROM accounts WHERE id = 100;
-- Returns 1000 - the original value
-- The new version created by T1 is marked as created by an aborted
-- transaction and is therefore invisible (and will be vacuumed later)

COMMIT;

-- The row version T1 created still physically exists on disk
-- But it's effectively invisible - no transaction will ever see it
-- VACUUM will eventually remove it
```

In MVCC databases, aborted transactions leave 'dead' row versions in the table. These are invisible to all transactions but consume space. Regular maintenance (VACUUM in PostgreSQL, InnoDB background threads in MySQL) cleans up these dead tuples. Heavy abort rates increase maintenance load.
How does the database treat aborted transactions during crash recovery? This is crucial for understanding the durability of abort decisions.
Scenario: Abort Before Crash
If a transaction was aborted before a crash:

- Its ABORT record is already in the log.
- Recovery sees the record and treats the transaction as fully handled.
- No undo work is needed; the abort decision is durable.
Scenario: Active Transaction During Crash
If a transaction was active when the crash occurred:

- The log contains no COMMIT or ABORT record for it.
- Recovery's undo phase rolls back its logged changes.
- An ABORT record is written; the transaction ends in the Aborted state.
Scenario: Rollback In Progress During Crash
If the crash occurred during rollback (transaction in the Failed state):

- Compensation log records (CLRs) mark which operations were already undone.
- Recovery resumes from the last CLR's undo-next-LSN instead of repeating completed undos.
- The remaining operations are undone and an ABORT record is written.
```
// Recovery processing for transactions that need to abort

function recovery_undo_phase(active_transactions):
    // active_transactions = transactions that were in Active or Failed
    // state at crash time (no COMMIT or complete ABORT in log)
    undo_list = active_transactions.copy()

    // Process log from end to beginning
    current_lsn = log.end_position()

    while undo_list is not empty:
        record = read_log_record(current_lsn)

        if record.transaction_id in undo_list:
            if record.type == CLR:
                // This is a compensation record - skip to undo-next-lsn
                // The original operation was already undone before crash
                if record.undo_next_lsn == NULL:
                    // This transaction's rollback is complete
                    write_abort_record(record.transaction_id)
                    undo_list.remove(record.transaction_id)
                else:
                    current_lsn = record.undo_next_lsn
                    continue

            else if record.type == UPDATE:
                // Need to undo this update
                page = read_page(record.page_id)
                if page.lsn >= record.lsn:
                    // Page has this update; undo it
                    apply_undo(record.old_value)
                    write_clr(record.transaction_id, record)

            else if record.type == BEGIN:
                // Reached the beginning of this transaction
                write_abort_record(record.transaction_id)
                undo_list.remove(record.transaction_id)

        current_lsn = record.prev_lsn

    // All previously active transactions are now Aborted
```

Once a transaction reaches the Aborted state (whether through normal execution or recovery), it remains aborted permanently. The ABORT record in the log ensures this is durable. If there's another crash after recovery, the already-aborted transactions won't be processed again—their ABORT records indicate they're already handled.
When a transaction is aborted, the application must decide whether and how to retry. This decision depends on why the abort occurred and the nature of the operation.
Framework for Retry Decisions:
| Abort Cause | Retry? | Strategy | Example |
|---|---|---|---|
| Deadlock victim | Yes | Immediate retry with backoff | Two transfers deadlocked |
| Serialization failure | Yes | Immediate retry | MVCC conflict |
| Lock timeout | Yes | Retry with exponential backoff | Heavily contended rows |
| Constraint violation | Not automatically | Fix the data, then retry | Duplicate primary key |
| Explicit user cancel | No | N/A - intentional | User clicked Cancel |
| System resource exhaustion | Yes | Wait, then retry | Out of memory |
| Connection lost | Yes | Reconnect and retry | Network hiccup |
```python
import time
import random
from enum import Enum
from dataclasses import dataclass
from typing import Callable, TypeVar, Optional

class AbortReason(Enum):
    DEADLOCK = "deadlock"
    SERIALIZATION = "serialization_failure"
    LOCK_TIMEOUT = "lock_timeout"
    CONSTRAINT = "constraint_violation"
    USER_CANCEL = "user_cancel"
    RESOURCE = "resource_exhaustion"
    CONNECTION = "connection_lost"
    UNKNOWN = "unknown"

@dataclass
class RetryPolicy:
    max_attempts: int
    initial_delay: float  # seconds
    max_delay: float
    exponential_base: float = 2.0
    jitter: bool = True

# Default policies by abort reason
DEFAULT_POLICIES = {
    AbortReason.DEADLOCK: RetryPolicy(max_attempts=5, initial_delay=0.01, max_delay=1.0),
    AbortReason.SERIALIZATION: RetryPolicy(max_attempts=5, initial_delay=0.01, max_delay=1.0),
    AbortReason.LOCK_TIMEOUT: RetryPolicy(max_attempts=3, initial_delay=0.5, max_delay=10.0),
    AbortReason.RESOURCE: RetryPolicy(max_attempts=3, initial_delay=1.0, max_delay=30.0),
    AbortReason.CONNECTION: RetryPolicy(max_attempts=3, initial_delay=0.5, max_delay=5.0),
    # These should not be retried automatically
    AbortReason.CONSTRAINT: None,
    AbortReason.USER_CANCEL: None,
    AbortReason.UNKNOWN: None,
}

T = TypeVar('T')

def execute_with_smart_retry(
    operation: Callable[[], T],
    classify_error: Callable[[Exception], AbortReason],
    custom_policies: Optional[dict] = None,
) -> T:
    """
    Execute an operation with intelligent retry logic based on abort reason.

    :param operation: Function to execute (should manage its own transaction)
    :param classify_error: Function to classify an exception into AbortReason
    :param custom_policies: Optional custom retry policies
    :return: Result of successful operation
    :raises: Last exception if all retries exhausted or non-retriable error
    """
    policies = {**DEFAULT_POLICIES, **(custom_policies or {})}
    attempt = 0

    while True:
        try:
            return operation()
        except Exception as e:
            reason = classify_error(e)
            policy = policies.get(reason)

            if policy is None:
                # Non-retriable error
                print(f"Non-retriable abort: {reason.value}")
                raise

            attempt += 1
            if attempt > policy.max_attempts:
                print(f"Max retries ({policy.max_attempts}) exceeded for {reason.value}")
                raise

            # Calculate delay with exponential backoff
            delay = min(
                policy.initial_delay * (policy.exponential_base ** (attempt - 1)),
                policy.max_delay,
            )
            # Add jitter to prevent thundering herd
            if policy.jitter:
                delay = delay * (0.5 + random.random())

            print(f"Abort ({reason.value}), attempt {attempt}/{policy.max_attempts}, "
                  f"retrying in {delay:.2f}s")
            time.sleep(delay)

# Example usage (get_connection() is assumed to return a DB-API connection)
def transfer_funds(from_id: int, to_id: int, amount: float):
    """Business operation that may be retried."""
    with get_connection() as conn:
        with conn.cursor() as cur:
            cur.execute("BEGIN")
            # ... perform transfer ...
        conn.commit()

def classify_postgres_error(e: Exception) -> AbortReason:
    """Classify PostgreSQL errors into retry categories."""
    if hasattr(e, 'pgcode'):
        code = e.pgcode
        if code == '40001':
            return AbortReason.SERIALIZATION
        elif code == '40P01':
            return AbortReason.DEADLOCK
        elif code in ('23505', '23503', '23502', '23514'):
            return AbortReason.CONSTRAINT
        elif code.startswith('08'):
            return AbortReason.CONNECTION
    return AbortReason.UNKNOWN

# Execute with retry
result = execute_with_smart_retry(
    lambda: transfer_funds(1, 2, 100.00),
    classify_postgres_error,
)
```

Before retrying an aborted transaction, ensure the operation is idempotent—running it twice should have the same effect as running it once. If the transaction was partially visible to other transactions (shouldn't happen with proper isolation, but edge cases exist), retrying might cause duplicate effects. Use unique transaction identifiers to detect and handle duplicates.
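One concrete way to get that idempotency is an idempotency key: the operation inserts a unique identifier in the same transaction as its effects, so a retried duplicate violates a unique constraint and aborts cleanly instead of applying twice. A minimal sketch with SQLite (the table and key names are illustrative):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.isolation_level = None  # autocommit; transactions managed explicitly
conn.execute("CREATE TABLE transfers (idem_key TEXT PRIMARY KEY, amount INTEGER)")
conn.execute("CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance INTEGER)")
conn.execute("INSERT INTO accounts VALUES (1, 1000)")

def transfer_once(idem_key: str, amount: int) -> bool:
    """Apply the transfer at most once; return False if already applied."""
    conn.execute("BEGIN")
    try:
        # The unique key makes a duplicate attempt violate the primary key
        conn.execute("INSERT INTO transfers VALUES (?, ?)", (idem_key, amount))
        conn.execute("UPDATE accounts SET balance = balance - ? WHERE id = 1",
                     (amount,))
        conn.execute("COMMIT")
        return True
    except sqlite3.IntegrityError:
        conn.execute("ROLLBACK")  # duplicate: the whole attempt is aborted
        return False

first = transfer_once("req-42", 100)   # applied
retry = transfer_once("req-42", 100)   # detected as duplicate, aborted
balance = conn.execute("SELECT balance FROM accounts WHERE id = 1").fetchone()[0]
print(first, retry, balance)  # True False 900
```

Because the key insert and the balance update sit in one transaction, atomicity guarantees the duplicate check and the effect can never diverge: either both happened or neither did.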
Monitoring abort patterns helps identify system problems and application issues. Here's how to track aborted transactions across different database systems.
Key Metrics to Monitor:

- Abort/rollback rate as a share of finished transactions
- Deadlock frequency
- Average rollback duration
- Dead tuple accumulation (MVCC systems)
- Lock wait times
```sql
-- Calculate abort/rollback rate by database
SELECT
    datname,
    xact_commit AS commits,
    xact_rollback AS rollbacks,
    ROUND(100.0 * xact_rollback / NULLIF(xact_commit + xact_rollback, 0), 2)
        AS abort_percentage,
    conflicts AS replication_conflicts
FROM pg_stat_database
WHERE datname NOT LIKE 'template%'
ORDER BY abort_percentage DESC;

-- Healthy systems typically see < 1% abort rate
-- > 5% abort rate suggests problems

-- Monitor conflict-specific aborts (replication)
SELECT
    datname,
    confl_tablespace,
    confl_lock,
    confl_snapshot,
    confl_bufferpin,
    confl_deadlock,
    (confl_tablespace + confl_lock + confl_snapshot +
     confl_bufferpin + confl_deadlock) AS total_conflicts
FROM pg_stat_database_conflicts;

-- Dead tuple accumulation (indicates abort/update activity)
SELECT
    schemaname,
    relname,
    n_live_tup,
    n_dead_tup,
    ROUND(100.0 * n_dead_tup / NULLIF(n_live_tup + n_dead_tup, 0), 2)
        AS dead_tuple_pct,
    last_vacuum,
    last_autovacuum
FROM pg_stat_user_tables
WHERE n_dead_tup > 1000
ORDER BY n_dead_tup DESC;
```

Alerting Thresholds:
Set up alerts based on your system's baseline. Example thresholds:
| Metric | Warning | Critical | Notes |
|---|---|---|---|
| Abort rate | 2% | 5% | Baseline depends on application |
| Deadlocks/hour | 10 | 50 | Should be near zero normally |
| Avg rollback time | 100ms | 1s | Long rollbacks block resources |
| Dead tuple ratio | 10% | 25% | May indicate vacuum issues |
| Lock wait time | 1s avg | 5s avg | Indicates contention |
A sudden increase in abort rate is more concerning than a stable but slightly elevated rate. Establish baselines during normal operation, then alert on significant deviations. Correlate abort spikes with deployments, traffic patterns, or system changes.
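The baseline-deviation idea can be sketched as a simple classifier over the abort-rate metric. The absolute thresholds mirror the table above; the commit/rollback counts would come from `pg_stat_database` in practice, and the function names and spike factor here are illustrative:

```python
def abort_rate(commits: int, rollbacks: int) -> float:
    """Abort rate as a percentage of finished transactions."""
    total = commits + rollbacks
    return 100.0 * rollbacks / total if total else 0.0

def classify(current_rate: float, baseline_rate: float,
             warn_pct: float = 2.0, crit_pct: float = 5.0,
             spike_factor: float = 3.0) -> str:
    """Flag absolute thresholds and sudden deviations from baseline."""
    if current_rate >= crit_pct:
        return "critical"
    # A sudden multiple of the baseline is concerning even below 'critical'
    if baseline_rate > 0 and current_rate >= spike_factor * baseline_rate:
        return "warning:spike"
    if current_rate >= warn_pct:
        return "warning"
    return "ok"

# Stable 0.5% baseline; today 1.8% - below the absolute warning threshold,
# but more than 3x the baseline, so it is flagged as a spike
baseline = abort_rate(commits=99_500, rollbacks=500)   # 0.5%
today = abort_rate(commits=98_200, rollbacks=1_800)    # 1.8%
print(classify(today, baseline))  # warning:spike
```

The spike check fires before the absolute check precisely because, as noted above, a sudden increase is more telling than a stable but slightly elevated rate.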
While some aborts are unavoidable (and even desirable—like aborting on constraint violations), excessive aborts waste resources and hurt performance. Here are best practices to minimize unnecessary aborts:
```sql
-- Techniques to prevent unnecessary aborts

-- 1. Use ON CONFLICT instead of letting constraint abort
INSERT INTO users (email, name)
VALUES ('alice@example.com', 'Alice')
ON CONFLICT (email) DO UPDATE SET name = EXCLUDED.name;
-- Never aborts on duplicate email

-- 2. Use SELECT FOR UPDATE SKIP LOCKED for queue processing
-- Instead of waiting and risking deadlock/timeout:
BEGIN;
SELECT * FROM job_queue
WHERE status = 'pending'
ORDER BY created_at
LIMIT 1
FOR UPDATE SKIP LOCKED;  -- Skip rows locked by other transactions
-- Process the job...
COMMIT;

-- 3. Use advisory locks for coordinating without row locks
SELECT pg_advisory_lock(12345);   -- Acquire advisory lock
-- Do work that would otherwise contend on rows
SELECT pg_advisory_unlock(12345);

-- 4. Check before attempting operations
DO $$
DECLARE
    current_balance DECIMAL;
BEGIN
    -- Check first (outside main logic)
    SELECT balance INTO current_balance FROM accounts WHERE id = 100;

    IF current_balance < 500 THEN
        RAISE EXCEPTION 'Insufficient balance';
    END IF;

    -- Now do the actual update
    UPDATE accounts SET balance = balance - 500 WHERE id = 100;
END $$;
```

Some aborts are correct behavior: constraint violations protecting data integrity, serialization failures maintaining correctness, user cancellations respecting user intent. The goal is to minimize UNNECESSARY aborts from poor design, not to eliminate all aborts. A system with zero aborts might be missing important integrity checks.
We've completed our exploration of the transaction state machine with the Aborted state—the terminal state for unsuccessful transactions. Let's consolidate our understanding:
Module Complete: Transaction States
You've now mastered the complete transaction state model:

- Active: the transaction is executing its operations
- Partially Committed: the final operation has executed; changes await durability
- Committed: changes are permanent and visible (terminal)
- Failed: normal execution can no longer proceed; rollback begins
- Aborted: rollback is complete; no changes remain (terminal)
This state machine is fundamental to understanding how databases maintain ACID properties, especially Atomicity (A) and Durability (D). Every transaction you ever execute follows this state model, whether you're aware of it or not.
Congratulations! You now have a comprehensive understanding of transaction states—from Active through to the terminal states of Committed and Aborted. This knowledge is essential for designing reliable applications, troubleshooting transaction-related issues, and understanding database behavior during both normal operation and recovery scenarios.