Database Management SystemsTimestamp Ordering

Timestamp Ordering Protocol

LevelIntermediate

Duration60 mins

TopicTimestamp Ordering

2 / 5

System Clock Timestamps

The Allure of Real Time

Every computer has a clock. It ticks away constantly, tracking the passage of real-world seconds, milliseconds, and nanoseconds. When a database needs unique, monotonically increasing timestamps, the system clock presents an obvious solution: just ask the operating system what time it is.

This approach is intuitive—the clock is already there, already incrementing, already providing a universally understood ordering. When transaction T₁ starts at 10:00:00.000 and T₂ starts at 10:00:00.001, the timestamps directly reflect real-world causality.

But as we'll discover, using system clocks for database timestamps introduces subtle challenges that can undermine correctness. Understanding these challenges—and how to address them—is essential for building reliable timestamp-based systems.

What You Will Learn

By the end of this page, you will understand how system clock timestamps work, the precision and resolution requirements for database use, the challenges of clock skew and synchronization, clock adjustment problems (NTP leaps), and when system clocks are appropriate versus when alternatives are needed.

How System Clocks Work

Before using system clocks for timestamps, we need to understand how computers track time. This involves multiple layers of hardware and software working together.

Hardware Timekeeping:

At the lowest level, computers use hardware oscillators to generate periodic signals:

Real-Time Clock (RTC): A battery-backed chip that maintains time even when powered off. Typically uses a 32.768 kHz crystal oscillator. Drifts several seconds per day.
High-Precision Event Timer (HPET): Modern motherboard timer providing nanosecond-resolution timing for the operating system.
Time Stamp Counter (TSC): A register in modern CPUs that increments with each clock cycle. Provides the finest granularity (sub-nanosecond) but requires calibration.

Software Time Management:

The operating system uses hardware timers to maintain:

System Time: Wall-clock time since a reference epoch (e.g., January 1, 1970 for Unix)
Monotonic Time: A counter that never goes backward, regardless of clock adjustments
High-Resolution Timers: APIs for nanosecond-precision time queries

Common Time APIs Across Operating Systems
OS/Language	Wall Clock API	Monotonic API	Typical Resolution
Linux C	gettimeofday(), clock_gettime(CLOCK_REALTIME)	clock_gettime(CLOCK_MONOTONIC)	Nanoseconds
Windows C++	GetSystemTimeAsFileTime()	QueryPerformanceCounter()	100 nanoseconds / varies
Java	System.currentTimeMillis()	System.nanoTime()	Milliseconds / Nanoseconds
Python	time.time()	time.monotonic()	Microseconds / Nanoseconds
PostgreSQL	now(), statement_timestamp()	pg_catalog.timeofday()	Microseconds

Wall Clock vs Monotonic Clock

For timestamp ordering, you might think the wall clock is ideal because it reflects 'real' time. But the wall clock can jump backward during NTP synchronization! Monotonic clocks never go backward, making them safer for ordering—though they don't correspond to human-readable times. Many databases use a combination: wall clock for human-visible timestamps, monotonic components for ordering guarantees.

Precision and Resolution Requirements

For timestamps to provide unique, ordered identifiers, the clock's resolution must be fine enough that no two transactions receive the same timestamp. This imposes strict requirements on clock precision.

Key Terminology:

Resolution: The smallest time unit the clock can distinguish (e.g., 1 microsecond)
Precision: The consistency of measurements—how much variation occurs when measuring the same interval
Accuracy: How close the clock is to true physical time (matters for wall clocks)

The Uniqueness Challenge:

Consider a database handling 100,000 transactions per second. With millisecond-resolution timestamps:

Each millisecond: 100 transactions on average
Problem: Multiple transactions get the same timestamp

With microsecond resolution:

Each microsecond: 0.1 transactions on average
Most microseconds have 0 or 1 transaction—much better!

Throughput vs Resolution Requirements:

Let's calculate the minimum required resolution for given transaction rates:

Transactions/Second	Minimum Resolution for Uniqueness	Notes
1,000	1 millisecond	Very low volume, any modern clock works
10,000	100 microseconds	Standard OLTP workloads
100,000	10 microseconds	High-performance systems
1,000,000	1 microsecond	Extreme throughput
10,000,000	100 nanoseconds	Requires specialized approaches

Modern hardware easily provides microsecond resolution. Nanosecond resolution is available but less consistent across platforms. For transactions exceeding hardware clock resolution, we need tie-breaking mechanisms.

Resolution Is Not Enough

Even with nanosecond resolution, concurrent calls to the clock API from multiple CPU cores can return the same value. The hardware might increment between reads, but there's no guarantee. Database systems must handle ties explicitly—typically with a secondary counter or by rejecting concurrent timestamp requests until the clock advances.

Handling Timestamp Collisions

When two transactions request timestamps within the same clock tick—or when clock resolution is insufficient—we face timestamp collisions. Since timestamps must be unique by definition, we need systematic strategies to resolve these conflicts.

Strategy 1: Wait for Clock Advance

The simplest approach: if the current clock value equals the last assigned timestamp, wait until it changes.

last_timestamp = 0

function get_timestamp():
    current = system_clock()
    while current <= last_timestamp:
        current = system_clock()  // busy-wait or sleep
    last_timestamp = current
    return current

Pros: Guarantees uniqueness with pure clock values Cons: Limits throughput to clock resolution; can cause contention

Strategy 2: Sub-Clock Counter Extension

Append a secondary counter that increments within each clock tick:

last_clock = 0
sub_counter = 0

function get_timestamp():
    current_clock = system_clock()
    if current_clock == last_clock:
        sub_counter = sub_counter + 1
    else:
        last_clock = current_clock
        sub_counter = 0
    return (current_clock, sub_counter)  // composite timestamp

Pros: Higher throughput; no waiting Cons: Timestamps become composite values; need handling if counter overflows

Strategy 3: Hybrid Logical Clocks (HLC)

A sophisticated approach combining physical time with logical components:

Physical Component (pt): Maximum of local clock and highest pt seen from other nodes
Logical Component (l): Counter reset when pt advances; incremented within same pt

HLC timestamps look like (physical_time, logical_counter) and provide:

Comparable ordering with real time (approximately)
Guaranteed uniqueness
Causality tracking: if A happens-before B, HLC(A) < HLC(B)

This approach is used by CockroachDB, MongoDB, and other distributed databases.

Production Systems Use Composites

Pure system-clock timestamps are rare in high-performance databases. Most production systems use composite approaches: the clock provides the major component for real-time correlation, while counters or node IDs ensure uniqueness. The 'timestamp' becomes a structured value rather than a simple integer, though it still provides total ordering.

Clock Drift and Synchronization

Computer clocks are imperfect—they drift relative to true physical time. A clock that runs 1 part per million (ppm) fast will gain about 86 milliseconds per day. This drift creates challenges for timestamp ordering.

Sources of Clock Drift:

Crystal Oscillator Variation: Temperature changes, aging, manufacturing tolerances cause frequency shifts
Frequency Scaling: Modern CPUs adjust clock speeds for power management, affecting timing
Virtualization: Virtual machines may have less accurate timekeeping than bare metal
Initial Synchronization Delay: Clocks may be significantly off until NTP corrects them after boot

Typical Clock Drift Rates
Hardware/Environment	Typical Drift	Error Per Day	Notes
PC RTC (cheap crystal)	20-100 ppm	1.7-8.6 seconds	Without NTP correction
Server-grade hardware	50-100 ppm	4.3-8.6 seconds	Better crystals, still drifts
GPS-disciplined clock	< 0.001 ppm	< 86 microseconds	Expensive, high-accuracy
Atomic clock (Cesium)	~10⁻¹² ppm	< 1 nanosecond	Laboratory/infrastructure grade
VM guest clock	100-1000 ppm	8.6-86 seconds	Virtualization overhead

Network Time Protocol (NTP):

NTP corrects clock drift by synchronizing with reference time servers. Key characteristics:

Typical accuracy: 1-50 milliseconds over the internet
LAN accuracy: sub-millisecond possible with local NTP servers
Correction methods: gradual frequency adjustment (slewing) or instant jump (stepping)

The Stepping Problem:

When NTP determines the clock is significantly off (typically > 128ms), it may step the clock—instantly changing the time. If the clock moves backward:

Timestamps assigned before the step may be larger than timestamps after
Violates monotonicity—a fundamental timestamp property
Can cause serious consistency issues in timestamp ordering

Most production systems configure NTP to only slew (never step) after initial synchronization, accepting temporary drift in exchange for monotonicity.

Clock Jumps Can Cause Data Corruption

If a database relies on wall-clock timestamps and the clock jumps backward 10 minutes, new transactions get timestamps from 10 minutes ago. They appear 'older' than transactions that already committed, potentially leading to lost updates or phantom reads. This is why monotonic clocks or logical timestamps are often preferred for correctness-critical ordering.

Distributed System Challenges

When multiple database nodes each assign timestamps using their local clocks, the challenges multiply. Different nodes have different clocks, and keeping them perfectly synchronized is physically impossible.

The Fundamental Problem:

Imagine two database nodes, A and B:

Node A's clock runs 5ms fast relative to true time
Node B's clock runs 3ms slow relative to true time
Total skew: 8ms between A and B

User 1 sends transaction T₁ to Node A at true time t=100ms:

Node A sees clock time 105ms, assigns TS(T₁) = 105

User 2 sends transaction T₂ to Node B at true time t=101ms (1ms later):

Node B sees clock time 98ms, assigns TS(T₂) = 98

Result: T₂ (actually later) has a smaller timestamp than T₁.

If T₁ and T₂ both access the same data, the timestamp ordering is inverted from real causality.

Why This Matters:

Consider a scenario where:

User reads their bank balance ($1000) from Node A at TS=105
User initiates withdrawal ($500) on Node B at TS=98

The read appears to have happened "after" the withdrawal in timestamp order, even though it happened before. If the system uses timestamps strictly, the withdrawal might see stale data or the read might miss the withdrawal—both are incorrect.

Approaches to Distributed Timestamps:

Centralized Timestamp Server: A single node assigns all timestamps
- Simple but creates bottleneck and single point of failure
Clock Synchronization Bounds: Characterize maximum clock skew and build protocols around it
- Google Spanner's TrueTime: knows uncertainty bounds, waits them out
Logical Timestamps: Use vector clocks or Lamport clocks instead of physical time
- Guaranteed correct ordering based on causality, not time
Hybrid Logical Clocks: Combine physical and logical components
- Best of both worlds: approximate real-time with correctness guarantees

Google Spanner's TrueTime

Spanner uses GPS receivers and atomic clocks at each datacenter to bound clock uncertainty to a few milliseconds. The TrueTime API returns an interval [earliest, latest] rather than a point. Transactions wait for the uncertainty interval to pass before committing, ensuring that if TS(T₁) < TS(T₂), then T₁ actually committed before T₂ started. This provides external consistency—stronger than serializability—at the cost of commit latency.

Implementation Patterns for System Clock Timestamps

Let's examine concrete implementation patterns used by real database systems to leverage system clocks while mitigating their limitations.

timestamp_generator.py
Python
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
import time
import threading
 
class MonotonicTimestampGenerator:
    """
    Generates monotonically increasing timestamps using
    system clock with tie-breaking.
    """
    def __init__(self):
        self._last_timestamp = 0
        self._lock = threading.Lock()
    
    def get_timestamp(self) -> int:
        """
        Returns a unique, monotonically increasing timestamp.
        Uses microseconds with logical counter for sub-microsecond ordering.
        
        Returns:
            64-bit integer: high 48 bits = microseconds, low 16 bits = counter
        """
        with self._lock:
            # Get current time in microseconds
            current_us = int(time.time() * 1_000_000)
            
            # Extract microseconds and counter from last timestamp
            last_us = self._last_timestamp >> 16
            last_counter = self._last_timestamp & 0xFFFF
            
            if current_us > last_us:
                # Clock advanced - use new time, reset counter
                new_timestamp = (current_us << 16) | 0
            elif current_us == last_us:
                # Same microsecond - increment counter
                if last_counter >= 0xFFFF:
                    # Counter overflow - wait for clock to advance
                    while int(time.time() * 1_000_000) <= current_us:
                        time.sleep(0.000001)  # 1 microsecond
                    current_us = int(time.time() * 1_000_000)
                    new_timestamp = (current_us << 16) | 0
                else:
                    new_timestamp = (current_us << 16) | (last_counter + 1)
            else:
                # Clock went backward! Use last time + 1 counter
                # This handles NTP adjustments gracefully
                new_timestamp = self._last_timestamp + 1
            
            self._last_timestamp = new_timestamp
            return new_timestamp
 
# Usage example
generator = MonotonicTimestampGenerator()
 
# Sequential timestamps are guaranteed unique and increasing
ts1 = generator.get_timestamp()  # e.g., 1704067200000000 << 16 | 0
ts2 = generator.get_timestamp()  # e.g., 1704067200000000 << 16 | 1
ts3 = generator.get_timestamp()  # e.g., 1704067200000001 << 16 | 0
 
print(f"ts1: {ts1}, ts2: {ts2}, ts3: {ts3}")
print(f"Ordering holds: {ts1 < ts2 < ts3}")  # True

Key Implementation Details:

Composite Timestamp Structure: High bits for clock time, low bits for counter
Lock Protection: Ensures atomicity across concurrent threads
Counter Overflow Handling: Waits for clock advance rather than wrapping
Backward Clock Handling: Increments from last timestamp instead of using "past" value
Thread Safety: Critical for multi-threaded database engines

This pattern provides:

Uniqueness: Counter ensures no collisions within same microsecond
Monotonicity: Never returns a smaller value than previously
Approximate real-time: High bits correlate with wall clock
Bounded waiting: Only waits when counter overflows (rare)

When to Use System Clock Timestamps

System clock timestamps are appropriate in specific scenarios. Understanding these helps make informed architectural decisions.

Good Fit for System Clocks

•Single-node databases: No cross-node synchronization needed
•Audit/logging timestamps: Correlation with real time is valuable
•Low-contention workloads: Rare timestamp collisions
•Human-readable requirements: Users need to see real times
•Time-based partitioning: Data organized by calendar time
•Approximate ordering acceptable: Some reordering tolerable

Poor Fit for System Clocks

•Distributed databases: Clock skew causes ordering issues
•High-throughput systems: Collisions become frequent
•Strict serializability required: Cannot tolerate any inversions
•Unstable clock environments: VMs, containers with poor sync
•Cross-datacenter replication: Wide clock skew possible
•Causality-critical applications: Must track happens-before

Real-World Hybrid Approaches

Most production databases don't use pure system clock timestamps. PostgreSQL's transaction IDs are sequential counters. MySQL's InnoDB uses transaction IDs internally, wall clocks for visibility. CockroachDB and Spanner use hybrid logical clocks. The 'timestamp' concept is adapted to each system's needs, often combining clock components with counters, node IDs, or other ordering guarantees.

Summary: System Clock Timestamps

We've thoroughly examined system clock-based timestamp generation. Let's consolidate the essential insights:

Key Takeaways

•System clocks provide intuitive timestamps — They correlate with real-world time and are immediately available on all systems.
•Resolution limits throughput — Microsecond resolution supports ~1M transactions/second before collisions require handling.
•Collision handling is mandatory — Sub-clock counters, busy-waiting, or composite schemes ensure uniqueness.
•Clock drift and jumps threaten monotonicity — NTP adjustments can move clocks backward; systems must handle this.
•Distributed systems face clock skew — Different nodes have different clocks, potentially inverting transaction order.
•Production systems use hybrid approaches — Pure wall-clock timestamps are rare; counters, HLC, or bounded uncertainty augment clocks.

What's Next:

System clocks are one approach to timestamp generation. An alternative avoids clock complexity entirely: logical counters provide guaranteed uniqueness and monotonicity without any dependency on physical time. We'll explore this elegant alternative next.

Page Complete

You now understand the mechanics, challenges, and trade-offs of system clock-based timestamps. From hardware oscillators through NTP synchronization to distributed skew, you can analyze whether clock-based timestamps suit a given application. Next, we'll examine the simpler, more reliable alternative: logical counters.

2 / 5

Loading learning content...

Database Management SystemsTimestamp Ordering

Timestamp Ordering Protocol

LevelIntermediate

Duration60 mins

TopicTimestamp Ordering

2 / 5

System Clock Timestamps

The Allure of Real Time

What You Will Learn

How System Clocks Work

Before using system clocks for timestamps, we need to understand how computers track time. This involves multiple layers of hardware and software working together.

Hardware Timekeeping:

At the lowest level, computers use hardware oscillators to generate periodic signals:

Real-Time Clock (RTC): A battery-backed chip that maintains time even when powered off. Typically uses a 32.768 kHz crystal oscillator. Drifts several seconds per day.
High-Precision Event Timer (HPET): Modern motherboard timer providing nanosecond-resolution timing for the operating system.
Time Stamp Counter (TSC): A register in modern CPUs that increments with each clock cycle. Provides the finest granularity (sub-nanosecond) but requires calibration.

Software Time Management:

The operating system uses hardware timers to maintain:

System Time: Wall-clock time since a reference epoch (e.g., January 1, 1970 for Unix)
Monotonic Time: A counter that never goes backward, regardless of clock adjustments
High-Resolution Timers: APIs for nanosecond-precision time queries

Common Time APIs Across Operating Systems
OS/Language	Wall Clock API	Monotonic API	Typical Resolution
Linux C	gettimeofday(), clock_gettime(CLOCK_REALTIME)	clock_gettime(CLOCK_MONOTONIC)	Nanoseconds
Windows C++	GetSystemTimeAsFileTime()	QueryPerformanceCounter()	100 nanoseconds / varies
Java	System.currentTimeMillis()	System.nanoTime()	Milliseconds / Nanoseconds
Python	time.time()	time.monotonic()	Microseconds / Nanoseconds
PostgreSQL	now(), statement_timestamp()	pg_catalog.timeofday()	Microseconds

Wall Clock vs Monotonic Clock

Precision and Resolution Requirements

Key Terminology:

Resolution: The smallest time unit the clock can distinguish (e.g., 1 microsecond)
Precision: The consistency of measurements—how much variation occurs when measuring the same interval
Accuracy: How close the clock is to true physical time (matters for wall clocks)

The Uniqueness Challenge:

Consider a database handling 100,000 transactions per second. With millisecond-resolution timestamps:

Each millisecond: 100 transactions on average
Problem: Multiple transactions get the same timestamp

With microsecond resolution:

Each microsecond: 0.1 transactions on average
Most microseconds have 0 or 1 transaction—much better!

Throughput vs Resolution Requirements:

Let's calculate the minimum required resolution for given transaction rates:

Transactions/Second	Minimum Resolution for Uniqueness	Notes
1,000	1 millisecond	Very low volume, any modern clock works
10,000	100 microseconds	Standard OLTP workloads
100,000	10 microseconds	High-performance systems
1,000,000	1 microsecond	Extreme throughput
10,000,000	100 nanoseconds	Requires specialized approaches

Resolution Is Not Enough

Handling Timestamp Collisions

Strategy 1: Wait for Clock Advance

The simplest approach: if the current clock value equals the last assigned timestamp, wait until it changes.

last_timestamp = 0

function get_timestamp():
    current = system_clock()
    while current <= last_timestamp:
        current = system_clock()  // busy-wait or sleep
    last_timestamp = current
    return current

Pros: Guarantees uniqueness with pure clock values Cons: Limits throughput to clock resolution; can cause contention

Strategy 2: Sub-Clock Counter Extension

Append a secondary counter that increments within each clock tick:

last_clock = 0
sub_counter = 0

function get_timestamp():
    current_clock = system_clock()
    if current_clock == last_clock:
        sub_counter = sub_counter + 1
    else:
        last_clock = current_clock
        sub_counter = 0
    return (current_clock, sub_counter)  // composite timestamp

Pros: Higher throughput; no waiting Cons: Timestamps become composite values; need handling if counter overflows

Strategy 3: Hybrid Logical Clocks (HLC)

A sophisticated approach combining physical time with logical components:

Physical Component (pt): Maximum of local clock and highest pt seen from other nodes
Logical Component (l): Counter reset when pt advances; incremented within same pt

HLC timestamps look like (physical_time, logical_counter) and provide:

Comparable ordering with real time (approximately)
Guaranteed uniqueness
Causality tracking: if A happens-before B, HLC(A) < HLC(B)

This approach is used by CockroachDB, MongoDB, and other distributed databases.

Production Systems Use Composites

Clock Drift and Synchronization

Sources of Clock Drift:

Crystal Oscillator Variation: Temperature changes, aging, manufacturing tolerances cause frequency shifts
Frequency Scaling: Modern CPUs adjust clock speeds for power management, affecting timing
Virtualization: Virtual machines may have less accurate timekeeping than bare metal
Initial Synchronization Delay: Clocks may be significantly off until NTP corrects them after boot

Typical Clock Drift Rates
Hardware/Environment	Typical Drift	Error Per Day	Notes
PC RTC (cheap crystal)	20-100 ppm	1.7-8.6 seconds	Without NTP correction
Server-grade hardware	50-100 ppm	4.3-8.6 seconds	Better crystals, still drifts
GPS-disciplined clock	< 0.001 ppm	< 86 microseconds	Expensive, high-accuracy
Atomic clock (Cesium)	~10⁻¹² ppm	< 1 nanosecond	Laboratory/infrastructure grade
VM guest clock	100-1000 ppm	8.6-86 seconds	Virtualization overhead

Network Time Protocol (NTP):

NTP corrects clock drift by synchronizing with reference time servers. Key characteristics:

Typical accuracy: 1-50 milliseconds over the internet
LAN accuracy: sub-millisecond possible with local NTP servers
Correction methods: gradual frequency adjustment (slewing) or instant jump (stepping)

The Stepping Problem:

When NTP determines the clock is significantly off (typically > 128ms), it may step the clock—instantly changing the time. If the clock moves backward:

Timestamps assigned before the step may be larger than timestamps after
Violates monotonicity—a fundamental timestamp property
Can cause serious consistency issues in timestamp ordering

Most production systems configure NTP to only slew (never step) after initial synchronization, accepting temporary drift in exchange for monotonicity.

Clock Jumps Can Cause Data Corruption

Distributed System Challenges

The Fundamental Problem:

Imagine two database nodes, A and B:

Node A's clock runs 5ms fast relative to true time
Node B's clock runs 3ms slow relative to true time
Total skew: 8ms between A and B

User 1 sends transaction T₁ to Node A at true time t=100ms:

Node A sees clock time 105ms, assigns TS(T₁) = 105

User 2 sends transaction T₂ to Node B at true time t=101ms (1ms later):

Node B sees clock time 98ms, assigns TS(T₂) = 98

Result: T₂ (actually later) has a smaller timestamp than T₁.

If T₁ and T₂ both access the same data, the timestamp ordering is inverted from real causality.

Why This Matters:

Consider a scenario where:

User reads their bank balance ($1000) from Node A at TS=105
User initiates withdrawal ($500) on Node B at TS=98

Approaches to Distributed Timestamps:

Centralized Timestamp Server: A single node assigns all timestamps
- Simple but creates bottleneck and single point of failure
Clock Synchronization Bounds: Characterize maximum clock skew and build protocols around it
- Google Spanner's TrueTime: knows uncertainty bounds, waits them out
Logical Timestamps: Use vector clocks or Lamport clocks instead of physical time
- Guaranteed correct ordering based on causality, not time
Hybrid Logical Clocks: Combine physical and logical components
- Best of both worlds: approximate real-time with correctness guarantees

Google Spanner's TrueTime

Implementation Patterns for System Clock Timestamps

Let's examine concrete implementation patterns used by real database systems to leverage system clocks while mitigating their limitations.

timestamp_generator.py
Python
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
import time
import threading
 
class MonotonicTimestampGenerator:
    """
    Generates monotonically increasing timestamps using
    system clock with tie-breaking.
    """
    def __init__(self):
        self._last_timestamp = 0
        self._lock = threading.Lock()
    
    def get_timestamp(self) -> int:
        """
        Returns a unique, monotonically increasing timestamp.
        Uses microseconds with logical counter for sub-microsecond ordering.
        
        Returns:
            64-bit integer: high 48 bits = microseconds, low 16 bits = counter
        """
        with self._lock:
            # Get current time in microseconds
            current_us = int(time.time() * 1_000_000)
            
            # Extract microseconds and counter from last timestamp
            last_us = self._last_timestamp >> 16
            last_counter = self._last_timestamp & 0xFFFF
            
            if current_us > last_us:
                # Clock advanced - use new time, reset counter
                new_timestamp = (current_us << 16) | 0
            elif current_us == last_us:
                # Same microsecond - increment counter
                if last_counter >= 0xFFFF:
                    # Counter overflow - wait for clock to advance
                    while int(time.time() * 1_000_000) <= current_us:
                        time.sleep(0.000001)  # 1 microsecond
                    current_us = int(time.time() * 1_000_000)
                    new_timestamp = (current_us << 16) | 0
                else:
                    new_timestamp = (current_us << 16) | (last_counter + 1)
            else:
                # Clock went backward! Use last time + 1 counter
                # This handles NTP adjustments gracefully
                new_timestamp = self._last_timestamp + 1
            
            self._last_timestamp = new_timestamp
            return new_timestamp
 
# Usage example
generator = MonotonicTimestampGenerator()
 
# Sequential timestamps are guaranteed unique and increasing
ts1 = generator.get_timestamp()  # e.g., 1704067200000000 << 16 | 0
ts2 = generator.get_timestamp()  # e.g., 1704067200000000 << 16 | 1
ts3 = generator.get_timestamp()  # e.g., 1704067200000001 << 16 | 0
 
print(f"ts1: {ts1}, ts2: {ts2}, ts3: {ts3}")
print(f"Ordering holds: {ts1 < ts2 < ts3}")  # True

Key Implementation Details:

Composite Timestamp Structure: High bits for clock time, low bits for counter
Lock Protection: Ensures atomicity across concurrent threads
Counter Overflow Handling: Waits for clock advance rather than wrapping
Backward Clock Handling: Increments from last timestamp instead of using "past" value
Thread Safety: Critical for multi-threaded database engines

This pattern provides:

Uniqueness: Counter ensures no collisions within same microsecond
Monotonicity: Never returns a smaller value than previously
Approximate real-time: High bits correlate with wall clock
Bounded waiting: Only waits when counter overflows (rare)

When to Use System Clock Timestamps

System clock timestamps are appropriate in specific scenarios. Understanding these helps make informed architectural decisions.

Good Fit for System Clocks

•Single-node databases: No cross-node synchronization needed
•Audit/logging timestamps: Correlation with real time is valuable
•Low-contention workloads: Rare timestamp collisions
•Human-readable requirements: Users need to see real times
•Time-based partitioning: Data organized by calendar time
•Approximate ordering acceptable: Some reordering tolerable

Poor Fit for System Clocks

•Distributed databases: Clock skew causes ordering issues
•High-throughput systems: Collisions become frequent
•Strict serializability required: Cannot tolerate any inversions
•Unstable clock environments: VMs, containers with poor sync
•Cross-datacenter replication: Wide clock skew possible
•Causality-critical applications: Must track happens-before

Real-World Hybrid Approaches

Summary: System Clock Timestamps

We've thoroughly examined system clock-based timestamp generation. Let's consolidate the essential insights:

Key Takeaways

•System clocks provide intuitive timestamps — They correlate with real-world time and are immediately available on all systems.
•Resolution limits throughput — Microsecond resolution supports ~1M transactions/second before collisions require handling.
•Collision handling is mandatory — Sub-clock counters, busy-waiting, or composite schemes ensure uniqueness.
•Clock drift and jumps threaten monotonicity — NTP adjustments can move clocks backward; systems must handle this.
•Distributed systems face clock skew — Different nodes have different clocks, potentially inverting transaction order.
•Production systems use hybrid approaches — Pure wall-clock timestamps are rare; counters, HLC, or bounded uncertainty augment clocks.

What's Next:

Page Complete

2 / 5