When you send a message and see that single checkmark appear, what exactly does it mean? Behind that simple UI indicator lies one of the most challenging problems in distributed systems: guaranteeing that your message will reach its destination, exactly once, in the correct order, regardless of network failures, server crashes, or device disconnections.
The fundamental challenge is that we're building reliable communication on inherently unreliable foundations. Networks drop packets. Servers crash mid-operation. Mobile devices lose connectivity in tunnels. And yet, users expect their messages to 'just work' with 100% reliability.
This page explores the intricate machinery that makes reliable messaging possible, examining the theoretical foundations, the practical engineering patterns, and the subtle trade-offs that distinguish amateur implementations from production-grade systems processing 100 billion messages daily.
By the end of this page, you will understand the different delivery semantics (at-most-once, at-least-once, exactly-once), how acknowledgment flows work in practice, strategies for message ordering across distributed systems, and the storage patterns that enable durability guarantees. You'll gain the vocabulary and mental models to discuss these concepts confidently in any system design context.
Before diving into implementation, we must understand the theoretical framework that governs message delivery. Distributed systems theory defines three fundamental delivery semantics, each with distinct guarantees and trade-offs:
| Semantic | Guarantee | Risk | Complexity |
|---|---|---|---|
| At-Most-Once | Message delivered 0 or 1 times | Message may be lost | Lowest |
| At-Least-Once | Message delivered 1 or more times | Message may be duplicated | Medium |
| Exactly-Once | Message delivered exactly 1 time | None (ideal) | Highest |
The simplest approach: send the message and don't worry about confirmation. If it arrives, great. If not, we don't retry.
Implementation: Fire-and-forget. Send one UDP packet or TCP request with no acknowledgment handling.
Example use case: Telemetry data, analytics events where occasional loss is acceptable.
Why it fails for messaging: Users absolutely cannot accept lost messages. A message saying 'I'll pick you up from the airport' that never arrives has real-world consequences.
We keep retrying until we get confirmation. This guarantees delivery but may result in duplicates if acknowledgments are lost.
Implementation:
1. Send message
2. Wait for acknowledgment
3. If no acknowledgment within timeout, retry from step 1
4. Stop after receiving acknowledgment
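The loop above can be sketched as follows. This is a minimal illustration, assuming a `sendOnce` function that performs one network attempt and rejects on timeout; the name and the attempt cap are illustrative, not from any specific implementation:

```typescript
// At-least-once sender: retry the same message until an ACK arrives.
// `sendOnce` is an assumed placeholder for a single network attempt
// that resolves with an ACK or rejects on timeout/failure.
type Ack = { messageId: string };

async function sendAtLeastOnce(
  messageId: string,
  sendOnce: (id: string) => Promise<Ack>,
  maxAttempts = 5,
): Promise<Ack> {
  let lastError: unknown = new Error("not attempted");
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    try {
      return await sendOnce(messageId); // steps 1-2: send, wait for ACK
    } catch (err) {
      lastError = err; // step 3: no ACK within timeout, loop to retry
    }
  }
  throw lastError; // retries exhausted; caller surfaces a "Failed" state
}
```

Note that every retry carries the same `messageId`; that reuse is what later lets the server recognize and discard the duplicates this strategy can create.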
The duplicate problem: Suppose the server receives the message, stores it, but the acknowledgment back to the client is lost. The client retries, the server receives it again, and now we have a duplicate.
Why it's problematic for messaging: Seeing the same message twice is confusing and annoying. In group chats, it could mean 999 duplicate notifications.
The holy grail: every message arrives exactly once, never lost, never duplicated.
The hard truth: True exactly-once delivery is theoretically impossible in asynchronous distributed systems with arbitrary failures. The Two Generals' Problem proves this mathematically.
The practical solution: We implement 'effectively exactly-once' by combining at-least-once delivery (retry until acknowledged) with idempotent processing (deduplicate by unique message ID).
For users, the experience is exactly-once. Internally, we may process the same message multiple times but only apply it once.
When asked about exactly-once delivery in interviews, demonstrate sophistication by saying: 'True exactly-once is impossible in distributed systems. We achieve effectively-exactly-once through at-least-once delivery combined with idempotent processing using unique message IDs.' This shows you understand both the theory and practical engineering.
Understanding the complete lifecycle of a message is essential for designing reliable delivery. Let's trace the journey of a single message from sender to recipient:
```
SENDER'S DEVICE                     SERVER                      RECIPIENT'S DEVICE
━━━━━━━━━━━━━━━━━                   ━━━━━━                      ━━━━━━━━━━━━━━━━━━

1. User types message
   └── [Draft stored locally]

2. User presses send
   └── [Clock icon: Pending]
   │
   ├──── CREATE(msg_id, content, timestamp) ────────►
   │                                │
   │                                3. Validate & persist
   │                                └── [Write to queue]
   │                                └── [Write to DB]
   │                                │
   │ ◄──────── ACK(msg_id, "ACCEPTED") ──────────────┤
   │
   └── [Single ✓: Sent]
                                    │
                                    4. Route to recipient
                                    └── [Find recipient connection]
                                    └── [Push if online]
                                    │
                                    ├────── PUSH(msg_id, content) ────────►
                                    │                            │
                                    │                            5. Recipient receives
                                    │                            └── [Store locally]
                                    │                            └── [Display in UI]
                                    │                            └── [Notify user]
                                    │                            │
                                    │ ◄─── ACK(msg_id, "DELIVERED") ──────┤
                                    │
                                    6. Record delivery
                                    └── [Update message status]
   │                                │
   │ ◄────── NOTIFY(msg_id, "DELIVERED") ────────────┤
   │
   └── [Double ✓✓: Delivered]

                                    [... User opens chat ...]
                                    │                            │
                                    │ ◄──── SEEN(msg_id, timestamp) ──────┤
   │                                │
   │ ◄────── NOTIFY(msg_id, "READ") ─────────────────┤
   │
   └── [Blue ✓✓: Read]
```

Five distinct states: A message transitions through multiple states, each representing a durability guarantee:
| State | Indicator | Guarantee |
|---|---|---|
| Pending | ⏱️ Clock | Message constructed, not yet sent to server |
| Sent | ✓ Single check | Server has accepted and persisted the message |
| Delivered | ✓✓ Double check | Recipient's device has received and acknowledged |
| Read | ✓✓ Blue checks | Recipient has viewed the message in the chat |
| Failed | ❌ Error | Sending failed; retry required |
Each transition requires network round-trip: This means each state change is vulnerable to network failures, requiring careful timeout and retry logic.
The 'Sent' checkpoint is critical: Once the server acknowledges receipt (single checkmark), the message is durably stored. From this point, delivery to the recipient is the server's responsibility, not the sender's device. The sender can go offline, and the message will still eventually reach the recipient.
WhatsApp and modern messengers use 'optimistic UI'—the message appears in chat immediately upon hitting send, even before server acknowledgment. This makes the app feel instant. If sending fails, the message is marked with an error indicator and retry option. This UX pattern requires tracking local message state carefully.
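As a sketch, that local state tracking might look like the following. The state names mirror the table above; the transition rules are illustrative assumptions, not any specific app's implementation:

```typescript
// Sender-side message states, matching the lifecycle table above.
type MessageState = "PENDING" | "SENT" | "DELIVERED" | "READ" | "FAILED";

// Allowed transitions (assumed for illustration).
const transitions: Record<MessageState, MessageState[]> = {
  PENDING: ["SENT", "FAILED"], // server ACK, or retries exhausted
  SENT: ["DELIVERED"],         // recipient device ACK
  DELIVERED: ["READ"],         // read receipt
  READ: [],                    // terminal state
  FAILED: ["PENDING"],         // manual retry re-enters the pipeline
};

function advance(current: MessageState, next: MessageState): MessageState {
  if (!transitions[current].includes(next)) {
    // Ignore stale or out-of-order notifications
    // (e.g. a late DELIVERED arriving after READ).
    return current;
  }
  return next;
}
```

Guarding transitions this way matters because receipts themselves can arrive out of order over the network.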
The foundation of idempotent delivery is the ability to uniquely identify every message. If we can't distinguish between 'same message retried' and 'different message with same content,' we cannot implement exactly-once semantics.
```typescript
// Strategy 1: UUID v4 (Random)
// ─────────────────────────────
// Pros: Truly random, extremely low collision probability
// Cons: No ordering, 128 bits (16 bytes), not human-readable
import { v4 as uuidv4 } from 'uuid';
const messageId = uuidv4(); // "550e8400-e29b-41d4-a716-446655440000"

// Strategy 2: Snowflake ID (Twitter's approach)
// ─────────────────────────────────────────────
// 64-bit ID: timestamp (41b) + datacenter (5b) + worker (5b) + sequence (12b)
// Pros: Time-ordered, compact, high throughput (4096 IDs/ms per worker)
// Cons: Requires worker ID coordination
function generateSnowflakeId(datacenterId: number, workerId: number): bigint {
  const timestamp = BigInt(Date.now() - EPOCH); // Custom epoch
  const sequence = getNextSequence();           // Increment per millisecond
  return (timestamp << 22n) |
         (BigInt(datacenterId) << 17n) |
         (BigInt(workerId) << 12n) |
         BigInt(sequence);
}

// Strategy 3: ULID (Universally Unique Lexicographically Sortable ID)
// ──────────────────────────────────────────────────────────────────
// 128-bit: timestamp (48b) + randomness (80b)
// Pros: Time-ordered, sortable as strings, no coordination needed
// Cons: Slightly longer than Snowflake
import { ulid } from 'ulid';
const messageId = ulid(); // "01ARZ3NDEKTSV4RRFFQ69G5FAV"

// Strategy 4: Composite ID (Recommended for messaging)
// ────────────────────────────────────────────────────
// Combine device ID + local counter + timestamp
const messageId = `${deviceId}_${localCounter}_${Date.now()}`;
// This allows:
// - Client-side generation without server coordination
// - Natural ordering within a device's messages
// - Deduplication based on device + sequence
```

A critical design decision is where to generate message IDs:
Server-side generation (problematic):
1. Client sends the message content to the server
2. Server persists the message and generates an ID
3. Server returns the ID to the client in the acknowledgment
Problem: If network fails between steps 2 and 3, client doesn't know if message was stored. Retry means potential duplicate, but client has no ID to deduplicate.
Client-side generation (preferred): the client generates a globally unique ID (UUID, ULID, or composite device ID) before the first send attempt and attaches the same ID to every retry.
Advantage: If acknowledgment is lost, client can safely retry with the same ID. Server detects duplicate by ID and acknowledges idempotently without creating duplicates.
This pattern is fundamental to exactly-once semantics in any distributed system.
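A toy illustration of why client-generated IDs make retries safe. The in-memory `Map` stands in for the server's durable message store, and the composite ID format follows Strategy 4 above:

```typescript
// The Map stands in for the server's durable message store.
const serverStore = new Map<string, string>();

// Server-side handler: store once, acknowledge duplicates idempotently.
function serverReceive(messageId: string, content: string): "NEW" | "DUPLICATE" {
  if (serverStore.has(messageId)) return "DUPLICATE"; // ACK again, store nothing
  serverStore.set(messageId, content);
  return "NEW";
}

// The client generates the ID once, before the first attempt...
const messageId = `device42_17_${Date.now()}`; // composite ID (illustrative)

// ...so a retry after a lost ACK reuses it safely:
const first = serverReceive(messageId, "see you at 6"); // "NEW": stored
const retry = serverReceive(messageId, "see you at 6"); // "DUPLICATE": acked, not stored
```

The key property: the retry is indistinguishable from the original on the wire, so the server's ID check alone decides whether to store it.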
With 100 billion messages/day, even astronomically unlikely collisions happen eventually. UUIDs have ~10^-37 collision probability per pair, but at 10^11 messages/day across years, we must handle collisions gracefully. Defense: treat ID + sender ID as the unique key, not just ID alone.
Acknowledgments (ACKs) are the heartbeat of reliable messaging. Every state transition in the message lifecycle depends on an acknowledgment. Let's design a robust ACK protocol.
We need multiple acknowledgment types to represent different states:
| ACK Type | Sender | Meaning | Action on Receipt |
|---|---|---|---|
| ACCEPTED | Server → Sender | Message durably stored on server | Show single checkmark; stop retrying |
| DELIVERED | Recipient → Server | Message received by recipient device | Server notifies sender; update UI |
| SEEN | Recipient → Server | Recipient opened the chat | Server notifies sender; show read receipts |
| REJECTED | Server → Sender | Message rejected (spam, blocked, etc.) | Show error; do not retry automatically |
| FAILED | Server → Sender | Delivery failed after all attempts | Show failure; offer manual retry |
```protobuf
// Protocol Buffer definition for acknowledgments
message Acknowledgment {
  string message_id = 1;           // Which message this ACK is for
  AckType type = 2;                // Type of acknowledgment
  int64 timestamp = 3;             // When the event occurred
  string sender_id = 4;            // Who sent this ACK
  optional string error_code = 5;  // Error details if REJECTED/FAILED
}

enum AckType {
  ACK_UNKNOWN = 0;
  ACK_ACCEPTED = 1;   // Server accepted message
  ACK_DELIVERED = 2;  // Recipient device received
  ACK_SEEN = 3;       // Recipient viewed message
  ACK_REJECTED = 4;   // Message rejected by server
  ACK_FAILED = 5;     // Delivery failed permanently
}

// Batch acknowledgments for efficiency
message BatchAcknowledgment {
  repeated Acknowledgment acks = 1;
}

// Example: Acknowledge 10 "delivered" receipts in one network call
// This reduces connections by 10x for high-volume scenarios
```

Sending individual acknowledgments for each message creates massive overhead:
Solution: Batched acknowledgments
Typical configuration: buffer outgoing acknowledgments and flush on a short timer (for example, every 500ms) or when the batch reaches a size cap, whichever comes first.
This reduces network calls by 10-50x while adding only 250ms average latency to receipt delivery—acceptable for delivery/read receipts that aren't latency-critical.
TCP uses cumulative acknowledgments: ACK(100) means 'I've received everything up to byte 100.' Apply this to messaging: if messages are sequence-numbered per conversation, ACK(seq=47) could mean 'I've received all messages through sequence 47.' This dramatically reduces ACK traffic for catch-up scenarios.
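A sketch of how a receiver could maintain that cumulative ACK value per conversation (class and method names are illustrative):

```typescript
// Cumulative acknowledgment tracker: ACK(n) means "everything through
// sequence n received", so one ACK can cover many messages.
class CumulativeAcker {
  private highestContiguous = 0;
  private outOfOrder = new Set<number>();

  // Record a received message; returns the sequence number to ACK.
  receive(seq: number): number {
    if (seq === this.highestContiguous + 1) {
      this.highestContiguous = seq;
      // Absorb any buffered successors into the contiguous prefix.
      while (this.outOfOrder.has(this.highestContiguous + 1)) {
        this.outOfOrder.delete(this.highestContiguous + 1);
        this.highestContiguous++;
      }
    } else if (seq > this.highestContiguous) {
      this.outOfOrder.add(seq); // gap: hold until predecessors arrive
    }
    return this.highestContiguous;
  }
}
```

During catch-up after reconnection, the receiver can send a single ACK for the final contiguous sequence instead of one per message.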
When sending fails, we must retry. But naive retry strategies can overwhelm servers during recovery or create thundering herds. Sophisticated retry logic is essential for system stability.
The industry-standard retry strategy combines exponential backoff (increasing delays) with jitter (randomization) to prevent synchronized retries:
```typescript
interface RetryConfig {
  initialDelayMs: number;   // First retry delay
  maxDelayMs: number;       // Cap on delay growth
  exponentialBase: number;  // Multiplier per retry
  maxRetries: number;       // Give up after this many
  jitterFactor: number;     // 0-1, randomization amount
}

const DEFAULT_CONFIG: RetryConfig = {
  initialDelayMs: 500,   // Start at 500ms
  maxDelayMs: 60000,     // Cap at 1 minute
  exponentialBase: 2,    // Double each time
  maxRetries: 10,        // Try up to 10 times
  jitterFactor: 0.3,     // ±30% randomization
};

function calculateRetryDelay(attempt: number, config: RetryConfig): number {
  // Exponential growth: delay = initial * base^attempt
  const exponentialDelay =
    config.initialDelayMs * Math.pow(config.exponentialBase, attempt);

  // Cap at maximum
  const cappedDelay = Math.min(exponentialDelay, config.maxDelayMs);

  // Add jitter: ±jitterFactor around the delay
  const jitterRange = cappedDelay * config.jitterFactor;
  const jitter = (Math.random() * 2 - 1) * jitterRange; // Random in [-range, +range]

  return Math.max(0, cappedDelay + jitter);
}

// Example retry sequence (approximate, varies with jitter):
// Attempt 1: ~500ms
// Attempt 2: ~1000ms
// Attempt 3: ~2000ms
// Attempt 4: ~4000ms
// Attempt 5: ~8000ms
// Attempt 6+: ~60000ms (capped)

async function sendWithRetry<T>(
  operation: () => Promise<T>,
  config: RetryConfig = DEFAULT_CONFIG
): Promise<T> {
  let lastError: Error = new Error('no attempts made');

  for (let attempt = 0; attempt < config.maxRetries; attempt++) {
    try {
      return await operation();
    } catch (error) {
      lastError = error as Error;

      // Don't retry non-retryable errors
      if (isNonRetryable(error)) {
        throw error;
      }

      const delay = calculateRetryDelay(attempt, config);
      console.log(`Retry attempt ${attempt + 1} after ${delay}ms`);
      await sleep(delay);
    }
  }

  throw new Error(`Operation failed after ${config.maxRetries} retries: ${lastError.message}`);
}

function isNonRetryable(error: any): boolean {
  // These errors won't be fixed by retrying
  return [
    'INVALID_MESSAGE',
    'USER_BLOCKED',
    'GROUP_NOT_FOUND',
    'AUTHENTICATION_FAILED',
  ].includes(error.code);
}

function sleep(ms: number): Promise<void> {
  return new Promise((resolve) => setTimeout(resolve, ms));
}
```

| Error Category | Examples | Retry Strategy |
|---|---|---|
| Transient Network | Timeout, connection reset, 503 Service Unavailable | Retry with exponential backoff |
| Rate Limiting | 429 Too Many Requests | Retry after Retry-After header delay |
| Server Error | 500 Internal Server Error | Retry with backoff (server may recover) |
| Client Error | 400 Bad Request, 401 Unauthorized | Do NOT retry (request itself is wrong) |
| Business Logic | User blocked, group full | Do NOT retry (semantically correct failure) |
Without jitter, all clients that failed at 10:00:00 will retry at exactly 10:00:00.5, then 10:00:01.5, etc. This creates synchronized waves that overwhelm the recovering server. Jitter spreads these retries randomly, converting a spike into a gradual flow. Always use jitter in production retry logic.
Messages in a conversation must appear in a meaningful order. But defining 'correct order' in a distributed system is surprisingly complex. Different scenarios require different ordering guarantees.
Total Order: All parties see messages in exactly the same sequence.
Causal Order: If message B is a reply to message A, everyone sees A before B.
FIFO Order (per-sender): Messages from the same sender appear in send order.
No Ordering: Messages may arrive in any order.
```
SCENARIO 1: FIFO VIOLATION (Unacceptable)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Alice sends:  [1] "Hey!" → [2] "How are you?" → [3] "Want to grab lunch?"
Bob receives: [2] "How are you?" → [1] "Hey!" → [3] "Want to grab lunch?"

Confusing! Bob sees "How are you?" before "Hey!"

SCENARIO 2: CAUSALITY VIOLATION (Unacceptable)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Alice: [1] "What's the capital of France?"
Bob:   [2] "Paris" (reply to message 1)

Charlie sees: [2] "Paris" → [1] "What's the capital of France?"

Completely confusing! Answer appears before question.

SCENARIO 3: ACCEPTABLE TOTAL ORDER DIFFERENCE
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
At 10:00:00.100, Alice sends "Hello"
At 10:00:00.150, Bob sends "Hi there" (independent, not a reply)

Alice sees: "Hello" → "Hi there"
Bob sees:   "Hi there" → "Hello"

These are causally unrelated (sent concurrently without knowledge of each other).
Different ordering is acceptable, though showing same order is nicer.
```

For messaging, causal ordering is the sweet spot: strong enough for coherent conversations, practical enough to implement.
Technique: Lamport Timestamps + Causal Dependencies. Each message carries a logical clock value greater than that of every message its sender has already seen, so a reply always carries a larger timestamp than the message it answers; receivers sort (or briefly delay) messages accordingly.
Simplified approach for 1:1 chat: each sender numbers its own messages with a monotonically increasing sequence, and the receiver displays messages in sequence order, briefly buffering any message that arrives before its predecessor.
This simpler approach works because in 1:1 chat, there are only two senders, making ordering much easier than in general distributed systems.
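That per-sender sequencing can be sketched on the receiving side as follows (the class name and buffering approach are illustrative):

```typescript
// Per-sender FIFO: the receiver releases seq n+1 only after seq n,
// buffering any message that arrives ahead of a gap.
class PerSenderReorderBuffer {
  private nextSeq = 1;
  private pending = new Map<number, string>();
  readonly delivered: string[] = [];

  receive(seq: number, content: string): void {
    if (seq < this.nextSeq || this.pending.has(seq)) return; // duplicate, drop
    this.pending.set(seq, content);
    // Release the longest in-order prefix we now have.
    while (this.pending.has(this.nextSeq)) {
      this.delivered.push(this.pending.get(this.nextSeq)!);
      this.pending.delete(this.nextSeq);
      this.nextSeq++;
    }
  }
}

const buf = new PerSenderReorderBuffer();
buf.receive(2, "How are you?");          // held back: seq 1 is missing
buf.receive(1, "Hey!");                  // releases seq 1, then seq 2
buf.receive(3, "Want to grab lunch?");   // in order, released immediately
// buf.delivered → ["Hey!", "How are you?", "Want to grab lunch?"]
```

This directly repairs the FIFO violation from Scenario 1 above: even if "How are you?" arrives first, Bob never sees it before "Hey!".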
For group chats, strict causal ordering across all members is expensive. A practical compromise: use server receive time as the ordering key. Since all messages flow through the server, its clock provides a total order. This sacrifices true causality but is 'good enough' for most conversations and much simpler to implement.
Once a message is accepted by the server, it must survive server crashes, disk failures, and data center outages. This durability guarantee is what the 'single checkmark' represents.
Before acknowledging a message, we must ensure it survives crashes:
1. Receive message from client
2. Write to WAL (append-only log on durable storage)
3. Acknowledge to client (single checkmark)
4. [Later, asynchronously] Move from WAL to permanent storage
The WAL guarantees that even if the server crashes between steps 3 and 4, we can recover all acknowledged messages by replaying the log.
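The pattern can be sketched as follows. The `string[]` array stands in for an fsync'd append-only file on disk, so only the sequencing (append, then acknowledge, then compact) is the point; everything else is an illustrative assumption:

```typescript
// WAL pattern sketch: durable append BEFORE acknowledging, replay on restart.
interface StoredMessage { id: string; content: string }

class WalBackedStore {
  private wal: string[] = [];                       // stands in for the durable log
  private db = new Map<string, StoredMessage>();    // permanent storage

  accept(msg: StoredMessage): "ACCEPTED" {
    this.wal.push(JSON.stringify(msg)); // step 2: durable before ACK
    return "ACCEPTED";                  // step 3: safe to show the checkmark
  }

  compact(): void {
    // Step 4: later, copy WAL entries into permanent storage.
    // (A real system would then truncate the log; omitted here.)
    for (const line of this.wal) {
      const msg: StoredMessage = JSON.parse(line);
      this.db.set(msg.id, msg);
    }
  }

  recover(): StoredMessage[] {
    // After a crash between steps 3 and 4, replay the log.
    return this.wal.map((line) => JSON.parse(line) as StoredMessage);
  }

  persistedCount(): number {
    return this.db.size;
  }
}
```

The ordering is the whole guarantee: because `accept` appends before returning, every message the client saw acknowledged is recoverable from the log.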
A single disk can fail. True durability requires replication:
| Strategy | Durability | Latency | Complexity |
|---|---|---|---|
| Single node | Low (disk failure = data loss) | Lowest | Lowest |
| Synchronous replication (2/3 nodes) | High (survives 1 node failure) | Medium (+5-50ms) | Medium |
| Synchronous replication (3/5 nodes) | Very high (survives 2 node failures) | Higher | Higher |
| Asynchronous replication | Medium (may lose recent data) | Lowest | Medium |
| Raft/Paxos consensus | Very high (formally proven) | Medium-High | Highest |
```
CLIENT              PRIMARY             REPLICA-1           REPLICA-2
  │                    │                    │                    │
  │──SEND(msg)────────►│                    │                    │
  │                    │──REPLICATE────────►│                    │
  │                    │──REPLICATE────────────────────────────►│
  │                    │                    │                    │
  │                    │◄──ACK──────────────│                    │
  │                    │◄──ACK──────────────────────────────────│
  │                    │                    │                    │
  │          [Wait for quorum: 2 of 3 ACKs] │                    │
  │                    │                    │                    │
  │◄──ACK(msg_id)──────│                    │                    │
  │                    │                    │                    │

Message is now durable: survives any single node failure.
Latency = max(RTT to replica-1, RTT to replica-2) + processing time
```

Requiring more replicas to acknowledge before responding to clients increases durability but also latency. The industry sweet spot is quorum writes: for 3 replicas, wait for 2 acknowledgments. This survives one failure and keeps latency bounded by the faster majority, not the slowest node.
Unlike web applications, messaging must handle extended offline periods gracefully. A user may be offline for minutes, hours, or days, and must receive all missed messages upon reconnection.
When the recipient is offline, messages enter an offline queue (sometimes called an 'inbox'):
1. Server attempts to deliver message to recipient
2. Recipient is not connected (no active WebSocket)
3. Message is written to recipient's offline queue
4. Push notification sent to wake the device (optional)
5. When recipient connects:
a. Client requests sync: "Give me everything since X"
b. Server sends all queued messages
c. Client acknowledges receipt
d. Server clears delivered messages from queue
We can't store offline messages forever (storage costs, data protection laws):
| Policy Type | Retention | Justification |
|---|---|---|
| Time-based | 30 days | Balance storage costs with user expectations |
| Count-based | 1000 messages per chat | Prevent abuse (spam to never-active accounts) |
| Size-based | 100MB per user | Practical storage limit for media |
| Priority-based | Keep recent, drop old media first | Text is small; prioritize over media thumbnails |
The sync protocol must be efficient—users shouldn't wait minutes to receive messages after opening the app.
```typescript
interface SyncRequest {
  // Last known state for each conversation
  lastSyncTimestamps: Map<ConversationId, Timestamp>;

  // Or, simplified: single global sync point
  globalLastSyncTimestamp: Timestamp;

  // Maximum messages to receive per batch
  batchSize: number;

  // Whether to also fetch full media or just placeholders
  includeMedia: boolean;
}

interface SyncResponse {
  // Messages since last sync
  messages: Message[];

  // Are there more messages to fetch?
  hasMore: boolean;

  // New sync point (use this for next sync)
  newSyncTimestamp: Timestamp;

  // Conversations that have updates
  updatedConversations: ConversationMetadata[];
}

// Efficient sync flow:
// 1. Client connects with last sync timestamp
// 2. Server calculates: messages WHERE timestamp > lastSync
// 3. Server sends first batch (50-100 messages)
// 4. Client stores locally, updates UI
// 5. If hasMore, client requests next batch
// 6. Repeat until all caught up

// Optimization: Delta sync
// Instead of sending full messages, send only:
// - New messages (full content)
// - Status updates for existing messages (just IDs + new status)
// - Deleted message IDs (just IDs)
// This reduces bandwidth significantly for active users
```

A user inactive for months may have 10,000+ pending messages. Loading all at once creates poor UX and server strain. Solutions: 1) Prioritize recent conversations, 2) Load conversations incrementally as user navigates, 3) Show 'syncing' indicator with progress, 4) Let user browse already-loaded content while sync continues.
With at-least-once delivery and retries, duplicate messages are inevitable. The deduplication layer is what converts 'at-least-once' into 'effectively exactly-once.'
Server-side deduplication (essential): before storing, check the incoming message ID against recently seen IDs; if it is a duplicate, acknowledge it without storing or routing it again.
Client-side deduplication (defense in depth): before rendering, check the message ID against the local message store and silently drop anything already displayed.
Both layers are necessary. Server-side prevents storage bloat; client-side prevents user-visible duplicates.
```typescript
interface MessageStore {
  // Returns true if message was stored, false if duplicate
  storeIfNotExists(message: Message): Promise<boolean>;

  // Get existing message by ID
  getById(messageId: string): Promise<Message | null>;
}

class MessageHandler {
  private store: MessageStore;
  private dedupeCache: LRUCache<string, boolean>;

  constructor(store: MessageStore) {
    this.store = store;
    // Hot cache for recent message IDs
    // 10M entries × ~32 bytes = ~320MB
    this.dedupeCache = new LRUCache<string, boolean>({ max: 10_000_000 });
  }

  async handleIncomingMessage(message: Message): Promise<HandleResult> {
    const messageId = message.id;

    // Fast path: check in-memory cache
    if (this.dedupeCache.has(messageId)) {
      // Definite duplicate - acknowledge without processing
      return { status: 'DUPLICATE', ack: true };
    }

    // Slow path: check database
    const stored = await this.store.storeIfNotExists(message);

    if (!stored) {
      // Database reports duplicate
      this.dedupeCache.set(messageId, true);
      return { status: 'DUPLICATE', ack: true };
    }

    // New message - proceed with processing
    this.dedupeCache.set(messageId, true);
    await this.routeToRecipient(message);
    return { status: 'NEW', ack: true };
  }
}

// Database implementation using INSERT ... ON CONFLICT
// PostgreSQL example:
async function storeIfNotExists(message: Message): Promise<boolean> {
  const result = await db.query(`
    INSERT INTO messages (id, sender_id, content, created_at)
    VALUES ($1, $2, $3, $4)
    ON CONFLICT (id) DO NOTHING
    RETURNING id
  `, [message.id, message.senderId, message.content, message.createdAt]);

  return result.rowCount > 0; // True if inserted, false if conflict
}
```

We can't store every message ID forever. Deduplication has a time window after which we stop checking:
interface MessageStore { // Returns true if message was stored, false if duplicate storeIfNotExists(message: Message): Promise<boolean>; // Get existing message by ID getById(messageId: string): Promise<Message | null>;} class MessageHandler { private store: MessageStore; private dedupeCache: LRUCache<string, boolean>; constructor(store: MessageStore) { this.store = store; // Hot cache for recent message IDs // 10M entries × ~32 bytes = ~320MB this.dedupeCache = new LRUCache<string, boolean>({ max: 10_000_000 }); } async handleIncomingMessage(message: Message): Promise<HandleResult> { const messageId = message.id; // Fast path: check in-memory cache if (this.dedupeCache.has(messageId)) { // Definite duplicate - acknowledge without processing return { status: 'DUPLICATE', ack: true }; } // Slow path: check database const stored = await this.store.storeIfNotExists(message); if (!stored) { // Database reports duplicate this.dedupeCache.set(messageId, true); return { status: 'DUPLICATE', ack: true }; } // New message - proceed with processing this.dedupeCache.set(messageId, true); await this.routeToRecipient(message); return { status: 'NEW', ack: true }; }} // Database implementation using INSERT ... ON CONFLICT// PostgreSQL example:async function storeIfNotExists(message: Message): Promise<boolean> { const result = await db.query(` INSERT INTO messages (id, sender_id, content, created_at) VALUES ($1, $2, $3, $4) ON CONFLICT (id) DO NOTHING RETURNING id `, [message.id, message.senderId, message.content, message.createdAt]); return result.rowCount > 0; // True if inserted, false if conflict}We can't store every message ID forever. Deduplication has a time window after which we stop checking:
Trade-offs: a longer window catches later retries but costs more storage and lookup capacity; a shorter window is cheaper but risks re-processing a very late duplicate as a new message.
Assumption: If a retry comes after 24 hours, either the sending client has long since given up and surfaced a failure to the user, or that client is seriously misbehaving. In both cases, occasionally treating such a retry as a new message is an acceptable residual risk.
Implementation: Message IDs stored in TTL-backed cache (Redis with TTL) or database partition that's dropped after TTL.
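A sketch of such a window. In production the check is typically a single atomic Redis call (`SET msg:<id> 1 NX EX <ttl>`), but an in-memory map with an injectable clock shows the logic; the class name and TTL are illustrative:

```typescript
// TTL-bounded dedup window: remember an ID only for `ttlMs`.
// In production this would be Redis SET ... NX EX; the Map is a stand-in.
class DedupWindow {
  private seen = new Map<string, number>(); // id -> expiry (ms since epoch)

  constructor(private ttlMs: number, private now: () => number = Date.now) {}

  // Returns true the first time an ID is seen inside the window.
  firstSeen(messageId: string): boolean {
    const expiry = this.seen.get(messageId);
    if (expiry !== undefined && expiry > this.now()) return false; // duplicate
    this.seen.set(messageId, this.now() + this.ttlMs); // (re)arm the window
    return true;
  }
}

const window24h = new DedupWindow(24 * 60 * 60 * 1000);
window24h.firstSeen("m1"); // true: process the message
window24h.firstSeen("m1"); // false: duplicate, ACK without processing
```

The injectable `now` clock exists purely so the expiry behavior can be exercised without waiting; a real deployment would also evict expired entries rather than letting the map grow.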
With multiple servers handling messages, deduplication needs coordination. Two approaches: 1) Route all messages for a conversation to the same server (sticky routing by conversation ID), 2) Use distributed cache (Redis cluster) for deduplication. Sticky routing is simpler but creates hot spots for popular group chats.
Message delivery guarantees form the foundation of any reliable messaging system. Let's consolidate what we've learned:
1. Delivery semantics: true exactly-once is impossible; effectively exactly-once = at-least-once delivery + idempotent processing
2. Client-generated message IDs make retries safe to deduplicate
3. Acknowledgments (ACCEPTED, DELIVERED, SEEN) drive every state transition and UI indicator
4. Retries use exponential backoff with jitter; non-retryable errors fail fast
5. Causal ordering keeps conversations coherent; per-sender sequences suffice for 1:1 chat
6. Durability comes from write-ahead logging plus quorum replication
7. Offline queues and batched sync deliver messages after reconnection
8. Deduplication windows on server and client convert at-least-once delivery into an exactly-once experience
What's next:
With delivery guarantees established, we'll explore the real-time messaging architecture—the WebSocket infrastructure, connection management, and message routing that enables sub-second delivery to billions of concurrent users. We'll design the connection layer that makes all these guarantees practical at scale.
You now understand the theoretical foundations and practical engineering of reliable message delivery. These patterns—idempotency, acknowledgments, retries, ordering, and deduplication—are universal building blocks that apply to any distributed system requiring reliable communication.