Despite our best efforts with TTL strategies, event-based invalidation, and careful versioning, stale data in caches is inevitable. This isn't a failure of engineering—it's a fundamental property of distributed systems.
In any system with caching, the question isn't whether you'll serve stale data—it's how stale, how often, and what happens when you do. This page addresses the practical reality of living with staleness: measuring it, detecting it, communicating it, and designing systems that tolerate it gracefully.
By the end of this page, you will understand how to measure and monitor staleness in production, design systems that tolerate stale reads, implement stale-while-revalidate patterns, and communicate staleness appropriately to users and downstream systems.
Staleness isn't a single metric—it has multiple dimensions that matter differently depending on your use case. Understanding these dimensions helps you set appropriate SLOs and make informed tradeoffs.
The Four Dimensions of Staleness:
| Dimension | Definition | Example | Impact |
|---|---|---|---|
| Age | How long since the cached value was created | Entry cached 15 minutes ago | Bounds maximum possible staleness |
| Lag | Time between source change and cache update | DB updated 30 seconds before cache | Measures invalidation latency |
| Frequency | How often stale data is served | 5% of reads serve stale data | Indicates overall cache health |
| Magnitude | How different the cached value is from truth | Price shows $10 when actual is $12 | Measures user-visible impact |
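Of these, magnitude is the only dimension that can't be read off cache metadata; estimating it requires comparing a sample of cached values against the source of truth. A minimal sketch (the function name and approach are illustrative assumptions, not a standard API):

```typescript
// Hypothetical helper for estimating staleness magnitude: the relative
// difference between a cached numeric value and the current source value.
// In practice you'd run this on a small random sample of reads.
function relativeMagnitude(cached: number, actual: number): number {
  if (actual === 0) return cached === 0 ? 0 : 1; // Avoid division by zero
  return Math.abs(cached - actual) / Math.abs(actual);
}

// The table's example: price shows $10 when actual is $12
const m = relativeMagnitude(10, 12); // ~0.167, i.e. ~16.7% relative error
```

Aggregating this per entity type (for example, as a histogram) turns a vague sense of "the cache is a bit off" into a measurable, alertable signal.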
```typescript
/**
 * Staleness Metrics Collector
 *
 * Instruments cache reads to measure staleness across dimensions.
 */

interface StalenessMetrics {
  /** Age of the cached entry in milliseconds */
  ageMs: number;
  /** Lag between source update and cache (if detectable) */
  lagMs?: number;
  /** Whether this read was stale (if verifiable) */
  isStale?: boolean;
  /** Cache entry metadata */
  cacheKey: string;
  entityType: string;
}

class StalenessMonitor {
  private metrics: MetricsClient;

  constructor(metricsClient: MetricsClient) {
    this.metrics = metricsClient;
  }

  /**
   * Record staleness metrics for a cache read
   */
  recordRead(metrics: StalenessMetrics): void {
    // Age histogram - understand age distribution
    this.metrics.histogram('cache.staleness.age_ms', metrics.ageMs, {
      entityType: metrics.entityType,
    });

    // Lag histogram - if we can measure it
    if (metrics.lagMs !== undefined) {
      this.metrics.histogram('cache.staleness.lag_ms', metrics.lagMs, {
        entityType: metrics.entityType,
      });
    }

    // Staleness rate counter
    if (metrics.isStale !== undefined) {
      this.metrics.increment(
        metrics.isStale ? 'cache.reads.stale' : 'cache.reads.fresh',
        { entityType: metrics.entityType }
      );
    }
  }

  /**
   * Calculate P50/P95/P99 staleness from recent data
   */
  async getStalenessPercentiles(entityType: string, window: string): Promise<{
    p50: number;
    p95: number;
    p99: number;
  }> {
    return this.metrics.queryHistogramPercentiles(
      'cache.staleness.age_ms',
      { entityType },
      window,
      [50, 95, 99]
    );
  }
}

/**
 * Enhanced cache entry with staleness tracking metadata
 */
interface TrackedCacheEntry<T> {
  value: T;
  cachedAt: number;
  sourceUpdatedAt?: number; // When source was last updated (if known)
  sourceVersion?: string;   // Version/ETag from source
}

class StalenessAwareCache {
  constructor(
    private cache: CacheClient,
    private monitor: StalenessMonitor,
  ) {}

  async get<T>(
    key: string,
    entityType: string
  ): Promise<{ value: T | null; staleness: StalenessMetrics | null }> {
    const raw = await this.cache.get(key);
    if (!raw) {
      return { value: null, staleness: null };
    }

    const entry = JSON.parse(raw) as TrackedCacheEntry<T>;
    const now = Date.now();

    const staleness: StalenessMetrics = {
      ageMs: now - entry.cachedAt,
      lagMs: entry.sourceUpdatedAt
        ? entry.cachedAt - entry.sourceUpdatedAt
        : undefined,
      cacheKey: key,
      entityType,
    };

    this.monitor.recordRead(staleness);

    return { value: entry.value, staleness };
  }

  async set<T>(
    key: string,
    value: T,
    ttlSeconds: number,
    sourceMetadata?: { updatedAt: number; version?: string }
  ): Promise<void> {
    const entry: TrackedCacheEntry<T> = {
      value,
      cachedAt: Date.now(),
      sourceUpdatedAt: sourceMetadata?.updatedAt,
      sourceVersion: sourceMetadata?.version,
    };

    await this.cache.setex(key, ttlSeconds, JSON.stringify(entry));
  }
}

interface MetricsClient {
  histogram(metric: string, value: number, tags: Record<string, string>): void;
  increment(metric: string, tags: Record<string, string>): void;
  queryHistogramPercentiles(
    metric: string,
    tags: Record<string, string>,
    window: string,
    percentiles: number[]
  ): Promise<{ [key: string]: number }>;
}
```

Define explicit Service Level Objectives for staleness. For example: "P95 cache age must be under 5 minutes" or "Stale read rate must be below 2%". These SLOs guide TTL tuning and alert on cache degradation.
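Such SLOs can be checked mechanically against collected metrics. A sketch of what that evaluation might look like (the `StalenessSlo` shape and thresholds are illustrative assumptions, not part of any real monitoring client):

```typescript
// Hypothetical SLO definition and evaluation; thresholds are illustrative.
interface StalenessSlo {
  entityType: string;
  maxP95AgeMs: number;      // e.g. "P95 cache age under 5 minutes"
  maxStaleReadRate: number; // e.g. "stale read rate below 2%"
}

interface StalenessSnapshot {
  p95AgeMs: number;
  staleReads: number;
  totalReads: number;
}

// Returns a list of human-readable SLO violations (empty if all pass)
function evaluateSlo(slo: StalenessSlo, snap: StalenessSnapshot): string[] {
  const violations: string[] = [];
  if (snap.p95AgeMs > slo.maxP95AgeMs) {
    violations.push(`p95 age ${snap.p95AgeMs}ms exceeds ${slo.maxP95AgeMs}ms`);
  }
  const staleRate = snap.totalReads === 0 ? 0 : snap.staleReads / snap.totalReads;
  if (staleRate > slo.maxStaleReadRate) {
    violations.push(
      `stale read rate ${(staleRate * 100).toFixed(1)}% exceeds ` +
      `${slo.maxStaleReadRate * 100}%`
    );
  }
  return violations;
}

const productSlo: StalenessSlo = {
  entityType: 'product',
  maxP95AgeMs: 5 * 60 * 1000, // 5 minutes
  maxStaleReadRate: 0.02,     // 2%
};

// P95 age of 7 minutes violates the age budget; a 1% stale rate does not
const violations = evaluateSlo(productSlo, {
  p95AgeMs: 7 * 60 * 1000,
  staleReads: 10,
  totalReads: 1000,
});
```

Wiring this to an alerting pipeline closes the loop: SLO violations become pages or tickets rather than silent degradation.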
The stale-while-revalidate pattern is one of the most powerful techniques for balancing freshness and performance. It allows serving stale data immediately while asynchronously fetching fresh data in the background.
The key insight: Users would rather see slightly stale data instantly than wait for perfectly fresh data. The pattern optimizes for perceived performance while still maintaining eventual freshness.
```typescript
/**
 * Stale-While-Revalidate Implementation
 *
 * Serves stale data immediately while refreshing in background.
 * Configurable freshness and staleness windows.
 */

interface SWRConfig {
  /** Time in seconds during which data is considered fresh */
  freshTtlSeconds: number;
  /** Additional time during which stale data can be served while revalidating */
  staleWhileRevalidateSeconds: number;
  /** Maximum age before data is considered too stale to serve */
  maxStaleSeconds: number;
}

interface SWREntry<T> {
  value: T;
  cachedAt: number;
  revalidating?: boolean;
}

enum FreshnessState {
  FRESH = 'FRESH',                       // Within freshTtl
  STALE_BUT_USABLE = 'STALE_BUT_USABLE', // Within staleWhileRevalidate window
  TOO_STALE = 'TOO_STALE',               // Beyond maxStale
}

class StaleWhileRevalidateCache<T> {
  private revalidationInProgress = new Set<string>();

  constructor(
    private cache: CacheClient,
    private fetcher: (key: string) => Promise<T>,
    private config: SWRConfig,
  ) {}

  async get(key: string): Promise<{ value: T | null; freshness: FreshnessState }> {
    const raw = await this.cache.get(key);

    if (!raw) {
      // Cache miss - fetch synchronously
      return this.fetchAndCache(key);
    }

    const entry = JSON.parse(raw) as SWREntry<T>;
    const freshness = this.determineFreshness(entry);

    switch (freshness) {
      case FreshnessState.FRESH:
        // Data is fresh, return immediately
        return { value: entry.value, freshness };

      case FreshnessState.STALE_BUT_USABLE:
        // Return stale data immediately, revalidate in background
        this.triggerBackgroundRevalidation(key);
        return { value: entry.value, freshness };

      case FreshnessState.TOO_STALE:
        // Data too old, fetch synchronously
        return this.fetchAndCache(key);
    }
  }

  private determineFreshness(entry: SWREntry<T>): FreshnessState {
    const ageSeconds = (Date.now() - entry.cachedAt) / 1000;

    if (ageSeconds <= this.config.freshTtlSeconds) {
      return FreshnessState.FRESH;
    }

    if (
      ageSeconds <=
      this.config.freshTtlSeconds + this.config.staleWhileRevalidateSeconds
    ) {
      return FreshnessState.STALE_BUT_USABLE;
    }

    if (ageSeconds <= this.config.maxStaleSeconds) {
      return FreshnessState.STALE_BUT_USABLE; // Still serve, but flag as stale
    }

    return FreshnessState.TOO_STALE;
  }

  private triggerBackgroundRevalidation(key: string): void {
    // Prevent concurrent revalidations for same key
    if (this.revalidationInProgress.has(key)) {
      return;
    }
    this.revalidationInProgress.add(key);

    // Fire-and-forget background refresh
    setImmediate(async () => {
      try {
        const freshValue = await this.fetcher(key);
        await this.cacheValue(key, freshValue);
      } catch (error) {
        console.error(`Background revalidation failed for ${key}:`, error);
        // Keep serving stale data, don't delete cache entry
      } finally {
        this.revalidationInProgress.delete(key);
      }
    });
  }

  private async fetchAndCache(
    key: string
  ): Promise<{ value: T | null; freshness: FreshnessState }> {
    try {
      const value = await this.fetcher(key);
      await this.cacheValue(key, value);
      return { value, freshness: FreshnessState.FRESH };
    } catch (error) {
      console.error(`Fetch failed for ${key}:`, error);
      return { value: null, freshness: FreshnessState.TOO_STALE };
    }
  }

  private async cacheValue(key: string, value: T): Promise<void> {
    const entry: SWREntry<T> = {
      value,
      cachedAt: Date.now(),
    };
    // Cache for the total possible lifetime
    const totalTtl = this.config.maxStaleSeconds + 60; // Buffer for clock drift
    await this.cache.setex(key, totalTtl, JSON.stringify(entry));
  }
}

// Example usage for a product catalog
const productCache = new StaleWhileRevalidateCache<Product>(
  redis,
  async (key) => {
    const productId = key.split(':')[1];
    return await productService.getProduct(productId);
  },
  {
    freshTtlSeconds: 60,              // Fresh for 1 minute
    staleWhileRevalidateSeconds: 300, // Serve stale for 5 more minutes
    maxStaleSeconds: 3600,            // Absolute max 1 hour
  }
);

// Usage
const { value: product, freshness } = await productCache.get('product:123');
if (freshness === FreshnessState.STALE_BUT_USABLE) {
  // Optionally indicate to user that data might not be current
  console.log('Showing cached product data, refreshing in background...');
}
```

The stale-while-revalidate pattern is standardized in HTTP as a Cache-Control directive: `Cache-Control: max-age=60, stale-while-revalidate=300`. CDNs like Cloudflare and Fastly implement this natively, applying the pattern at the edge without application code.
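If your origin emits that directive itself, it can be derived from the same configuration the application-level cache uses, so the edge and the application agree on freshness windows. A small sketch (the `SwrPolicy` name is an assumption for illustration):

```typescript
// Derive the standardized Cache-Control directive from the same freshness
// windows used by the application cache. Names here are illustrative.
interface SwrPolicy {
  freshTtlSeconds: number;
  staleWhileRevalidateSeconds: number;
}

function cacheControlFor(policy: SwrPolicy): string {
  return (
    `max-age=${policy.freshTtlSeconds}, ` +
    `stale-while-revalidate=${policy.staleWhileRevalidateSeconds}`
  );
}

const header = cacheControlFor({
  freshTtlSeconds: 60,
  staleWhileRevalidateSeconds: 300,
});
// "max-age=60, stale-while-revalidate=300"
```

Keeping one source of truth for these windows avoids the classic drift where the CDN serves stale content longer (or shorter) than the application expects.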
When your backend or primary data source becomes unavailable, stale cached data becomes a lifeline. Rather than failing entirely, you can continue serving—degraded but functional—using cached data beyond its normal TTL.
The Graceful Degradation Hierarchy:

1. Fresh data fetched from the source
2. Stale cached data, served within an extended fallback TTL
3. A static fallback value as a last resort
4. An explicit error, only when all of the above fail
```typescript
/**
 * Graceful Degradation with Stale-If-Error
 *
 * When fetching fails, fall back to stale cached data
 * rather than returning an error to the user.
 */

interface DegradationConfig {
  /** Normal TTL for fresh data */
  normalTtlSeconds: number;
  /** Extended TTL for fallback during errors */
  errorFallbackTtlSeconds: number;
  /** Static fallback if no cache available */
  staticFallback?: unknown;
}

class GracefulDegradationCache<T> {
  constructor(
    private cache: CacheClient,
    private fetcher: (key: string) => Promise<T>,
    private config: DegradationConfig,
    private circuitBreaker: CircuitBreaker,
  ) {}

  async get(key: string): Promise<{
    value: T | null;
    source: 'fresh' | 'stale' | 'fallback' | 'error';
    staleness?: number;
  }> {
    // Try to fetch fresh data first
    if (this.circuitBreaker.isOpen()) {
      // Backend known to be down, skip to stale
      return this.getFromStaleCache(key, 'circuit_open');
    }

    try {
      const freshValue = await this.fetcher(key);
      this.circuitBreaker.recordSuccess();
      await this.cacheValue(key, freshValue);
      return { value: freshValue, source: 'fresh' };
    } catch (error) {
      // Fetch failed, try to serve stale
      this.circuitBreaker.recordFailure();
      return this.getFromStaleCache(key, 'fetch_failed');
    }
  }

  private async getFromStaleCache(
    key: string,
    reason: string
  ): Promise<{
    value: T | null;
    source: 'stale' | 'fallback' | 'error';
    staleness?: number;
  }> {
    // Check if we have any cached value (even if expired)
    const raw = await this.cache.get(key);

    if (raw) {
      const { value, cachedAt } = JSON.parse(raw);
      const staleness = (Date.now() - cachedAt) / 1000;

      // Check if within extended fallback TTL
      if (staleness <= this.config.errorFallbackTtlSeconds) {
        console.warn(
          `Serving stale data for ${key} (${reason}), age: ${staleness}s`
        );
        return { value, source: 'stale', staleness };
      }
    }

    // No valid cache, try static fallback
    if (this.config.staticFallback !== undefined) {
      console.warn(`Serving static fallback for ${key} (${reason})`);
      return { value: this.config.staticFallback as T, source: 'fallback' };
    }

    // Complete failure
    return { value: null, source: 'error' };
  }

  private async cacheValue(key: string, value: T): Promise<void> {
    const entry = {
      value,
      cachedAt: Date.now(),
    };
    // Cache with extended TTL for error fallback
    const totalTtl = this.config.errorFallbackTtlSeconds + 60;
    await this.cache.setex(key, totalTtl, JSON.stringify(entry));
  }
}

/**
 * Simple circuit breaker for backend health
 */
class CircuitBreaker {
  private failures = 0;
  private lastFailure = 0;
  private state: 'closed' | 'open' | 'half-open' = 'closed';

  constructor(
    private failureThreshold: number = 5,
    private resetTimeMs: number = 30000,
  ) {}

  isOpen(): boolean {
    if (this.state === 'open') {
      // Check if reset time has passed
      if (Date.now() - this.lastFailure > this.resetTimeMs) {
        this.state = 'half-open';
        return false; // Allow one request through
      }
      return true;
    }
    return false;
  }

  recordFailure(): void {
    this.failures++;
    this.lastFailure = Date.now();
    if (this.failures >= this.failureThreshold) {
      this.state = 'open';
      console.error('Circuit breaker opened: backend unavailable');
    }
  }

  recordSuccess(): void {
    this.failures = 0;
    this.state = 'closed';
  }
}

// Example: Product catalog with graceful degradation
const degradingProductCache = new GracefulDegradationCache<Product>(
  redis,
  async (key) => await productAPI.fetch(key),
  {
    normalTtlSeconds: 300,         // 5 minutes normal
    errorFallbackTtlSeconds: 3600, // Serve up to 1 hour old during outage
    staticFallback: {              // Last resort
      id: 'unknown',
      name: 'Product Temporarily Unavailable',
      price: 0,
      available: false,
    },
  },
  new CircuitBreaker(5, 30000), // Open after 5 failures, reset after 30s
);

// Usage in request handler
async function handleProductRequest(productId: string): Promise<Response> {
  const result = await degradingProductCache.get(`product:${productId}`);

  if (result.source === 'error') {
    return new Response('Service unavailable', { status: 503 });
  }

  // Include freshness metadata in response headers
  const headers = new Headers({
    'Content-Type': 'application/json',
    'X-Data-Source': result.source,
  });
  if (result.staleness !== undefined) {
    headers.set('X-Data-Staleness-Seconds', result.staleness.toString());
  }

  return new Response(JSON.stringify(result.value), { headers });
}
```

Sometimes you need to know definitively whether cached data is stale—not just probabilistically old. This is especially important for critical data where the cost of staleness is high.
Detection Approaches:
Version/ETag Validation
Store a version identifier with the cached data. Before using, optionally validate against the source's current version.
How it works: the cache stores each value together with a version identifier from the source, e.g. `{ value, version: 'v42' }`. On a critical read, a lightweight call fetches the source's current version; if it matches the cached one, the entry is provably fresh, otherwise it is refetched.

Trade-off: Adds a network round-trip per read, but guarantees freshness for critical data.
```typescript
/**
 * Version-Based Staleness Detection
 */

interface VersionedEntry<T> {
  value: T;
  version: string; // ETag, updatedAt, or version number
  cachedAt: number;
}

class VersionValidatingCache<T> {
  constructor(
    private cache: CacheClient,
    private fetcher: (key: string) => Promise<T>,
    private versionFetcher: (key: string) => Promise<string>,
    private ttlSeconds: number,
  ) {}

  async get(key: string, validateFreshness: boolean = false): Promise<T | null> {
    const raw = await this.cache.get(key);

    if (!raw) {
      return this.fetchAndCache(key);
    }

    const entry = JSON.parse(raw) as VersionedEntry<T>;

    if (!validateFreshness) {
      // Trust cache without validation
      return entry.value;
    }

    // Validate version against source
    try {
      const currentVersion = await this.versionFetcher(key);

      if (currentVersion === entry.version) {
        // Cache is fresh
        return entry.value;
      }

      // Cache is stale, refetch
      console.log(`Cache stale for ${key}: ${entry.version} vs ${currentVersion}`);
      return this.fetchAndCache(key);
    } catch (error) {
      // Version check failed, return cached value as fallback
      console.warn(`Version check failed for ${key}, using cached`);
      return entry.value;
    }
  }

  private async fetchAndCache(key: string): Promise<T | null> {
    const [value, version] = await Promise.all([
      this.fetcher(key),
      this.versionFetcher(key),
    ]);

    const entry: VersionedEntry<T> = {
      value,
      version,
      cachedAt: Date.now(),
    };

    await this.cache.setex(key, this.ttlSeconds, JSON.stringify(entry));
    return value;
  }
}

// Usage
const userCache = new VersionValidatingCache<User>(
  redis,
  async (key) => userService.getUser(key.split(':')[1]),
  async (key) => userService.getUserVersion(key.split(':')[1]), // Lightweight call
  300,
);

// For critical reads, validate freshness
const user = await userCache.get('user:123', /* validateFreshness */ true);

// For non-critical reads, skip validation
const userFast = await userCache.get('user:123', /* validateFreshness */ false);
```

| Technique | Latency Impact | Accuracy | Cost | Best For |
|---|---|---|---|---|
| Version Check | High (+1 round trip) | 100% | Medium | Critical data, low volume |
| Conditional Fetch | Medium (optimized) | 100% | Medium | HTTP-based sources |
| Sampling | None (async) | Statistical | Low | High-volume, monitoring |
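The "Conditional Fetch" row deserves a sketch of its own, since it wasn't shown above: with HTTP sources, an `If-None-Match` request lets the origin answer `304 Not Modified` when the cached ETag is still current, confirming freshness without transferring the body. A minimal illustration (the `fetchFn` parameter stands in for a real HTTP client so the flow stays testable; names are assumptions):

```typescript
// Conditional-fetch sketch using HTTP ETags. A 304 response confirms the
// cached value is still current; a 200 carries a fresh body and new ETag.
interface EtagEntry<T> {
  value: T;
  etag: string;
}

async function conditionalGet<T>(
  url: string,
  cached: EtagEntry<T> | null,
  fetchFn: (url: string, headers: Record<string, string>) => Promise<{
    status: number;
    etag?: string;
    body?: T;
  }>,
): Promise<{ value: T; revalidated: boolean }> {
  const headers: Record<string, string> = {};
  if (cached) headers['If-None-Match'] = cached.etag;

  const res = await fetchFn(url, headers);
  if (res.status === 304 && cached) {
    // Source confirms the cached value is still current - no body transferred
    return { value: cached.value, revalidated: true };
  }
  // 200: fresh body returned; caller should update the cache with res.etag
  return { value: res.body as T, revalidated: false };
}
```

This sits between trusting the cache blindly and paying for a full refetch on every read: you still spend a round trip, but the response is tiny when nothing changed.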
When serving potentially stale data, transparency with clients and users helps them make informed decisions. Whether through HTTP headers, response metadata, or UI indicators, communicating staleness builds trust and enables appropriate reactions.
```typescript
/**
 * Staleness Communication Patterns
 *
 * Multiple approaches for informing clients about data freshness.
 */

// ============================================
// 1. HTTP Headers (API Responses)
// ============================================

interface CacheMetadata {
  cachedAt: number;
  source: 'fresh' | 'stale' | 'fallback';
  age: number;     // Seconds since caching
  maxAge?: number; // Expected freshness duration
}

function addCacheHeaders(
  response: Response,
  metadata: CacheMetadata
): Response {
  const headers = new Headers(response.headers);

  // Standard HTTP headers
  headers.set('Age', metadata.age.toString());
  headers.set('Date', new Date(metadata.cachedAt).toUTCString());

  // Custom headers for detailed staleness info
  headers.set('X-Cache-Status', metadata.source);
  headers.set('X-Cache-Age-Seconds', metadata.age.toString());

  if (metadata.source !== 'fresh') {
    // Indicate data may not be current
    headers.set('Warning', '110 - "Response is stale"');
  }

  if (metadata.maxAge) {
    // Suggest when client should refetch
    const remainingFreshness = Math.max(0, metadata.maxAge - metadata.age);
    headers.set(
      'Cache-Control',
      `max-age=${remainingFreshness}, stale-while-revalidate=300`
    );
  }

  return new Response(response.body, {
    status: response.status,
    headers,
  });
}

// ============================================
// 2. Response Envelope (API JSON)
// ============================================

interface EnvelopedResponse<T> {
  data: T;
  meta: {
    cached: boolean;
    cachedAt: string;
    ageSeconds: number;
    freshness: 'fresh' | 'stale-ok' | 'stale-degraded';
    nextRefresh?: string;
  };
}

function envelopeResponse<T>(
  data: T,
  cacheMetadata: CacheMetadata
): EnvelopedResponse<T> {
  return {
    data,
    meta: {
      cached: cacheMetadata.source !== 'fresh',
      cachedAt: new Date(cacheMetadata.cachedAt).toISOString(),
      ageSeconds: cacheMetadata.age,
      freshness: mapToFreshness(cacheMetadata),
      nextRefresh: cacheMetadata.maxAge
        ? new Date(cacheMetadata.cachedAt + cacheMetadata.maxAge * 1000).toISOString()
        : undefined,
    },
  };
}

function mapToFreshness(
  metadata: CacheMetadata
): 'fresh' | 'stale-ok' | 'stale-degraded' {
  if (metadata.source === 'fresh') return 'fresh';
  if (metadata.source === 'stale') return 'stale-ok';
  return 'stale-degraded';
}

// ============================================
// 3. GraphQL Extensions
// ============================================

interface GraphQLResponse<T> {
  data: T;
  extensions?: {
    cacheControl?: {
      version: number;
      hints: Array<{
        path: string[];
        maxAge: number;
        scope: 'PUBLIC' | 'PRIVATE';
      }>;
    };
    staleness?: {
      staleFields: string[];
      ageByField: Record<string, number>;
    };
  };
}

// ============================================
// 4. UI Components (Frontend)
// ============================================

/**
 * React component for displaying staleness indicators
 */
const StalenessIndicatorExample = `
// React component
function DataCard({ data, staleness }) {
  const getIndicator = () => {
    if (staleness.freshness === 'fresh') return null;

    if (staleness.freshness === 'stale-ok') {
      return (
        <div className="text-yellow-600 text-sm">
          Data from {formatTimeAgo(staleness.cachedAt)}
          <button onClick={onRefresh}>Refresh</button>
        </div>
      );
    }

    return (
      <div className="text-orange-600 text-sm bg-orange-50 p-2">
        ⚠️ Showing cached data (service temporarily unavailable)
        <span>Data from {formatTimeAgo(staleness.cachedAt)}</span>
      </div>
    );
  };

  return (
    <div>
      {getIndicator()}
      <DataContent data={data} />
    </div>
  );
}`;

// ============================================
// 5. Event Stream Updates
// ============================================

/**
 * For real-time apps, notify clients when fresher data is available
 */
class StalenessEventEmitter {
  private clients = new Map<string, Set<(event: StalenessEvent) => void>>();

  subscribe(
    entityId: string,
    callback: (event: StalenessEvent) => void
  ): () => void {
    if (!this.clients.has(entityId)) {
      this.clients.set(entityId, new Set());
    }
    this.clients.get(entityId)!.add(callback);

    return () => this.clients.get(entityId)?.delete(callback);
  }

  notifyRefreshed(entityId: string, newVersion: string): void {
    const subscribers = this.clients.get(entityId);
    if (subscribers) {
      const event: StalenessEvent = {
        type: 'data_refreshed',
        entityId,
        newVersion,
        timestamp: Date.now(),
      };
      subscribers.forEach(callback => callback(event));
    }
  }
}

interface StalenessEvent {
  type: 'data_refreshed' | 'data_stale';
  entityId: string;
  newVersion?: string;
  timestamp: number;
}
```

Age, Cache-Control, and Warning headers are well understood by CDNs and browsers.

Stale data is an inevitable reality in any cached system. Rather than treating staleness as a bug to eliminate, we should measure it, manage it, and communicate it transparently. The techniques in this page provide a comprehensive toolkit for living gracefully with cache staleness.
Module Complete:
You've now completed the Cache Invalidation Strategies module, covering TTL-based expiration, event-based invalidation, version-based invalidation, and living with staleness.

These four techniques form a comprehensive toolkit for managing cache freshness in production systems. Real-world caching typically combines multiple techniques: TTL as a baseline, events for critical data, versioning for deployments, and staleness handling for resilience.
Congratulations! You've mastered cache invalidation strategies—Phil Karlton's hardest problem in computer science. You now understand how to keep cached data fresh, handle the inevitable staleness, and build resilient caching systems that degrade gracefully under pressure.