Design a globally distributed Content Delivery Network (CDN) like Cloudflare, Akamai, or AWS CloudFront. The CDN caches and serves static content (images, CSS, JS, videos) from edge servers geographically close to users, dramatically reducing latency and offloading traffic from origin servers.
| Metric | Value |
|---|---|
| Edge Points of Presence (PoPs) | 200–300 globally |
| Total bandwidth served | 100+ Tbps (terabits per second) peak |
| Requests served | 50+ billion / day |
| Cache hit ratio (target) | 90–95% (only 5–10% of requests reach origin) |
| Average edge response time | < 10ms for cache hit |
| Origin shield → origin latency | < 50ms (same-region shield) |
| Content types | Static: images, CSS, JS, fonts, video segments; Dynamic: API acceleration |
| Cache storage per PoP | 10–100 TB SSD |
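As a rough sanity check, the headline numbers above can be turned into an estimate of how much traffic actually reaches the origin tier. This is a back-of-envelope sketch using the assumed figures from the table (50B requests/day, 90–95% hit ratio):

```python
# Back-of-envelope origin load, derived from the assumed table figures.
requests_per_day = 50e9
seconds_per_day = 86_400

avg_rps = requests_per_day / seconds_per_day  # ~578,704 req/s average at the edge

for hit_ratio in (0.90, 0.95):
    origin_rps = avg_rps * (1 - hit_ratio)  # only misses reach origin/shield
    print(f"hit ratio {hit_ratio:.0%}: ~{origin_rps:,.0f} req/s reach the origin tier")
```

Even a 5-point swing in hit ratio (90% vs 95%) halves origin load, which is why cache hit ratio is the single most watched CDN metric.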
- Serve static content (images, CSS, JS, videos, fonts) from edge servers geographically close to the user, minimising latency
- Route user requests to the nearest/optimal edge server using DNS-based or Anycast routing
- Cache content at edge servers with configurable TTL; serve cache hits without contacting the origin server
- Fetch content from the origin server on cache miss, cache it at the edge, and serve it to the user (pull-based CDN)
- Support cache invalidation/purge: origin can trigger immediate purge of specific URLs or cache tags across all edge PoPs
- Support push-based content distribution: origin pre-pushes content to edge servers before user requests arrive (for known popular content)
- Handle TLS termination at the edge: serve HTTPS with per-domain SSL certificates and HTTP/2 support
- Provide origin shielding: a mid-tier cache layer between edges and origin to reduce origin load during cache misses
- Support dynamic content acceleration: TCP optimisation, connection keep-alive, and route optimisation for non-cacheable requests
- Provide real-time analytics: cache hit ratio, bandwidth served, latency per PoP, error rates, top requested URLs
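The caching, miss-fill, and purge requirements above can be sketched as a minimal pull-based edge cache. This is an illustrative Python sketch, not a production design: `fetch_origin` is a hypothetical stand-in for the HTTP fetch to the origin (or origin shield), and eviction, stale-while-revalidate, and cross-PoP purge fan-out are deliberately omitted.

```python
import time
from typing import Callable


class EdgeCache:
    """Minimal pull-based edge cache sketch (illustrative only).

    A hit within TTL is served from memory; a miss or expired entry triggers
    `fetch_origin` (standing in for the request to the origin or shield),
    and the result is cached with the configured TTL.
    """

    def __init__(self, fetch_origin: Callable[[str], bytes], default_ttl: float = 300.0):
        self._fetch_origin = fetch_origin
        self._default_ttl = default_ttl
        self._store: dict[str, tuple[bytes, float]] = {}  # url -> (body, expires_at)

    def get(self, url: str) -> bytes:
        now = time.monotonic()
        entry = self._store.get(url)
        if entry is not None and entry[1] > now:   # cache hit: serve from edge
            return entry[0]
        body = self._fetch_origin(url)             # cache miss: pull from origin
        self._store[url] = (body, now + self._default_ttl)
        return body

    def purge(self, url: str) -> None:
        """Invalidate one URL; a real CDN fans this out to every PoP."""
        self._store.pop(url, None)
```

Usage under the same assumptions: the first `get` for a URL calls the origin, repeated `get`s within the TTL are served from the edge, and `purge` forces the next request back to the origin.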
- Non-functional requirements define the system qualities critical to your users. Frame them as 'The system should be able to...' statements; they will guide your deep dives later.
- Think about CAP theorem trade-offs, scalability limits, latency targets, durability guarantees, security requirements, fault tolerance, and compliance needs.
- Frame NFRs for this specific system: 'cache hits served from the edge in under 10ms' is far more valuable than just 'low latency'.
- Add concrete numbers: 'P99 response time < 500ms', '99.9% availability', '10M DAU'. This drives architectural decisions.
- Choose the 3-5 most critical NFRs. Every system should be 'scalable', but what makes THIS system's scaling uniquely challenging?
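Concrete numbers matter because they translate directly into budgets. As a worked example (using assumed targets, not figures from this design), an availability NFR maps to a monthly downtime budget like so:

```python
# Translating an availability target into a downtime budget (assumed targets).
SECONDS_PER_MONTH = 30 * 24 * 3600  # 2,592,000 s in a 30-day month

for target in (0.999, 0.9999):
    downtime_min = SECONDS_PER_MONTH * (1 - target) / 60
    print(f"{target:.2%} availability -> {downtime_min:.1f} min downtime/month")
# 99.90% availability -> 43.2 min downtime/month
# 99.99% availability -> 4.3 min downtime/month
```

A 'three nines' CDN can absorb a 43-minute regional incident per month; 'four nines' cannot, which pushes the design toward Anycast failover and multi-PoP redundancy.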