Design a global Content Delivery Network (CDN) with edge computing capabilities. The system deploys 200+ edge Points of Presence (PoPs) worldwide and routes each user request to the optimal PoP via Anycast or GeoDNS. It caches static content in tiered storage (RAM + SSD) with configurable TTLs and sub-5-second global purge propagation, and supports edge compute (V8 isolates / Wasm) for A/B testing, auth, and geo-personalisation. Video is delivered via HLS/DASH with adaptive bitrate and request collapsing for live streaming. The edge also provides DDoS absorption (200+ Tbps) and a WAF, and accelerates dynamic content via connection pooling and smart routing. Targets: > 90% cache hit ratio and < 50ms TTFB for cached content.
| Metric | Value |
|---|---|
| Edge PoPs | 200+ globally |
| Aggregate network capacity | 200+ Tbps |
| HTTP requests/sec (global) | 20 million+ |
| Cache hit ratio (target) | > 90% |
| TTFB for cached content | < 50ms |
| Purge propagation time | < 5 seconds globally |
| Edge compute cold start | < 5ms (V8 isolates) |
| TLS handshake (to nearest PoP) | < 20ms |
| Video segments served/sec | Millions |
| Internet traffic served by CDNs | 30%+ |
Global content distribution: cache and serve static content (HTML, CSS, JS, images, videos, fonts) from edge servers (PoPs — Points of Presence) deployed in 200+ locations worldwide; user requests routed to the nearest PoP; significantly reduce latency compared to fetching from the origin server
Request routing: intelligently route each user request to the optimal edge PoP; routing criteria: geographic proximity, network latency (RTT), edge server health and load, content availability at the edge; techniques: Anycast (BGP-based, same IP announced from all PoPs, routing to nearest), DNS-based (GeoDNS resolves to nearest PoP IP), HTTP redirect (302 to optimal PoP URL)
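The GeoDNS path above can be sketched as a resolver that picks the healthy PoP closest to the client. This is a minimal sketch: the PoP codes, coordinates, and `nearest_pop` helper are illustrative, and a real resolver would also weigh measured RTT and current PoP load, not just distance.

```python
import math

# Hypothetical PoP coordinates (lat, lon) -- illustrative only.
POPS = {
    "fra": (50.11, 8.68),    # Frankfurt
    "iad": (38.95, -77.45),  # N. Virginia
    "sin": (1.35, 103.99),   # Singapore
}

def haversine_km(a, b):
    """Great-circle distance between two (lat, lon) points in km."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*a, *b))
    dlat, dlon = lat2 - lat1, lon2 - lon1
    h = math.sin(dlat / 2) ** 2 + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    return 2 * 6371 * math.asin(math.sqrt(h))

def nearest_pop(client_loc, healthy=POPS):
    """GeoDNS resolver core: return the closest healthy PoP code.
    Unhealthy PoPs would simply be dropped from `healthy` before the call."""
    return min(healthy, key=lambda p: haversine_km(client_loc, healthy[p]))
```

A client resolving from Paris would be directed to the Frankfurt PoP; with Anycast the same effect is achieved at the BGP layer, with no application-level lookup at all.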
Caching layer: edge servers cache content with configurable TTL (Cache-Control headers); cache HIT → serve directly from edge (< 10ms); cache MISS → fetch from origin → cache at edge → serve to user; support cache hierarchies: L1 edge PoP → L2 regional hub → origin (reduces origin load on cache misses)
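The L1 edge → L2 regional hub → origin flow can be sketched as a two-tier TTL cache. This is a simplified model (no eviction, single process, in-memory dicts standing in for RAM/SSD tiers); the `origin_fetch` callable and the `now` clock parameter are assumptions for illustration and testability.

```python
import time

class TieredCache:
    """Two-tier lookup: L1 edge -> L2 regional hub -> origin fetch.
    On a miss, the fetched object is cached at both tiers with its TTL;
    an L2 hit is promoted into L1."""

    def __init__(self, origin_fetch, now=time.monotonic):
        self.l1, self.l2 = {}, {}
        self.origin_fetch = origin_fetch  # callable: url -> (body, ttl_seconds)
        self.now = now
        self.hits = self.misses = 0

    def _get(self, tier, url):
        entry = tier.get(url)               # entry is (body, expiry)
        if entry and entry[1] > self.now():  # still fresh
            return entry[0]
        tier.pop(url, None)                  # expired: drop it
        return None

    def get(self, url):
        for tier in (self.l1, self.l2):
            body = self._get(tier, url)
            if body is not None:
                self.hits += 1
                self.l1[url] = tier[url]     # promote L2 hits into L1
                return body
        self.misses += 1
        body, ttl = self.origin_fetch(url)   # cache miss: go to origin
        self.l1[url] = self.l2[url] = (body, self.now() + ttl)
        return body
```

The L2 tier is what keeps a burst of edge-side misses from translating into a burst of origin fetches: only the first regional miss reaches the origin.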
Cache invalidation: when origin content changes → purge stale cached content from all edge PoPs; support: (a) time-based expiration (TTL); (b) explicit purge (API call: purge URL / purge by tag / purge all); (c) stale-while-revalidate (serve stale content while fetching fresh in background); purge propagation to all 200+ PoPs within seconds
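Purge-by-tag needs a secondary index from tag to cached URLs, plus a control-plane fan-out to every PoP. A minimal sketch, with `EdgeCache` and `global_purge` as illustrative names; in production the fan-out is a pub/sub broadcast that must converge across all 200+ PoPs in under 5 seconds, not a synchronous loop.

```python
class EdgeCache:
    """Per-PoP cache keyed by URL, with a tag index for purge-by-tag."""

    def __init__(self):
        self.store = {}      # url -> body
        self.tag_index = {}  # tag -> set of urls carrying that tag

    def put(self, url, body, tags=()):
        self.store[url] = body
        for t in tags:
            self.tag_index.setdefault(t, set()).add(url)

    def purge_url(self, url):
        self.store.pop(url, None)

    def purge_tag(self, tag):
        for url in self.tag_index.pop(tag, set()):
            self.store.pop(url, None)

def global_purge(pops, tag):
    """Control plane: fan a purge out to every PoP. In a real CDN this is
    an async broadcast (gossip or pub/sub) racing a < 5s propagation SLO."""
    for pop in pops:
        pop.purge_tag(tag)
```

Time-based expiry and stale-while-revalidate layer on top of this: a purge is the explicit path, TTL the passive one.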
Edge computing / serverless at edge: run custom application logic AT the edge PoPs (not at origin); use cases: A/B testing (route users to variant at edge), authentication/authorisation (validate JWT at edge → reject unauthorised requests before they hit origin), request rewriting (URL rewrite, header modification), geo-based personalisation (show regional content/pricing)
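The A/B testing use case hinges on deterministic bucketing: the same user must land in the same variant on every PoP, with no shared state. One common approach (sketched here; the function and parameter names are illustrative) is to hash the user ID with the experiment name:

```python
import hashlib

def ab_variant(user_id, experiment, split=0.5):
    """Deterministic edge-side bucketing: the same (user, experiment) pair
    always maps to the same variant on every PoP, with ~`split` of users
    in variant 'b'. No coordination or shared state needed."""
    digest = hashlib.sha256(f"{experiment}:{user_id}".encode()).digest()
    bucket = int.from_bytes(digest[:8], "big") / 2**64  # uniform in [0, 1)
    return "b" if bucket < split else "a"
```

Because the hash includes the experiment name, buckets are independent across experiments; a user in variant "b" of one test is not biased toward "b" in another.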
Dynamic content acceleration: for non-cacheable dynamic requests → optimise the path from edge to origin; techniques: persistent connections between edge and origin (connection pooling, pre-warmed TLS), route optimisation (Argo-like smart routing — choose fastest network path, not shortest), TCP optimisation (congestion control tuning, TLS 1.3 0-RTT), request collapsing (coalesce identical concurrent requests to origin)
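Request collapsing can be sketched as a per-URL in-flight table: the first caller becomes the leader and performs the origin fetch, later identical requests block on the same result. A minimal thread-based sketch (class name and structure are illustrative; production edges do this with async I/O):

```python
import threading

class RequestCollapser:
    """Coalesce identical concurrent origin fetches: one fetch to origin,
    every concurrent caller for the same URL shares its result."""

    def __init__(self, fetch):
        self.fetch = fetch        # callable: url -> body
        self.lock = threading.Lock()
        self.inflight = {}        # url -> (done_event, result_box)

    def get(self, url):
        with self.lock:
            entry = self.inflight.get(url)
            leader = entry is None
            if leader:
                entry = (threading.Event(), [])
                self.inflight[url] = entry
        event, box = entry
        if leader:
            try:
                box.append(self.fetch(url))   # only the leader hits origin
            finally:
                with self.lock:
                    del self.inflight[url]
                event.set()                   # wake all waiters
        else:
            event.wait()                      # piggyback on leader's fetch
        return box[0]
```

This is the same idea that protects a live-streaming origin: thousands of viewers requesting the newest HLS segment collapse into a single origin fetch.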
Video / large file delivery: support streaming video (HLS/DASH — chunked delivery, adaptive bitrate); large file downloads (range requests, resumable); byte-range caching (cache individual chunks independently); prefetching next chunks based on playback position; video on demand (VoD) and live streaming support
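Byte-range caching works by aligning arbitrary Range requests to fixed-size chunks, so each chunk is fetched from origin at most once and can satisfy any overlapping future range. A sketch, assuming a dict-backed cache and a `fetch_chunk(url, start, end)` origin callable (both illustrative):

```python
def serve_range(url, start, end, cache, fetch_chunk, chunk=1 << 20):
    """Assemble an inclusive [start, end] byte-range response from
    chunk-aligned cache entries (default chunk: 1 MiB). Each chunk is
    fetched from origin at most once, then reused for any future range
    that overlaps it."""
    first, last = start // chunk, end // chunk
    body = b""
    for idx in range(first, last + 1):
        key = (url, idx)
        if key not in cache:
            # Fetch the whole aligned chunk, even if the request only
            # needs part of it -- that is what makes it reusable.
            cache[key] = fetch_chunk(url, idx * chunk, (idx + 1) * chunk - 1)
        body += cache[key]
    offset = start - first * chunk
    return body[offset:offset + (end - start + 1)]
```

Prefetching the next chunk based on playback position is the natural extension: while serving chunk `n`, warm `(url, n + 1)` in the background.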
DDoS protection and WAF: edge PoPs absorb DDoS attacks (volumetric, SYN flood) by distributing traffic across the global network; Web Application Firewall (WAF) at the edge — inspect HTTP requests, block SQL injection / XSS / bot traffic before it reaches origin; rate limiting per IP/region
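Per-IP rate limiting at the edge is commonly a token bucket: a steady refill rate with a bounded burst allowance. A minimal sketch (the class and the injectable `now` clock are illustrative; real edges shard these buckets per PoP and often sync coarse counts globally):

```python
import time

class TokenBucket:
    """Per-client token bucket: refills at `rate` tokens/sec, holds at
    most `capacity` tokens, so clients can burst briefly but not sustain
    more than `rate` requests/sec."""

    def __init__(self, rate, capacity, now=time.monotonic):
        self.rate, self.capacity, self.now = rate, capacity, now
        self.tokens, self.last = capacity, now()

    def allow(self):
        t = self.now()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (t - self.last) * self.rate)
        self.last = t
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False  # over limit: reject (e.g. HTTP 429) at the edge
```

Volumetric DDoS absorption is different in kind: it relies on Anycast spreading attack traffic across the whole 200+ Tbps network so no single PoP saturates, with the WAF and rate limiters handling the application-layer remainder.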
TLS termination and certificate management: terminate TLS at the edge (reduce TLS handshake latency — user negotiates with nearby edge, not distant origin); manage SSL certificates (auto-provisioning via Let's Encrypt, custom certificates); support TLS 1.3, HTTP/2, HTTP/3 (QUIC); edge-to-origin connection can be separate TLS or plaintext internal
Analytics and observability: real-time analytics per PoP: cache hit ratio, bandwidth served, request count, latency percentiles, error rates, top URLs; per-customer dashboards; access logs (streaming to customer's storage — S3/BigQuery); alerting on origin errors, cache hit ratio drops, latency spikes; global traffic map visualisation
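Two of the headline metrics above reduce to simple computations over per-PoP counters and latency samples. A sketch using nearest-rank percentiles (helper names are illustrative; real pipelines use streaming sketches like t-digest rather than sorting raw samples):

```python
import math

def percentile(samples, p):
    """Nearest-rank percentile: p=0.99 returns the P99 value."""
    s = sorted(samples)
    k = max(0, math.ceil(p * len(s)) - 1)
    return s[k]

def cache_hit_ratio(hits, misses):
    """Fraction of requests served from edge cache (target: > 0.90)."""
    total = hits + misses
    return hits / total if total else 0.0
```

Alerting then becomes threshold checks on these values per PoP, e.g. page when `cache_hit_ratio` dips below 0.90 or P99 TTFB for cached content exceeds 50ms.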
Non-functional requirements define the system qualities critical to your users. Frame them as 'The system should be able to...' statements. These will guide your deep dives later.
Think about CAP theorem trade-offs, scalability limits, latency targets, durability guarantees, security requirements, fault tolerance, and compliance needs.
Frame NFRs for this specific system. 'TTFB under 50ms for cached content' is far more valuable than just 'low latency'.
Add concrete numbers: 'P99 response time < 500ms', '99.9% availability', '10M DAU'. This drives architectural decisions.
Choose the 3-5 most critical NFRs. Every system should be 'scalable', but what makes THIS system's scaling uniquely challenging?