The question is never "edge or origin?"—it's "what belongs where?"
Every distributed system must partition workloads across the available compute topology. In edge-enabled architectures, this topology spans from user devices through edge locations to centralized cloud. The art of edge architecture is drawing the right lines—placing each computation at the location that optimizes for the constraints that matter most for that specific operation.
This page develops the decision framework for workload placement, explores hybrid architecture patterns that combine edge and origin processing, and addresses the practical challenges of operating across the cloud-to-edge continuum. By the end, you'll be equipped to partition any application's workloads across the distributed topology effectively.
Specifically, you will learn the fundamental trade-offs that determine optimal workload placement, the patterns for hybrid edge-origin architectures, techniques for workload partitioning and migration, consistency and coordination strategies across distributed locations, and operational considerations for hybrid systems.
Workload placement decisions navigate a multi-dimensional trade-off space. Understanding these dimensions is a prerequisite for making informed placement decisions.
| Factor | Edge-Favoring | Origin-Favoring |
|---|---|---|
| Latency Need | Sub-50ms required | Seconds acceptable |
| Data Volume | High (reduce transmission) | Low (send to origin) |
| Computation Complexity | Simple transforms/filters | Heavy ML, complex logic |
| State Requirements | Stateless or local | Global coordination needed |
| Service Integrations | Self-contained | Multiple cloud services |
| Consistency Model | Eventual acceptable | Strong required |
| Failure Tolerance | Must work offline | Cloud connectivity required |
| Update Frequency | Stable logic | Rapidly changing requirements |
When in doubt, start with this heuristic: Place workloads at edge only when latency constraints require it OR when bandwidth reduction provides clear economic benefit. Default to origin for everything else—it's operationally simpler. Move workloads to edge incrementally as you prove the value.
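To make the heuristic concrete, here is a minimal sketch of how a team might turn the factors in the table into a first-pass placement check. The factor names, weights, and threshold are assumptions for illustration, not a prescribed formula.

```typescript
// Illustrative sketch: score a workload against the placement factors above.
// The weights and threshold are assumptions for demonstration only.

interface WorkloadProfile {
  latencyBudgetMs: number;          // end-to-end latency the operation must meet
  reducesEgressBandwidth: boolean;  // does edge processing shrink data sent to origin?
  needsGlobalState: boolean;        // requires coordination across all locations
  needsStrongConsistency: boolean;
  cloudServiceDependencies: number; // count of origin-side services it calls
}

function preferEdge(w: WorkloadProfile): boolean {
  // Start from the default: origin, because it is operationally simpler.
  let edgeScore = 0;
  if (w.latencyBudgetMs <= 50) edgeScore += 2;   // latency-critical
  if (w.reducesEgressBandwidth) edgeScore += 1;  // clear economic benefit
  if (w.needsGlobalState) edgeScore -= 2;        // favors origin
  if (w.needsStrongConsistency) edgeScore -= 2;  // favors origin
  edgeScore -= w.cloudServiceDependencies;       // each dependency pulls toward origin

  // Only move to the edge when the case is clearly positive.
  return edgeScore >= 2;
}

// Example: a latency-sensitive lookup with no global state or origin dependencies.
console.log(preferEdge({
  latencyBudgetMs: 30,
  reducesEgressBandwidth: true,
  needsGlobalState: false,
  needsStrongConsistency: false,
  cloudServiceDependencies: 0,
})); // true
```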
To reason about placement, visualize the complete request journey from user to origin and back. Each point in this journey is a potential processing location with different characteristics.
The Request Journey Stages:
User Device → ISP Network → Edge PoP → Internet Backbone → Origin Region → Application Servers → Database

| Stage | Cumulative latency from user |
|---|---|
| User Device | 1 ms |
| ISP Network | 5 ms |
| Edge PoP | 10 ms |
| Internet Backbone | 100 ms |
| Origin Region | 200 ms |
| Application Servers | 210 ms |
| Database | 230 ms |
At each stage, we can intercept, process, cache, or forward the request. The closer to the user we can satisfy the request, the lower the latency—but the fewer resources and capabilities available.
Request Journey Decision Points:
For each incoming request, the system decides at each layer: process here, forward, or reject?
Most optimization should focus on the hot path—the request patterns that represent 80%+ of traffic. Typically this is content serving, not transactions. If you can satisfy 90% of requests at the edge, the 10% that must reach origin can tolerate higher latency without impacting overall user experience.
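As an illustration of that per-layer decision, the sketch below shows an edge handler that rejects, serves from a local cache, or forwards to origin. The routes, TTL, in-memory cache, and the `origin.example.com` host are hypothetical assumptions, not a canonical implementation.

```typescript
// Minimal sketch of the per-request decision at the edge layer: reject,
// process here (cache hit), or forward to origin. Illustrative only.

const ORIGIN = "https://origin.example.com"; // hypothetical origin host
const edgeCache = new Map<string, { body: string; expiresAt: number }>();

async function handleAtEdge(request: Request): Promise<Response> {
  const url = new URL(request.url);

  // Reject: traffic that should never consume origin resources.
  if (url.pathname.startsWith("/admin") && !request.headers.has("Authorization")) {
    return new Response("Forbidden", { status: 403 });
  }

  // Process here: the hot path (content reads) served from the local cache.
  if (request.method === "GET") {
    const hit = edgeCache.get(url.pathname);
    if (hit && hit.expiresAt > Date.now()) {
      return new Response(hit.body, { headers: { "X-Served-By": "edge-cache" } });
    }
  }

  // Forward: transactions and cache misses go to origin.
  // (Request-body forwarding is omitted to keep the sketch short.)
  const originResponse = await fetch(ORIGIN + url.pathname + url.search, {
    method: request.method,
    headers: request.headers,
  });
  if (request.method === "GET" && originResponse.ok) {
    edgeCache.set(url.pathname, {
      body: await originResponse.clone().text(),
      expiresAt: Date.now() + 30_000, // short TTL; tune per content type
    });
  }
  return originResponse;
}
```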
Production edge deployments rarely put everything at the edge or everything at the origin. Instead, they employ hybrid patterns that distribute responsibilities between the two tiers appropriately.
In hybrid architectures, data flows between edge and origin. How you manage this flow determines consistency, latency, and system complexity. Here are the key synchronization strategies:
| Requirement | Recommended Strategy | Trade-offs |
|---|---|---|
| Read latency critical, stale OK | Push replication + periodic sync | Origin compute for change detection |
| Read consistency critical | Pull-through with short TTL | Cold request latency |
| Write latency critical | Write-behind async | Risk of data loss, conflict handling |
| Write consistency critical | Write-through synchronous | Write latency includes origin RTT |
| Offline operation needed | Local-first with CRDT merge | Conflict resolution complexity |
| High write volume | Batch write-behind | Higher eventual inconsistency window |
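For example, the batch write-behind row might look roughly like the sketch below: accept writes locally, then flush them to origin asynchronously. The endpoint, batch handling, and flush interval are assumptions for illustration.

```typescript
// Sketch of batch write-behind: acknowledge writes at the edge immediately,
// flush them to origin in the background. Note the trade-offs from the table:
// a crash before flush can lose buffered writes, and other locations see the
// data only after the flush completes.

type EdgeWrite = { key: string; value: unknown; at: number };

const ORIGIN_BATCH_ENDPOINT = "https://origin.example.com/batch-writes"; // hypothetical
const buffer: EdgeWrite[] = [];

export function acceptWrite(key: string, value: unknown): void {
  // Acknowledge locally without waiting on the origin round trip.
  buffer.push({ key, value, at: Date.now() });
}

async function flush(): Promise<void> {
  if (buffer.length === 0) return;
  const batch = buffer.splice(0, buffer.length); // take everything queued so far
  const res = await fetch(ORIGIN_BATCH_ENDPOINT, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(batch),
  });
  if (!res.ok) {
    // Re-queue on failure; a real system also needs backoff and a size cap.
    buffer.unshift(...batch);
  }
}

// Flush every few seconds; the interval bounds the inconsistency window.
setInterval(() => { void flush(); }, 5_000);
```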
When edge and origin can both modify state, conflicts are inevitable during network partitions or concurrent access. Design for this from the start: use CRDTs for automatic merge where possible, last-writer-wins for simple cases, or application-specific merge logic for complex domains. Conflicts that reach production without a resolution strategy cause data corruption or loss.
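As a minimal example of the simplest option, a last-writer-wins register merges concurrent updates deterministically. The timestamp-plus-location tie-break below is an illustrative assumption and glosses over clock skew, which real deployments must handle.

```typescript
// Minimal last-writer-wins (LWW) register sketch: the later timestamp wins,
// with the location ID breaking ties so every location converges on the same
// value. Illustration only, not a full CRDT library.

interface LwwRegister<T> {
  value: T;
  timestamp: number;   // when the write happened (ms since epoch)
  locationId: string;  // which edge location or origin wrote it
}

function mergeLww<T>(a: LwwRegister<T>, b: LwwRegister<T>): LwwRegister<T> {
  if (a.timestamp !== b.timestamp) {
    return a.timestamp > b.timestamp ? a : b;
  }
  // Equal timestamps: break the tie deterministically so all locations agree.
  return a.locationId > b.locationId ? a : b;
}

// Example: the same profile field updated concurrently at two locations.
const fromEdge: LwwRegister<string>   = { value: "dark",  timestamp: 1700000000500, locationId: "edge-fra" };
const fromOrigin: LwwRegister<string> = { value: "light", timestamp: 1700000000200, locationId: "origin-us" };

console.log(mergeLww(fromEdge, fromOrigin).value); // "dark": the later write wins
```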
Given an existing application, how do you systematically identify what belongs at edge versus origin? This methodology provides a structured approach:
Step 1: Request Inventory
Catalog all request types in your application.
Step 2: Latency Impact Analysis
For each request type, quantify the impact of latency on the user experience.
Step 3: Dependency Mapping
For each edge candidate, analyze what data, state, and services it needs.
Step 4: Partition Design
Group requests into processing tiers.
Step 5: Migration Sequencing
Prioritize edge migration by impact.
Don't attempt to edge-optimize everything at once. Each migration step should be independently measurable and reversible. Deploy edge logic behind feature flags, measure impact, and roll back if issues arise. Accumulate edge capabilities progressively rather than attempting a big-bang migration.
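One way to keep each step measurable and reversible is to gate the new edge path behind a flag with a percentage rollout, as in the sketch below. The flag, hashing scheme, and handler names are hypothetical.

```typescript
// Sketch of edge logic gated behind a feature flag with a percentage rollout.
// The point is that the edge path can be dialed up, measured, and rolled back
// without a redeploy. All names and values are illustrative.

const EDGE_PERSONALIZATION_ROLLOUT = 10; // percent of users on the new edge path

function inRollout(userId: string, percent: number): boolean {
  // Stable hash so a given user consistently gets the same path.
  let hash = 0;
  for (const ch of userId) hash = (hash * 31 + ch.charCodeAt(0)) >>> 0;
  return hash % 100 < percent;
}

async function handle(userId: string, request: Request): Promise<Response> {
  if (inRollout(userId, EDGE_PERSONALIZATION_ROLLOUT)) {
    return personalizeAtEdge(request); // new path: measure latency and error rate
  }
  return forwardToOrigin(request);     // old path: unchanged behavior
}

// Placeholders for the two paths; real implementations live elsewhere.
async function personalizeAtEdge(req: Request): Promise<Response> {
  return new Response("personalized at edge");
}
async function forwardToOrigin(req: Request): Promise<Response> {
  return fetch("https://origin.example.com" + new URL(req.url).pathname);
}
```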
Edge computing forces explicit decisions about consistency. When data is replicated to hundreds of locations, strong consistency becomes expensive or impossible. Understanding consistency models helps you choose appropriately for each use case.
| Use Case | Recommended Model | Rationale |
|---|---|---|
| Inventory for add-to-cart | Bounded staleness (5 min) | Stale OK for display; verify at checkout |
| Payment processing | Strong consistency | Cannot double-charge or oversell |
| Feature flags | Eventual consistency | Seconds of staleness acceptable |
| User session/cart | Read-your-writes | User's changes visible immediately to them |
| Collaborative document | Causal consistency | Edits must be ordered correctly |
| Content personalization | Eventual (30 second) | Near-real-time personalization sufficient |
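To make the first row concrete, the sketch below serves stock status from a bounded-staleness cache for display but always re-verifies against origin at checkout. The endpoints, the five-minute bound, and the payload shapes are assumptions for illustration.

```typescript
// Sketch of bounded staleness for display plus strong verification at checkout.
// Illustrative only; endpoints and data shapes are assumed.

const STALENESS_BOUND_MS = 5 * 60 * 1000;
const stockCache = new Map<string, { inStock: boolean; fetchedAt: number }>();

// Display path: a stale answer within the bound is acceptable.
async function isInStockForDisplay(sku: string): Promise<boolean> {
  const cached = stockCache.get(sku);
  if (cached && Date.now() - cached.fetchedAt < STALENESS_BOUND_MS) {
    return cached.inStock;
  }
  const res = await fetch(`https://origin.example.com/inventory/${sku}`);
  const { inStock } = (await res.json()) as { inStock: boolean };
  stockCache.set(sku, { inStock, fetchedAt: Date.now() });
  return inStock;
}

// Checkout path: always an authoritative, strongly consistent check at origin.
async function reserveAtCheckout(sku: string, quantity: number): Promise<boolean> {
  const res = await fetch("https://origin.example.com/inventory/reserve", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ sku, quantity }),
  });
  return res.ok; // origin enforces that we cannot oversell
}
```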
Edge computing is inherently a distributed system across unreliable networks. The CAP theorem applies: during network partitions, you must choose between consistency and availability. Most edge systems choose availability (continue serving from cache) and accept bounded inconsistency. Design your edge layer to degrade gracefully, not fail completely, when origin connectivity is lost.
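A minimal sketch of choosing availability over consistency: try origin with a short timeout, and fall back to the last known value rather than failing outright. The timeout, cache shape, and header are illustrative assumptions.

```typescript
// Sketch of graceful degradation during a partition: prefer fresh data from
// origin, but serve the last known value (marked as stale) when origin is
// unreachable. Illustration only.

const lastKnown = new Map<string, { body: string; fetchedAt: number }>();

async function readWithDegradation(path: string): Promise<Response> {
  try {
    const res = await fetch("https://origin.example.com" + path, {
      signal: AbortSignal.timeout(500), // don't let a partition stall the user
    });
    if (!res.ok) throw new Error(`origin returned ${res.status}`);
    const body = await res.text();
    lastKnown.set(path, { body, fetchedAt: Date.now() });
    return new Response(body);
  } catch {
    const stale = lastKnown.get(path);
    if (stale) {
      // Degrade gracefully: bounded inconsistency instead of an error page.
      return new Response(stale.body, { headers: { "X-Edge-Stale": "true" } });
    }
    return new Response("Service temporarily unavailable", { status: 503 });
  }
}
```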
Operating edge-origin hybrid systems introduces unique challenges not present in purely centralized architectures, and teams must adapt their operational practices accordingly.
Edge introduces failure modes that don't exist in centralized systems: edge-specific outages, edge-origin connectivity loss, inconsistent edge state. Conduct chaos engineering at the edge: disconnect edge locations, inject latency between edge and origin, corrupt edge cache. Verify graceful degradation before production incidents reveal gaps.
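One lightweight way to rehearse these failure modes is to wrap edge-to-origin calls with configurable faults in a test environment, as sketched below. The wrapper, probabilities, and delay are assumptions, not a specific chaos tool.

```typescript
// Sketch of simple fault injection for edge-to-origin calls in a test
// environment: randomly drop or delay requests to verify the edge layer
// degrades gracefully. Values are illustrative.

interface FaultConfig {
  dropRate: number;       // fraction of calls that fail outright (0..1)
  extraLatencyMs: number; // added delay to simulate a congested backbone
}

function withFaults(config: FaultConfig, realFetch = fetch): typeof fetch {
  return (async (input: RequestInfo | URL, init?: RequestInit) => {
    if (Math.random() < config.dropRate) {
      throw new TypeError("injected fault: edge-origin connectivity lost");
    }
    await new Promise((resolve) => setTimeout(resolve, config.extraLatencyMs));
    return realFetch(input, init);
  }) as typeof fetch;
}

// Example: 20% of origin calls fail and the rest gain 300 ms of latency.
const chaoticFetch = withFaults({ dropRate: 0.2, extraLatencyMs: 300 });
// Pass `chaoticFetch` into the edge handler under test and assert that it
// still returns cached or degraded responses instead of errors.
```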
We've developed the framework for partitioning workloads between edge and origin—from the fundamental trade-offs to the practical operational considerations.
What's Next:
The final page of this module addresses edge data challenges—the unique difficulties of managing state at the edge, from caching invalidation to data sovereignty, and the architectural patterns that address these challenges.
You now have a comprehensive framework for edge vs. origin workload placement. You understand the trade-off dimensions, architectural patterns, synchronization strategies, and operational considerations for hybrid edge-origin systems. Next, we'll address the specific challenges of managing data at the edge.