Design a Distributed Task Scheduler

Design a distributed task scheduling system (like Airflow, Celery, or Temporal) that reliably schedules and executes tasks at specified times across a fleet of workers. The system supports one-time tasks, recurring cron jobs, task DAGs (workflows), priority queues, retry with backoff, and guarantees at-least-once execution even under node failures.

Scale Estimates

Metric	Value
Tasks scheduled per day	100 million
Peak task submissions per second	10,000
Concurrent running tasks	500,000
Worker nodes	1,000+
Cron jobs registered	5 million
DAG workflows	100,000
Average task duration	5 seconds (range: 100ms – 1 hour)
Scheduling accuracy	≤ 1 second from scheduled time
Task state queries per second	50,000
Retention of execution history	30 days

Non-Functional Requirements

Reliability: At-least-once execution guaranteed; no task silently lost; tasks survive scheduler crashes, worker crashes, and network partitions; durable storage in PostgreSQL
Accuracy: Tasks trigger within 1 second of scheduled time; hybrid DB polling + in-memory timing for precision
Scalability: Horizontal scaling of workers and schedulers; adding workers linearly increases execution capacity; partition-based scheduling for high volume
Fault tolerance: No single point of failure; HA schedulers (active-active with SKIP LOCKED or leader election); heartbeat-based worker failure detection; automatic task reclamation
Observability: Full execution history, stdout/stderr logs, metrics (throughput, latency, queue depth), alerting, dashboard with DAG timeline view
Fairness: Multi-tenant quotas, rate limiting per queue/tenant, priority scheduling; prevent starvation of low-priority tasks

Scale Estimates

Metric

Value

Tasks scheduled per day

100 million

Peak task submissions per second

10,000

Concurrent running tasks

500,000

Worker nodes

1,000+

Cron jobs registered

5 million

DAG workflows

100,000

Average task duration

5 seconds (range: 100ms – 1 hour)

Scheduling accuracy

≤ 1 second from scheduled time

Task state queries per second

50,000

Retention of execution history

30 days

Non-Functional Requirements

Reliability: At-least-once execution guaranteed; no task silently lost; tasks survive scheduler crashes, worker crashes, and network partitions; durable storage in PostgreSQL

Accuracy: Tasks trigger within 1 second of scheduled time; hybrid DB polling + in-memory timing for precision

Scalability: Horizontal scaling of workers and schedulers; adding workers linearly increases execution capacity; partition-based scheduling for high volume

Fault tolerance: No single point of failure; HA schedulers (active-active with SKIP LOCKED or leader election); heartbeat-based worker failure detection; automatic task reclamation

Observability: Full execution history, stdout/stderr logs, metrics (throughput, latency, queue depth), alerting, dashboard with DAG timeline view

Fairness: Multi-tenant quotas, rate limiting per queue/tenant, priority scheduling; prevent starvation of low-priority tasks

Scale Estimates

Non-Functional Requirements

Functional Requirements

Approach Guide(Click to expand each section)

Follow-up Deep Dives(Questions an interviewer might ask)

Design a Distributed Task Scheduler

Scale Estimates

Non-Functional Requirements

Functional Requirements

Approach Guide(Click to expand each section)

Follow-up Deep Dives(Questions an interviewer might ask)

Design a Distributed Task Scheduler

Scale Estimates

Non-Functional Requirements

Functional Requirements

Approach Guide(Click to expand each section)

Non-Functional Requirements~3 min

Core Entities~2 min

API Design~3 min

High-Level Design~5 min

Follow-up Deep Dives(Questions an interviewer might ask)

1How would you design the core scheduling mechanism to trigger tasks at the right time?

2How would you ensure no task is missed and no task executes twice (at-least-once / exactly-once)?

3How would you design the worker fleet for distributed task execution?

4How would you implement cron / recurring task scheduling?

5How would you implement task DAGs (workflow orchestration)?

6How would you handle scheduler high availability and leader election?

7How would you design observability, monitoring, and debugging for the scheduler?

Key Topics

Asked At

Design a Distributed Task Scheduler

Scale Estimates

Non-Functional Requirements

Functional Requirements

Approach Guide(Click to expand each section)

Non-Functional Requirements~3 min

Core Entities~2 min

API Design~3 min

High-Level Design~5 min

Follow-up Deep Dives(Questions an interviewer might ask)

1How would you design the core scheduling mechanism to trigger tasks at the right time?

2How would you ensure no task is missed and no task executes twice (at-least-once / exactly-once)?

3How would you design the worker fleet for distributed task execution?

4How would you implement cron / recurring task scheduling?

5How would you implement task DAGs (workflow orchestration)?

6How would you handle scheduler high availability and leader election?

7How would you design observability, monitoring, and debugging for the scheduler?

Key Topics

Asked At