Imagine a high-frequency trading platform where a 10-millisecond delay costs millions of dollars. Consider an autonomous vehicle navigation system where a 100-millisecond lag in processing sensor data could mean the difference between a safe lane change and a catastrophic collision. Think about a multiplayer gaming server where 50 milliseconds of additional latency transforms a responsive, immersive experience into an unplayable, frustrating mess.
These aren't hypothetical scenarios—they represent the daily reality of real-time system engineering.
In traditional distributed systems, we optimize for throughput, availability, and eventual consistency. We accept that operations take "as long as they take" and design around asynchronous processing, retries, and graceful degradation. But real-time systems operate under an entirely different paradigm: time itself becomes a first-class constraint.
This page establishes the foundational understanding of what makes a system "real-time," exploring the formal requirements, mathematical models, and engineering constraints that separate real-time architectures from conventional distributed systems.
By the end of this page, you will understand the formal definition of real-time systems, recognize the key characteristics that distinguish them from traditional systems, grasp the mathematical foundations of timing constraints, and appreciate why temporal correctness is as critical as functional correctness in these domains.
The term "real-time" is perhaps one of the most misunderstood concepts in software engineering. Many developers equate "real-time" with "fast" or "interactive," but this conflation obscures the true meaning and engineering implications of real-time computing.
The formal definition:
A real-time system is a system in which the correctness of the computation depends not only on the logical correctness of the output but also on the time at which the output is produced.
This definition, established by decades of computer science research, captures the essential distinction: in real-time systems, a correct answer delivered too late is a wrong answer.
Consider the contrast with conventional systems:
| Aspect | Conventional Systems | Real-Time Systems |
|---|---|---|
| Correctness criterion | Logical output correctness | Logical correctness + temporal correctness |
| Timing treatment | Best-effort, optimize for average case | Guaranteed bounds, worst-case analysis |
| Failure definition | Wrong output or crash | Wrong output, crash, OR late output |
| Design focus | Throughput, scalability, availability | Predictability, determinism, bounded latency |
| Resource allocation | Dynamic, on-demand | Pre-allocated, statically analyzed |
| Performance metric | Average latency, percentiles | Worst-case execution time (WCET) |
Understanding temporal correctness:
Temporal correctness introduces a dimension that most software engineers rarely consider explicitly. When we say a system must respond "in time," we're making a statement that can be formalized mathematically: for every request i, the response time Ri (completion time minus arrival time) must satisfy Ri ≤ Di, where Di is that request's deadline.
This seemingly simple constraint—"respond before the deadline"—has profound implications for system design, resource allocation, scheduling algorithms, and failure handling.
In real-time systems, missing a deadline isn't just a performance degradation—it can constitute a system failure. Unlike web applications where a slow response is merely inconvenient, real-time systems treat deadline violations with the same severity as logical errors or crashes. Your system architecture must be designed around this constraint from the ground up.
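The "late answer is a wrong answer" principle can be made concrete with a small sketch. This is illustrative Python (the names `DeadlineMissed` and `run_with_deadline` are invented for this example, and Python itself is not a hard real-time language): the point is that a deadline miss surfaces as an error, not as a slow success.

```python
import time

class DeadlineMissed(Exception):
    """A result produced after its deadline is treated as a failure."""

def run_with_deadline(task, deadline_s):
    """Run task(); raise if the result arrives after deadline_s seconds.

    In a real-time system a late result is as wrong as an incorrect one,
    so the caller sees a deadline miss as an error, not a slow success.
    """
    start = time.monotonic()
    result = task()
    elapsed = time.monotonic() - start
    if elapsed > deadline_s:
        raise DeadlineMissed(
            f"took {elapsed * 1000:.2f}ms, deadline {deadline_s * 1000:.0f}ms")
    return result

# A fast computation meets a generous deadline and returns normally.
assert run_with_deadline(lambda: 2 + 2, deadline_s=1.0) == 4
```

Note the limitation: this wrapper only *detects* a miss after the fact. A true real-time system must guarantee the bound a priori, which is what the rest of this page builds toward.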
Real-time systems are characterized by specific types of timing constraints that govern their behavior. Understanding these constraint types is essential for proper system design and analysis.
Types of timing constraints: deadlines (a latest allowed completion time), periodic constraints (a task must run at a fixed rate), jitter bounds (a limit on variation in response timing), and end-to-end constraints that span multiple components.
The timing constraint hierarchy:
Real-time systems typically involve multiple concurrent activities, each with its own timing constraints. These constraints form a hierarchy that the system must satisfy simultaneously:
```
System-level deadline (e.g., end-to-end latency < 50ms)
├── Subsystem A deadline (e.g., sensor processing < 15ms)
│   ├── Task A1 deadline (5ms)
│   └── Task A2 deadline (10ms)
├── Subsystem B deadline (e.g., computation < 20ms)
│   ├── Task B1 deadline (8ms)
│   └── Task B2 deadline (12ms)
└── Subsystem C deadline (e.g., actuator control < 15ms)
    └── Task C1 deadline (15ms)
```
The challenge lies in ensuring that meeting individual task deadlines also satisfies subsystem and system-level timing requirements, accounting for communication delays, resource contention, and scheduling overhead.
In practice, system designers create latency budgets that allocate portions of the end-to-end deadline to individual components. This budgeting process requires deep understanding of component behavior, worst-case execution times, and communication overheads. Overestimating component latencies leads to overprovisioned systems; underestimating leads to deadline violations.
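A latency budget like the hierarchy above can be checked mechanically. The sketch below (illustrative Python; it assumes tasks within a subsystem run sequentially so their worst-case times add, and it folds communication and scheduling overhead into whatever slack remains) validates that task budgets fit their subsystem and subsystem budgets fit the end-to-end deadline.

```python
# Budget figures taken from the example hierarchy; all values in ms.
# Assumption: tasks within a subsystem execute sequentially, so WCETs add.
budget = {
    "sensor processing": {"deadline": 15, "tasks": {"A1": 5, "A2": 10}},
    "computation":       {"deadline": 20, "tasks": {"B1": 8, "B2": 12}},
    "actuator control":  {"deadline": 15, "tasks": {"C1": 15}},
}
END_TO_END_MS = 50

def validate_budget(budget, end_to_end_ms):
    """Check task budgets against subsystems, and subsystems end-to-end."""
    total = 0
    for name, sub in budget.items():
        task_sum = sum(sub["tasks"].values())
        if task_sum > sub["deadline"]:
            raise ValueError(
                f"{name}: tasks need {task_sum}ms > {sub['deadline']}ms budget")
        total += sub["deadline"]
    if total > end_to_end_ms:
        raise ValueError(
            f"subsystem budgets total {total}ms > {end_to_end_ms}ms end-to-end")
    return total

assert validate_budget(budget, END_TO_END_MS) == 50
```

Real budgeting is harder than this arithmetic suggests, because the WCET inputs themselves carry uncertainty, but automating the sanity check keeps a budget honest as components change.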
Two fundamental properties distinguish real-time systems from best-effort systems: determinism and predictability. While often used interchangeably, these concepts have distinct meanings in real-time computing.
Determinism:
A system is deterministic if, given the same initial state and the same inputs, it will always produce the same outputs in the same amount of time. Deterministic behavior means there is no randomness or unpredictable variation in system execution.
Predictability:
A system is predictable if its timing behavior can be analyzed and bounded before execution. A predictable system may not be strictly deterministic (there might be variation), but the bounds of that variation are known and can be guaranteed.
The relationship:
| Property | Description | Real-Time Requirement |
|---|---|---|
| Determinism | Same inputs → same timing, always | Ideal but often impractical |
| Predictability | Timing bounds are known a priori | Essential requirement |
| Bounded variation | Max variation from expected timing is limited | Required for jitter-sensitive applications |
| Analyzability | Timing can be mathematically proven | Required for safety-critical systems |
Sources of non-determinism in modern systems:
Achieving determinism in modern computing systems is challenging due to numerous sources of timing variability: hardware caches and branch predictors, interrupts and DMA transfers, operating system scheduling and preemption, virtual memory and page faults, garbage collection pauses, and dynamic CPU frequency scaling.
Real-time system designers don't eliminate non-determinism—they bound it. Techniques include disabling interrupts during critical sections, pinning processes to specific CPU cores, pre-allocating memory, using real-time operating systems (RTOS), and avoiding dynamic memory allocation during time-critical operations.
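One of those techniques, pre-allocation, can be sketched as follows. This is a pattern illustration in Python (the `BufferPool` class is invented for this example, and CPython's interpreter still allocates internally, so a production system would use this pattern in C, C++, Rust, or Ada): all memory is acquired up front, so the hot path never touches the allocator.

```python
from collections import deque

class BufferPool:
    """Pre-allocated buffer pool: no allocation happens on the hot path.

    All buffers are created up front, so acquire/release run in bounded
    time regardless of heap state. Sizing the pool for worst-case demand
    is part of the static analysis a real-time design requires.
    """
    def __init__(self, count, size):
        self._free = deque(bytearray(size) for _ in range(count))

    def acquire(self):
        if not self._free:
            raise RuntimeError("pool exhausted: size it for worst-case demand")
        return self._free.popleft()  # O(1), no new allocation

    def release(self, buf):
        self._free.append(buf)       # O(1)

pool = BufferPool(count=8, size=4096)
buf = pool.acquire()   # bounded-time, allocation-free on the hot path
pool.release(buf)
```

Note the design trade: exhaustion becomes an explicit, immediate error instead of a silent allocation that might block or trigger garbage collection at an unpredictable moment.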
The cornerstone of real-time system analysis is Worst-Case Execution Time (WCET)—the maximum time a piece of code can take to execute under any possible input and system state.
Why WCET matters:
In conventional systems, we often focus on average-case or typical-case performance. We might say "this operation typically takes 5ms" and consider optimization successful if we reduce the average. But in real-time systems, it's the worst case that determines whether deadlines are met: a task that averages 1ms but takes 15ms in the worst case cannot be guaranteed against a 10ms deadline, no matter how good its average looks.
Computing WCET:
WCET analysis uses two primary approaches: static analysis, which models the processor and derives a provable bound from the program's control flow without running it; and measurement-based analysis, which executes the code under stress conditions and records the maximum observed time, usually inflated by a safety margin.
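The measurement-based approach can be sketched in a few lines. This is illustrative Python (the function name and the default safety factor are assumptions for the example): unlike static analysis, a measurement gives only an *observed* maximum, so the result is an estimate, not a guarantee.

```python
import time

def observed_wcet(func, runs=1000, safety_factor=2.0):
    """Measurement-based WCET estimate: run func repeatedly, keep the
    maximum observed execution time, and apply a safety margin.

    The true worst case may never be triggered by the test inputs or
    machine state, hence the (assumed) multiplicative margin. Static
    analysis is required when a provable bound is needed.
    """
    worst = 0.0
    for _ in range(runs):
        start = time.perf_counter()
        func()
        elapsed = time.perf_counter() - start
        worst = max(worst, elapsed)
    return worst * safety_factor

estimate = observed_wcet(lambda: sorted(range(1000, 0, -1)))
```

In practice, measurement campaigns also vary inputs, cache state, and background load to push the observed maximum toward the true worst case.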
The WCET challenge in modern systems:
Modern processors are designed for average-case performance, not worst-case predictability. Features that improve average throughput often make WCET analysis harder:
| Feature | Benefit for Throughput | Challenge for WCET |
|---|---|---|
| Out-of-order execution | Better CPU utilization | Harder to analyze execution order |
| Speculative execution | Hides memory latency | Introduces timing variability |
| Multi-level caching | Faster memory access (on average) | Cache state affects timing dramatically |
| Dynamic frequency scaling | Power efficiency | CPU speed varies unpredictably |
| Simultaneous multithreading | Better core utilization | Threads interfere with each other |
For safety-critical real-time systems, architects may deliberately choose simpler processors with more predictable behavior, sacrificing average performance for timing guarantees.
In complex modern systems, the worst-case execution time can exceed the average-case time by a factor of 10 or more. A function that typically executes in 1ms might take 10-20ms in the worst case due to cache misses, page faults, or GC pauses. Design your systems with substantial headroom, especially when formal WCET analysis isn't feasible.
Once we know the WCET of individual tasks, the next question is: can all tasks meet their deadlines when running concurrently on shared resources? This is the domain of schedulability analysis.
The schedulability problem:
Given: a set of n tasks, each with a worst-case execution time Ci, a period (or minimum inter-arrival time) Ti, and a deadline Di; a scheduling algorithm; and the shared resources the tasks contend for.
Determine: Will all tasks always meet their deadlines?
Rate Monotonic Scheduling (RMS):
For periodic tasks with deadlines equal to their periods, Rate Monotonic Scheduling is optimal among fixed-priority algorithms. The classic schedulability test for RMS states that n tasks are guaranteed schedulable if:
U = Σ(Ci/Ti) ≤ n(2^(1/n) - 1)
Where U is the total CPU utilization, Ci is the worst-case execution time of task i, Ti is its period, and n is the number of tasks.
For large n, this bound approaches ln(2) ≈ 0.693, meaning you can guarantee schedulability with up to ~69% CPU utilization.
| Number of Tasks | Utilization Bound | Guaranteed If Utilization Below |
|---|---|---|
| 1 task | 100% | U ≤ 1.000 |
| 2 tasks | 82.8% | U ≤ 0.828 |
| 3 tasks | 78.0% | U ≤ 0.780 |
| 5 tasks | 74.3% | U ≤ 0.743 |
| 10 tasks | 71.8% | U ≤ 0.718 |
| ∞ tasks | 69.3% (ln 2) | U ≤ 0.693 |
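The RMS utilization test is simple enough to implement directly. The sketch below (in Python, with invented function names) computes the n(2^(1/n) − 1) bound and applies it to a task set of (Ci, Ti) pairs; note the test is sufficient but not necessary, so task sets above the bound may still be schedulable and need exact response-time analysis to decide.

```python
def rms_utilization_bound(n):
    """Liu & Layland bound: n periodic tasks (deadline = period) are
    schedulable under RMS if U = sum(Ci/Ti) <= n(2^(1/n) - 1)."""
    return n * (2 ** (1.0 / n) - 1)

def rms_schedulable(tasks):
    """tasks: list of (C, T) pairs (worst-case execution time, period).

    Returns True if the sufficient utilization test passes. Failing the
    test does not prove unschedulability; it means exact analysis is
    needed before the task set can be accepted.
    """
    u = sum(c / t for c, t in tasks)
    return u <= rms_utilization_bound(len(tasks))

# Three tasks at 20% + 25% + 30% = 75% utilization: under the 78.0%
# bound for n = 3, so RMS is guaranteed to meet every deadline.
assert rms_schedulable([(1, 5), (2.5, 10), (6, 20)])
```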
Earliest Deadline First (EDF):
EDF is a dynamic priority scheduling algorithm that can achieve 100% CPU utilization while guaranteeing all deadlines are met (if the task set is schedulable at all). Tasks are prioritized by their absolute deadline—the task whose deadline is nearest runs first.
EDF schedulability test (for periodic tasks where deadline = period):
U = Σ(Ci/Ti) ≤ 1
This is the theoretical optimum: if total utilization exceeds 100%, no scheduling algorithm can guarantee all deadlines.
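EDF's behavior can be checked by simulation as well as by the utilization test. The sketch below (illustrative Python; it assumes integer time units, deadlines equal to periods, and unit-step preemptive execution) simulates periodic tasks over one hyperperiod and reports whether any deadline is missed.

```python
import math
from heapq import heappush, heappop

def edf_simulate(tasks):
    """Simulate periodic tasks given as (C, T) pairs under preemptive EDF
    over one hyperperiod, in integer time units, with deadline = period.
    Returns True if no deadline is missed.
    """
    hyper = math.lcm(*(t for _, t in tasks))
    ready = []   # heap of [absolute_deadline, job_id, remaining_units]
    job_id = 0
    for now in range(hyper):
        for c, t in tasks:
            if now % t == 0:                  # new job released
                heappush(ready, [now + t, job_id, c])
                job_id += 1
        if ready:
            if ready[0][0] <= now:            # needs >= 1 more unit but
                return False                  # its deadline has arrived
            job = heappop(ready)              # earliest deadline runs
            job[2] -= 1                       # execute one time unit
            if job[2] > 0:
                heappush(ready, job)
    return not ready  # unfinished work at the hyperperiod boundary = miss

# U = 1/2 + 1/3 ≈ 0.83 <= 1: EDF meets every deadline.
assert edf_simulate([(1, 2), (1, 3)])
# U = 1/2 + 2/3 ≈ 1.17 > 1: no algorithm can, and the simulation agrees.
assert not edf_simulate([(1, 2), (2, 3)])
```

The simulation illustrates why EDF reaches the U ≤ 1 optimum: at full utilization it wastes no capacity, whereas the fixed priorities of RMS can idle the processor in ways that strand utilization above the Liu & Layland bound.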
Practical considerations:
Even when schedulability analysis says you can use 80% CPU utilization, prudent engineers target 50-60% in production. This headroom accommodates WCET estimation errors, unexpected load spikes, and future feature additions without requiring system redesign.
Real-time requirements affect every layer of the system stack. Unlike conventional systems where timing is a "nice to have," real-time systems require timing guarantees at each layer to compose into end-to-end guarantees.
The layer-by-layer challenge:
| Layer | Conventional Approach | Real-Time Approach |
|---|---|---|
| Hardware | Maximize average throughput | Predictable timing, disable dynamic features |
| Operating System | General-purpose, fairness-focused | RTOS with priority scheduling, bounded latency |
| Runtime/VM | JIT compilation, background GC | AOT compilation, predictable memory management |
| Language | Dynamic typing, runtime dispatch | Static types, compile-time decisions |
| Libraries | Optimize for common case | Bounded-time algorithms, no hidden allocations |
| Application | Handle errors via exceptions, retries | Fail-safe defaults, pre-validated inputs |
| Network | Best-effort delivery, congestion control | QoS guarantees, traffic shaping, bounded latency |
Real-Time Operating Systems (RTOS):
General-purpose operating systems like Linux, Windows, and macOS are designed for throughput and fairness, not real-time guarantees. They include features that make timing unpredictable: demand paging and page faults, fairness-oriented schedulers that can delay high-priority work, non-preemptible kernel sections, and unbounded priority inversion on kernel locks.
Real-Time Operating Systems (RTOS) like VxWorks, QNX, FreeRTOS, and RTEMS are specifically designed for timing predictability: they provide preemptive fixed-priority scheduling, bounded interrupt latency, priority inheritance to limit priority inversion, and deterministic, analyzable kernel services.
The Linux RT_PREEMPT patch:
For systems that need Linux compatibility with improved real-time behavior, the PREEMPT_RT patch set converts Linux into a real-time capable system by making most of the kernel preemptible, converting interrupt handlers into schedulable kernel threads, replacing many spinlocks with preemptible mutexes, and adding priority inheritance to kernel locking.
This provides "soft real-time" capabilities with latencies in the tens-to-hundreds of microseconds range, suitable for many industrial applications.
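You can get a feel for an OS's timer latency with a crude experiment, similar in spirit to the cyclictest tool commonly used to evaluate PREEMPT_RT kernels. This Python sketch (function name invented; a real measurement would use cyclictest on a pinned, high-priority thread) asks for a short sleep repeatedly and records the worst overshoot beyond the requested interval.

```python
import time

def measure_sleep_overshoot(interval_s=0.001, samples=200):
    """Request a short sleep repeatedly and record the worst overshoot.

    On a general-purpose OS under load, the worst observed overshoot can
    exceed the requested interval by orders of magnitude -- exactly the
    unpredictability that an RTOS or PREEMPT_RT kernel bounds.
    """
    worst = 0.0
    for _ in range(samples):
        start = time.monotonic()
        time.sleep(interval_s)
        overshoot = (time.monotonic() - start) - interval_s
        worst = max(worst, overshoot)
    return worst

worst = measure_sleep_overshoot()
print(f"worst sleep overshoot: {worst * 1e6:.0f} microseconds")
```

Run it on an idle desktop and again while compiling something large; the gap between the two worst cases is the variability a real-time kernel exists to eliminate.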
High-level languages with garbage collection (Java, Go, Python) are challenging for hard real-time systems due to GC pauses. Languages like C, C++, Rust, and Ada are preferred because they provide explicit memory control. Some domains use specialized real-time garbage collectors or regions-based memory management to enable high-level languages with predictable timing.
Before designing a real-time system, engineers must precisely quantify the timing requirements. Vague requirements like "the system should be responsive" are insufficient—real-time design requires specific, measurable constraints.
The requirements specification process: identify each timing-sensitive operation, derive its deadline from domain constraints (human perception, physical dynamics, protocol limits), allocate an end-to-end budget across components, and define how deadline violations are detected and handled.
Example: Video conferencing application requirements:
| Requirement | Value | Source |
|---|---|---|
| Audio capture latency | < 10ms | Human auditory perception; longer delays cause echo perception |
| Video capture latency | < 33ms | 30fps frame timing |
| Audio encoding latency | < 15ms | End-to-end budget allocation |
| Video encoding latency | < 40ms | End-to-end budget allocation |
| Network transmission | < 100ms one-way | Conversational quality threshold |
| End-to-end mouth-to-ear | < 150ms | ITU-T G.114 recommendation for acceptable conversation quality |
| Jitter budget | < 30ms | Audio buffer sizing, visible stutter threshold |
| Acceptable audio glitches | < 1% of 10-second windows | User experience quality target |
These specific values drive architecture decisions: buffer sizes, encoding algorithm selection, network protocol choice, and server placement strategy.
If stakeholders can't provide specific timing requirements, that's often a sign that the system isn't truly real-time—it's just "should be fast." Push back on vague requirements. Real-time systems require explicit deadline specifications because they fundamentally change how the system is designed, built, and validated.
We've established the foundational understanding of what makes a system "real-time." Let's consolidate the key concepts: correctness in real-time systems is both logical and temporal; timing constraints form a hierarchy from task deadlines up to end-to-end budgets; predictability (bounded, analyzable timing) matters more than raw speed; WCET, not average-case time, determines whether deadlines can be guaranteed; schedulability analysis (RMS, EDF) decides whether a task set can meet its deadlines; and real-time guarantees must hold at every layer of the stack, from hardware to application.
What's next:
Now that we understand what real-time requirements mean formally, the next page explores latency expectations in depth—examining how different domains require different levels of responsiveness, how latency is measured and characterized, and what latency budgets look like for real-world systems.
You now understand the formal definition of real-time systems and the key characteristics that distinguish them from conventional distributed systems. This foundation is essential for understanding the soft vs. hard real-time distinction, latency expectations, and the architectural patterns covered in subsequent pages.