System Design (HLD)What Is Real-Time?

Understanding Real-Time Systems

LevelIntermediate

Duration60 mins

TopicWhat Is Real-Time?

3 / 4

Soft vs Hard Real-Time

The Spectrum of Deadline Criticality

Not all deadlines are created equal. In some systems, missing a deadline is a minor inconvenience—a video frame drops, a game stutters momentarily, a notification arrives a second late. In other systems, missing a deadline can mean a crashed aircraft, a failed surgical robot, or a nuclear reactor malfunction.

The distinction between soft real-time and hard real-time systems is not merely academic—it fundamentally shapes every aspect of system design, from hardware selection to programming language choice to verification methodology. Understanding this distinction is essential for making appropriate engineering decisions: overengineering a soft real-time system wastes resources, while under-engineering a hard real-time system risks catastrophic failure.

This page provides a rigorous examination of the soft/hard real-time spectrum, exploring the formal definitions, practical implications, and the critical category of firm real-time that falls between these extremes.

What You Will Learn

By the end of this page, you will understand the formal definitions of soft, firm, and hard real-time systems, recognize how these classifications drive architectural decisions, appreciate the spectrum of failure consequences, and be able to correctly classify your own system requirements.

Formal Definitions

The real-time systems community has established precise definitions that distinguish system categories based on the consequences of deadline misses.

Hard Real-Time:

A hard real-time system is one in which missing a deadline constitutes a system failure. The correctness of the system depends absolutely on results being produced within specified deadlines.

In hard real-time systems, there is no concept of "late but acceptable." A result delivered after the deadline has zero value—or negative value if acting on stale data causes harm. The system must be designed and verified to guarantee deadline compliance under all specified operating conditions.

Soft Real-Time:

A soft real-time system is one in which missing deadlines degrades the quality of service but does not constitute system failure. The system remains useful even when deadlines are occasionally missed.

Soft real-time systems optimize for good average performance while accepting some variability. The value of a late result decreases gracefully rather than catastrophically—a 110ms response when 100ms was targeted is better than a 200ms response, which is better than a 500ms response.

Firm Real-Time:

A firm real-time system is one in which missing a deadline renders the result worthless, but does not cause system failure. Late results have no utility but also no negative consequences.

Firm real-time falls between soft and hard. Missing a deadline means the effort is wasted, but the system continues operating. This is distinct from hard real-time, where a missed deadline may cause damage, and from soft real-time, where a late result still has partial value.

Real-Time System Classification Comparison
Characteristic	Soft Real-Time	Firm Real-Time	Hard Real-Time
Deadline miss consequence	Degraded quality	Wasted effort	System failure
Value of late result	Partial (degrades gracefully)	Zero (worthless)	Negative (potentially harmful)
Tolerance for misses	Some percentage acceptable	Wasteful but survivable	Zero tolerance
Design approach	Best-effort optimization	High probability guarantee	Absolute guarantee
Verification requirement	Statistical testing	Probabilistic analysis	Formal proof / certification
Overload handling	Graceful degradation	Drop requests, continue	Prevent overload condition

The Classification Matters

Misclassifying your system leads to inappropriate architecture. Treating a soft real-time system as hard leads to overengineered, expensive solutions. Treating a hard real-time system as soft leads to systems that occasionally kill people. Be rigorous in determining which category your system actually falls into.

Hard Real-Time Systems: Guaranteed Deadlines

Hard real-time systems are characterized by the absolute requirement that deadlines must never be missed. The consequences of deadline violation are severe enough that no statistical argument ("we only miss 0.001% of deadlines") is acceptable.

Characteristics of hard real-time systems:

Hard Real-Time System Properties

•Deterministic execution environment — The worst-case execution time of every operation must be known and bounded.
•Formal verification — Often requires mathematical proof that deadlines will be met, not just testing.
•Static resource allocation — Resources are pre-allocated at design time, not dynamically at runtime.
•No dynamic memory allocation — malloc/new are prohibited in critical paths due to unbounded timing.
•Certified development processes — Standards like DO-178C (avionics), ISO 26262 (automotive) mandate rigorous development.
•Watchdog mechanisms — Hardware watchdogs detect deadline misses and trigger fail-safe responses.
•Redundancy for fault tolerance — Critical functions are replicated to prevent single points of failure.

Examples of hard real-time systems:

Domain	System	Typical Deadline	Consequence of Miss
Aerospace	Flight control surfaces	10-20ms	Aircraft becomes uncontrollable
Aerospace	Engine control (FADEC)	1-5ms	Engine damage or failure
Automotive	Airbag deployment	10ms	Airbag deploys too late to protect
Automotive	Anti-lock braking (ABS)	1-5ms	Wheels lock, vehicle skids
Medical	Cardiac pacemaker	Milliseconds	Heart rhythm disruption
Medical	Radiation therapy	Milliseconds	Incorrect dosage delivery
Industrial	Nuclear reactor control	Varies by system	Potential meltdown
Industrial	Robot arm control	1-10ms	Collision with humans/objects

The fundamental constraint:

In hard real-time systems, the design must ensure that even in the worst case—the longest possible execution path, the highest possible load, the most unfavorable timing of interrupts—every deadline is met. This requires:

WCET analysis of all code paths
Schedulability analysis proving all tasks can complete on time
Resource reservation guaranteeing CPU, memory, and I/O availability
Bounded interrupt latency from the operating system
Fail-safe defaults for any situation where deadlines might be missed

Hard Real-Time ≠ Fast

A hard real-time system with a 10-second deadline is still hard real-time. The defining characteristic is the guarantee, not the speed. An aircraft's pre-flight checklist might have a 30-minute deadline that is just as "hard" as a 10ms control loop deadline—both must be met, or the flight doesn't proceed safely.

Soft Real-Time Systems: Optimized Experience

Soft real-time systems are the most common type of real-time system in commercial software. They aim to meet deadlines most of the time but accept that occasional misses are tolerable.

Characteristics of soft real-time systems:

Soft Real-Time System Properties

•Statistical performance targets — SLOs specify percentiles (e.g., p99 < 100ms) rather than absolute guarantees.
•Graceful degradation — When overloaded, quality decreases smoothly rather than failing catastrophically.
•Dynamic resource allocation — Resources can be allocated on-demand based on current load.
•Standard development practices — No special certification required; standard testing and monitoring suffice.
•Adaptive algorithms — Systems can adjust quality/latency tradeoffs based on conditions.
•User experience focus — Design targets human perception thresholds, not safety requirements.
•Economic consequences — Deadline misses cause business impact (churn, lost revenue) rather than safety impact.

Examples of soft real-time systems:

Domain	System	Target Latency	Consequence of Miss
Web	E-commerce checkout	< 1 second	Cart abandonment, lost sale
Web	Search results	< 200ms	User frustration, competitor switch
Media	Video streaming	< 100ms jitter	Buffering, quality reduction
Media	Live streaming	< 3 seconds glass-to-glass	Viewer disengagement
Communication	Video conferencing	< 150ms one-way	Awkward conversation
Gaming	Multiplayer games	< 100ms round-trip	Competitive disadvantage
Productivity	Document collaboration	< 500ms sync	Minor collaboration friction
Mobile	App responsiveness	< 100ms to feedback	Perceived sluggishness

The value function of lateness:

In soft real-time systems, the utility of a result decreases as latency increases, but doesn't immediately drop to zero:

Utility
  ↑
1.0 ┤●●●●●●●●●
    │         ●●●●
    │              ●●●●
    │                   ●●●●
    │                        ●●●●
    │                             ●●●●
0.0 ┤                                  ●●●→
    └──────────────────────────────────────→ Latency
         Deadline     Degraded     Unusable

This gradual degradation curve is what distinguishes soft from firm/hard real-time. The system remains "correct" even when slow; it just provides a worse experience.

Design for the Tail

In soft real-time systems, focus optimization effort on the tail of the latency distribution. Cutting p99 from 500ms to 100ms matters more than cutting p50 from 30ms to 20ms. The worst-case users drive churn and bad reviews.

Firm Real-Time Systems: The Middle Ground

Firm real-time represents an important middle category that is sometimes overlooked. In firm real-time systems, late results have no value, but missing deadlines doesn't cause harm.

The key distinction:

Soft real-time: Late result has reduced value
Firm real-time: Late result has zero value (but no harm)
Hard real-time: Late result has negative value (causes harm)

Examples of firm real-time:

Firm Real-Time System Examples
System	Deadline	If Deadline Missed
Financial market data feed	Trade window (~ms)	Stale quote is worthless; new quote coming
Live sports graphics overlay	Frame deadline (16ms)	Miss the frame; display on next one
Sensor sampling	Sampling period	Skip this sample; next one coming
Traffic light sequencing	Phase timing	Extend current phase; don't cause collision
Weather radar update	Rotation period	Display stale data; new scan starting
Online auction bid processing	Auction close time	Bid doesn't count; auction proceeds without it

Design implications of firm real-time:

Firm real-time systems can employ strategies that are inappropriate for either soft or hard real-time:

1. Skip on overload: When the system can't meet a deadline, it can simply discard the work and move on. This is unacceptable for hard real-time (where skipping causes failure) and wasteful for soft real-time (where a late result still has value).

2. Speculative execution: Start work before you're certain it's needed, discard if the deadline passes. The wasted computation is acceptable because the alternative (late completion) has no value anyway.

3. Quality-deadline tradeoffs: Provide a lower-quality result on time rather than a higher-quality result late. For example, display an interpolated position rather than a precisely calculated one if the calculation would miss the frame deadline.

4. Bounded retry: Retry a failed operation only if there's time remaining before the deadline. Once the deadline passes, don't bother retrying—the result would be worthless.

Firm Real-Time in Practice

Many systems that are casually called 'soft real-time' are actually firm real-time. Video streaming, game rendering, and sensor processing often have firm deadlines where late data is simply discarded. Recognizing the firm classification enables simpler designs—you don't need graceful degradation if late results are just thrown away.

Architectural Implications of Classification

The real-time classification directly determines appropriate architectural choices. Using soft real-time architecture for hard real-time requirements—or vice versa—leads to systems that are either dangerous or over-engineered.

Architectural Choices by Real-Time Classification
Aspect	Soft Real-Time	Firm Real-Time	Hard Real-Time
Operating System	Linux, Windows, general-purpose	Linux RT_PREEMPT, QNX	VxWorks, RTEMS, FreeRTOS, bare metal
Programming Language	Any (Java, Python, Go, etc.)	C++, Go, Rust (managed carefully)	C, Ada, SPARK (formally verifiable)
Memory Management	Garbage collection acceptable	GC with tuning, or manual	Static allocation only, no heap
Concurrency Model	Thread pools, async/await	Priority scheduling, lock-free	Rate monotonic/EDF scheduling
Error Handling	Exceptions, retries, degradation	Bounded retries, then skip	Pre-validated inputs, fail-safe defaults
Resource Allocation	Dynamic, on-demand	Bounded dynamic, pre-reserved	Static, compile-time allocation
Testing Approach	Load testing, chaos engineering	Stress testing, latency profiling	Formal verification, DO-178C/ISO 26262
Monitoring	Metrics, percentiles, SLOs	Histograms, timeout tracking	Watchdogs, hardware interlocks

Hardware considerations:

Hard real-time systems often require specialized hardware:

Deterministic processors: Simpler CPUs without deep pipelining, branch prediction, or speculative execution
Locked caches: Cache partitioning to prevent interference between tasks
Hardware watchdogs: External circuits that reset the system if deadlines are missed
Redundant systems: Triple modular redundancy (TMR) for fault tolerance
Safety-rated components: Automotive/aviation-grade parts with known failure modes

Soft real-time systems can use commodity hardware with performance optimizations:

Fast CPUs: Maximize average performance
Large caches: Improve typical-case access times
SSDs and NVMe: Reduce I/O latency
High-bandwidth networks: 10/25/100Gbps networking
Geographic distribution: Edge servers for lower user latency

Don't Mix Without Isolation

If your system contains both hard and soft real-time components, they must be rigorously isolated. A garbage collection pause in a soft real-time component must not block a hard real-time control loop. Use separate processes, cores, or even separate physical machines to ensure hard real-time guarantees are not compromised by soft real-time activity.

Mixed-Criticality Systems

Real-world systems often combine components with different criticality levels. A modern automobile contains hard real-time engine control, firm real-time instrument display, and soft real-time infotainment—all potentially sharing computational resources.

The mixed-criticality challenge:

Running tasks of different criticality on shared resources creates interference. If a low-criticality task consumes CPU time, memory bandwidth, or cache space, it can delay high-criticality tasks and potentially cause deadline misses.

Approaches to mixed-criticality:

Mixed-Criticality Design Strategies

•Physical separation — Dedicate separate hardware to different criticality levels. Most expensive but simplest to verify.
•Temporal partitioning — Time-division multiplexing where each criticality level gets guaranteed time slots. Used in avionics (ARINC 653).
•Spatial partitioning — Separate memory regions and CPU cores for different criticality levels. Hypervisors can enforce boundaries.
•Priority-based scheduling — High-criticality tasks preempt low-criticality tasks. Requires careful analysis of interference.
•Criticality mode switching — System operates in different modes; when overloaded, drops low-criticality work to guarantee high-criticality deadlines.
•Adaptive resource allocation — Dynamically adjust resource allocation based on system state and current criticality demands.

Example: Automotive zonal architecture

Modern vehicles use a zonal architecture that separates systems by criticality:

Zone	Criticality	Examples	Hardware
Safety Domain	Hard RT	ABS, ESC, Airbags	Dedicated safety MCUs
Powertrain	Hard RT	Engine control, Transmission	Powertrain ECU
Body Control	Firm RT	Door locks, Windows, Lights	Body control module
ADAS	Hard/Firm RT	Lane keeping, Adaptive cruise	GPU + Safety MCU
Infotainment	Soft RT	Navigation, Media, Apps	Central compute, Linux
Connectivity	Soft RT	Cellular, WiFi, OTA updates	Connectivity module

These domains communicate via automotive Ethernet and CAN bus, but interference is controlled through gateway modules and QoS mechanisms.

Certification Challenges

In regulated industries, mixing criticality levels on shared hardware requires demonstrating that low-criticality faults cannot affect high-criticality functions. This 'freedom from interference' proof is one of the hardest aspects of safety certification. When in doubt, separate.

Failure Handling by Criticality

How a system handles deadline misses or component failures depends critically on its real-time classification. The wrong failure strategy can turn a recoverable situation into a catastrophe, or waste resources on unnecessary caution.

Hard real-time failure handling:

Hard Real-Time Failure Strategies

•Fail-safe defaults — If control output cannot be computed, apply a safe default (e.g., cut engine fuel, apply brakes gradually).
•Watchdog timers — Hardware monitors deadline compliance; triggers safe state if deadlines are missed.
•Redundant computation — Multiple independent channels compute the same result; voting determines the output.
•Built-in self-test — Continuous testing detects degradation before it causes deadline misses.
•Graceful degradation — Fall back to simpler algorithms with shorter WCET when full algorithm might miss deadline.
•No retry — If an operation fails, don't retry—there's no time. Use the backup/default value.

Soft real-time failure handling:

Soft Real-Time Failure Strategies

•Retries with backoff — Retry failed operations with exponential backoff; succeed eventually.
•Timeout and fallback — Set timeouts; use cached or degraded responses when timeout is exceeded.
•Load shedding — Under overload, reject new requests to protect in-flight requests.
•Circuit breakers — Stop calling failing services; fail fast rather than accumulating latency.
•Quality reduction — Serve lower-quality responses (smaller images, less personalization) under load.
•User notification — Inform users of degraded performance rather than failing silently.

Firm real-time failure handling:

Firm Real-Time Failure Strategies

•Discard and continue — If deadline will be missed, stop the current computation and start the next one.
•Bounded work — Limit work per deadline to what can definitely complete; better to do less than nothing.
•Skip frames/samples — Miss one frame/sample rather than delay all subsequent ones.
•Interpolation — Use predicted/interpolated values when actual computation doesn't complete in time.
•Temporal overload management — Temporarily drop low-priority updates while catching up.

Test Your Failure Paths

Whatever your failure strategy, test it explicitly. Inject deadline misses, component failures, and overload conditions. The worst time to discover your failure handling doesn't work is during an actual failure. Chaos engineering principles apply even (especially) to real-time systems.

Classifying Your System

Given the importance of correct classification, how do you determine which category your system falls into? Use these questions as a guide:

Question 1: What happens if you miss a deadline once?

If someone could be injured or killed → Hard real-time
If equipment could be damaged → Hard real-time
If the work product is worthless but no harm occurs → Firm real-time
If the work product is still somewhat useful → Soft real-time

Question 2: What happens if you miss many deadlines?

If the system becomes dangerous → Hard real-time
If the system becomes useless → Firm or Hard real-time (depending on danger)
If user experience degrades → Soft real-time

Question 3: Does your domain have safety regulations?

Aviation (DO-178C), Automotive (ISO 26262), Medical (IEC 62304) → Likely Hard real-time for safety functions
Financial (MiFID II latency requirements) → Likely Firm real-time
Consumer software (no specific regulations) → Likely Soft real-time

System Classification Decision Matrix
If deadline miss causes...	And frequency is...	Then system is...
Injury/death/equipment damage	Unacceptable at any rate	Hard real-time
Worthless result but no harm	Tolerable occasionally	Firm real-time
Degraded but usable result	Tolerable within SLO bounds	Soft real-time
Mixed (some functions critical)	Varies by function	Mixed-criticality

When in Doubt, Consult Domain Experts

Classifying a system as soft real-time when it's actually hard real-time can have fatal consequences. If there's any doubt about safety implications, consult domain experts, safety engineers, and applicable standards. The cost of over-classifying (treating soft as hard) is money; the cost of under-classifying (treating hard as soft) can be lives.

Summary: Soft vs Hard Real-Time

We've examined the critical distinction between soft, firm, and hard real-time systems. Let's consolidate the key concepts:

Key Takeaways

•Classification is by consequence — Hard, firm, and soft real-time are defined by what happens when deadlines are missed, not by speed.
•Hard real-time means guarantees — Deadline misses constitute system failure; absolute assurance is required.
•Soft real-time means optimization — Deadline misses degrade quality but system remains functional.
•Firm real-time is often overlooked — Late results are worthless but harmless; enables unique design strategies.
•Architecture follows classification — OS, language, memory management, testing—everything changes based on which category you're in.
•Mixed-criticality requires isolation — When combining different levels, ensure high-criticality functions are protected from low-criticality interference.
•Failure strategies differ fundamentally — Hard real-time uses fail-safe defaults; soft real-time uses retries and degradation.
•Classification must be rigorous — Misclassification leads to dangerous under-engineering or wasteful over-engineering.

What's next:

With the theoretical foundations established, the final page of this module explores real-world use cases for real-time systems across industries. We'll examine specific applications in gaming, finance, industrial control, communication, and other domains to see how these principles apply in practice.

Page Complete

You now understand the critical distinctions between soft, firm, and hard real-time systems. This classification framework is essential for making appropriate architectural decisions and ensuring your system is neither dangerously under-engineered nor wastefully over-engineered for its actual requirements.

3 / 4

Loading learning content...

System Design (HLD)What Is Real-Time?

Understanding Real-Time Systems

LevelIntermediate

Duration60 mins

TopicWhat Is Real-Time?

3 / 4

Soft vs Hard Real-Time

The Spectrum of Deadline Criticality

What You Will Learn

Formal Definitions

The real-time systems community has established precise definitions that distinguish system categories based on the consequences of deadline misses.

Hard Real-Time:

A hard real-time system is one in which missing a deadline constitutes a system failure. The correctness of the system depends absolutely on results being produced within specified deadlines.

Soft Real-Time:

A soft real-time system is one in which missing deadlines degrades the quality of service but does not constitute system failure. The system remains useful even when deadlines are occasionally missed.

Firm Real-Time:

A firm real-time system is one in which missing a deadline renders the result worthless, but does not cause system failure. Late results have no utility but also no negative consequences.

Real-Time System Classification Comparison
Characteristic	Soft Real-Time	Firm Real-Time	Hard Real-Time
Deadline miss consequence	Degraded quality	Wasted effort	System failure
Value of late result	Partial (degrades gracefully)	Zero (worthless)	Negative (potentially harmful)
Tolerance for misses	Some percentage acceptable	Wasteful but survivable	Zero tolerance
Design approach	Best-effort optimization	High probability guarantee	Absolute guarantee
Verification requirement	Statistical testing	Probabilistic analysis	Formal proof / certification
Overload handling	Graceful degradation	Drop requests, continue	Prevent overload condition

The Classification Matters

Hard Real-Time Systems: Guaranteed Deadlines

Characteristics of hard real-time systems:

Hard Real-Time System Properties

•Deterministic execution environment — The worst-case execution time of every operation must be known and bounded.
•Formal verification — Often requires mathematical proof that deadlines will be met, not just testing.
•Static resource allocation — Resources are pre-allocated at design time, not dynamically at runtime.
•No dynamic memory allocation — malloc/new are prohibited in critical paths due to unbounded timing.
•Certified development processes — Standards like DO-178C (avionics), ISO 26262 (automotive) mandate rigorous development.
•Watchdog mechanisms — Hardware watchdogs detect deadline misses and trigger fail-safe responses.
•Redundancy for fault tolerance — Critical functions are replicated to prevent single points of failure.

Examples of hard real-time systems:

Domain	System	Typical Deadline	Consequence of Miss
Aerospace	Flight control surfaces	10-20ms	Aircraft becomes uncontrollable
Aerospace	Engine control (FADEC)	1-5ms	Engine damage or failure
Automotive	Airbag deployment	10ms	Airbag deploys too late to protect
Automotive	Anti-lock braking (ABS)	1-5ms	Wheels lock, vehicle skids
Medical	Cardiac pacemaker	Milliseconds	Heart rhythm disruption
Medical	Radiation therapy	Milliseconds	Incorrect dosage delivery
Industrial	Nuclear reactor control	Varies by system	Potential meltdown
Industrial	Robot arm control	1-10ms	Collision with humans/objects

The fundamental constraint:

WCET analysis of all code paths
Schedulability analysis proving all tasks can complete on time
Resource reservation guaranteeing CPU, memory, and I/O availability
Bounded interrupt latency from the operating system
Fail-safe defaults for any situation where deadlines might be missed

Hard Real-Time ≠ Fast

Soft Real-Time Systems: Optimized Experience

Soft real-time systems are the most common type of real-time system in commercial software. They aim to meet deadlines most of the time but accept that occasional misses are tolerable.

Characteristics of soft real-time systems:

Soft Real-Time System Properties

•Statistical performance targets — SLOs specify percentiles (e.g., p99 < 100ms) rather than absolute guarantees.
•Graceful degradation — When overloaded, quality decreases smoothly rather than failing catastrophically.
•Dynamic resource allocation — Resources can be allocated on-demand based on current load.
•Standard development practices — No special certification required; standard testing and monitoring suffice.
•Adaptive algorithms — Systems can adjust quality/latency tradeoffs based on conditions.
•User experience focus — Design targets human perception thresholds, not safety requirements.
•Economic consequences — Deadline misses cause business impact (churn, lost revenue) rather than safety impact.

Examples of soft real-time systems:

Domain	System	Target Latency	Consequence of Miss
Web	E-commerce checkout	< 1 second	Cart abandonment, lost sale
Web	Search results	< 200ms	User frustration, competitor switch
Media	Video streaming	< 100ms jitter	Buffering, quality reduction
Media	Live streaming	< 3 seconds glass-to-glass	Viewer disengagement
Communication	Video conferencing	< 150ms one-way	Awkward conversation
Gaming	Multiplayer games	< 100ms round-trip	Competitive disadvantage
Productivity	Document collaboration	< 500ms sync	Minor collaboration friction
Mobile	App responsiveness	< 100ms to feedback	Perceived sluggishness

The value function of lateness:

In soft real-time systems, the utility of a result decreases as latency increases, but doesn't immediately drop to zero:

Utility
  ↑
1.0 ┤●●●●●●●●●
    │         ●●●●
    │              ●●●●
    │                   ●●●●
    │                        ●●●●
    │                             ●●●●
0.0 ┤                                  ●●●→
    └──────────────────────────────────────→ Latency
         Deadline     Degraded     Unusable

This gradual degradation curve is what distinguishes soft from firm/hard real-time. The system remains "correct" even when slow; it just provides a worse experience.

Design for the Tail

Firm Real-Time Systems: The Middle Ground

Firm real-time represents an important middle category that is sometimes overlooked. In firm real-time systems, late results have no value, but missing deadlines doesn't cause harm.

The key distinction:

Soft real-time: Late result has reduced value
Firm real-time: Late result has zero value (but no harm)
Hard real-time: Late result has negative value (causes harm)

Examples of firm real-time:

Firm Real-Time System Examples
System	Deadline	If Deadline Missed
Financial market data feed	Trade window (~ms)	Stale quote is worthless; new quote coming
Live sports graphics overlay	Frame deadline (16ms)	Miss the frame; display on next one
Sensor sampling	Sampling period	Skip this sample; next one coming
Traffic light sequencing	Phase timing	Extend current phase; don't cause collision
Weather radar update	Rotation period	Display stale data; new scan starting
Online auction bid processing	Auction close time	Bid doesn't count; auction proceeds without it

Design implications of firm real-time:

Firm real-time systems can employ strategies that are inappropriate for either soft or hard real-time:

4. Bounded retry: Retry a failed operation only if there's time remaining before the deadline. Once the deadline passes, don't bother retrying—the result would be worthless.

Firm Real-Time in Practice

Architectural Implications of Classification

Architectural Choices by Real-Time Classification
Aspect	Soft Real-Time	Firm Real-Time	Hard Real-Time
Operating System	Linux, Windows, general-purpose	Linux RT_PREEMPT, QNX	VxWorks, RTEMS, FreeRTOS, bare metal
Programming Language	Any (Java, Python, Go, etc.)	C++, Go, Rust (managed carefully)	C, Ada, SPARK (formally verifiable)
Memory Management	Garbage collection acceptable	GC with tuning, or manual	Static allocation only, no heap
Concurrency Model	Thread pools, async/await	Priority scheduling, lock-free	Rate monotonic/EDF scheduling
Error Handling	Exceptions, retries, degradation	Bounded retries, then skip	Pre-validated inputs, fail-safe defaults
Resource Allocation	Dynamic, on-demand	Bounded dynamic, pre-reserved	Static, compile-time allocation
Testing Approach	Load testing, chaos engineering	Stress testing, latency profiling	Formal verification, DO-178C/ISO 26262
Monitoring	Metrics, percentiles, SLOs	Histograms, timeout tracking	Watchdogs, hardware interlocks

Hardware considerations:

Hard real-time systems often require specialized hardware:

Deterministic processors: Simpler CPUs without deep pipelining, branch prediction, or speculative execution
Locked caches: Cache partitioning to prevent interference between tasks
Hardware watchdogs: External circuits that reset the system if deadlines are missed
Redundant systems: Triple modular redundancy (TMR) for fault tolerance
Safety-rated components: Automotive/aviation-grade parts with known failure modes

Soft real-time systems can use commodity hardware with performance optimizations:

Fast CPUs: Maximize average performance
Large caches: Improve typical-case access times
SSDs and NVMe: Reduce I/O latency
High-bandwidth networks: 10/25/100Gbps networking
Geographic distribution: Edge servers for lower user latency

Don't Mix Without Isolation

Mixed-Criticality Systems

The mixed-criticality challenge:

Approaches to mixed-criticality:

Mixed-Criticality Design Strategies

•Physical separation — Dedicate separate hardware to different criticality levels. Most expensive but simplest to verify.
•Temporal partitioning — Time-division multiplexing where each criticality level gets guaranteed time slots. Used in avionics (ARINC 653).
•Spatial partitioning — Separate memory regions and CPU cores for different criticality levels. Hypervisors can enforce boundaries.
•Priority-based scheduling — High-criticality tasks preempt low-criticality tasks. Requires careful analysis of interference.
•Criticality mode switching — System operates in different modes; when overloaded, drops low-criticality work to guarantee high-criticality deadlines.
•Adaptive resource allocation — Dynamically adjust resource allocation based on system state and current criticality demands.

Example: Automotive zonal architecture

Modern vehicles use a zonal architecture that separates systems by criticality:

Zone	Criticality	Examples	Hardware
Safety Domain	Hard RT	ABS, ESC, Airbags	Dedicated safety MCUs
Powertrain	Hard RT	Engine control, Transmission	Powertrain ECU
Body Control	Firm RT	Door locks, Windows, Lights	Body control module
ADAS	Hard/Firm RT	Lane keeping, Adaptive cruise	GPU + Safety MCU
Infotainment	Soft RT	Navigation, Media, Apps	Central compute, Linux
Connectivity	Soft RT	Cellular, WiFi, OTA updates	Connectivity module

These domains communicate via automotive Ethernet and CAN bus, but interference is controlled through gateway modules and QoS mechanisms.

Certification Challenges

Failure Handling by Criticality

Hard real-time failure handling:

Hard Real-Time Failure Strategies

•Fail-safe defaults — If control output cannot be computed, apply a safe default (e.g., cut engine fuel, apply brakes gradually).
•Watchdog timers — Hardware monitors deadline compliance; triggers safe state if deadlines are missed.
•Redundant computation — Multiple independent channels compute the same result; voting determines the output.
•Built-in self-test — Continuous testing detects degradation before it causes deadline misses.
•Graceful degradation — Fall back to simpler algorithms with shorter WCET when full algorithm might miss deadline.
•No retry — If an operation fails, don't retry—there's no time. Use the backup/default value.

Soft real-time failure handling:

Soft Real-Time Failure Strategies

•Retries with backoff — Retry failed operations with exponential backoff; succeed eventually.
•Timeout and fallback — Set timeouts; use cached or degraded responses when timeout is exceeded.
•Load shedding — Under overload, reject new requests to protect in-flight requests.
•Circuit breakers — Stop calling failing services; fail fast rather than accumulating latency.
•Quality reduction — Serve lower-quality responses (smaller images, less personalization) under load.
•User notification — Inform users of degraded performance rather than failing silently.

Firm real-time failure handling:

Firm Real-Time Failure Strategies

•Discard and continue — If deadline will be missed, stop the current computation and start the next one.
•Bounded work — Limit work per deadline to what can definitely complete; better to do less than nothing.
•Skip frames/samples — Miss one frame/sample rather than delay all subsequent ones.
•Interpolation — Use predicted/interpolated values when actual computation doesn't complete in time.
•Temporal overload management — Temporarily drop low-priority updates while catching up.

Test Your Failure Paths

Classifying Your System

Given the importance of correct classification, how do you determine which category your system falls into? Use these questions as a guide:

Question 1: What happens if you miss a deadline once?

If someone could be injured or killed → Hard real-time
If equipment could be damaged → Hard real-time
If the work product is worthless but no harm occurs → Firm real-time
If the work product is still somewhat useful → Soft real-time

Question 2: What happens if you miss many deadlines?

If the system becomes dangerous → Hard real-time
If the system becomes useless → Firm or Hard real-time (depending on danger)
If user experience degrades → Soft real-time

Question 3: Does your domain have safety regulations?

Aviation (DO-178C), Automotive (ISO 26262), Medical (IEC 62304) → Likely Hard real-time for safety functions
Financial (MiFID II latency requirements) → Likely Firm real-time
Consumer software (no specific regulations) → Likely Soft real-time

System Classification Decision Matrix
If deadline miss causes...	And frequency is...	Then system is...
Injury/death/equipment damage	Unacceptable at any rate	Hard real-time
Worthless result but no harm	Tolerable occasionally	Firm real-time
Degraded but usable result	Tolerable within SLO bounds	Soft real-time
Mixed (some functions critical)	Varies by function	Mixed-criticality

When in Doubt, Consult Domain Experts

Summary: Soft vs Hard Real-Time

We've examined the critical distinction between soft, firm, and hard real-time systems. Let's consolidate the key concepts:

Key Takeaways

•Classification is by consequence — Hard, firm, and soft real-time are defined by what happens when deadlines are missed, not by speed.
•Hard real-time means guarantees — Deadline misses constitute system failure; absolute assurance is required.
•Soft real-time means optimization — Deadline misses degrade quality but system remains functional.
•Firm real-time is often overlooked — Late results are worthless but harmless; enables unique design strategies.
•Architecture follows classification — OS, language, memory management, testing—everything changes based on which category you're in.
•Mixed-criticality requires isolation — When combining different levels, ensure high-criticality functions are protected from low-criticality interference.
•Failure strategies differ fundamentally — Hard real-time uses fail-safe defaults; soft real-time uses retries and degradation.
•Classification must be rigorous — Misclassification leads to dangerous under-engineering or wasteful over-engineering.

What's next:

Page Complete

3 / 4