Imagine a restaurant that automatically upgrades frequent diners to priority seating while moving tourists to the regular queue—without anyone having to declare their customer type upfront. The restaurant learns from behavior: regulars get fast service because they've proven they value quick turnaround, while newcomers start in the general queue until their dining patterns reveal their needs.
The Multi-Level Feedback Queue (MLFQ) applies this principle to CPU scheduling. Rather than requiring processes to declare whether they're interactive or batch, MLFQ observes their behavior and adjusts priority accordingly. Processes that use little CPU before blocking (typical of interactive apps) get promoted to high priority. Processes that consume their entire quantum (typical of CPU-bound batch jobs) get demoted.
This elegant feedback mechanism—proposed by Fernando Corbató in the 1960s as part of CTSS and Multics—remains the foundation of scheduling in nearly every modern operating system. Understanding MLFQ is essential for anyone seeking to truly comprehend how computers juggle dozens of applications while keeping everything responsive.
By the end of this page, you will understand: the fundamental MLFQ rules, how feedback adjusts priority, the starvation and gaming problems with solutions, formal MLFQ semantics, implementation details, and how modern schedulers like Linux CFS and Windows extend MLFQ concepts.
MLFQ solves MLQ's fundamental limitation: the requirement for static process classification. Instead of administrators or programmers declaring process types, MLFQ infers behavior from observation.
The Key Insight:
Process behavior reveals its needs:
By tracking whether a process yields early (interactive) or runs to quantum expiration (CPU-bound), MLFQ automatically classifies processes and assigns appropriate priority.
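This feedback step can be sketched in a few lines (the helper name and numbers here are illustrative, not from any real kernel):

```python
def adjust_priority(priority: int, ran_ms: int, quantum_ms: int,
                    num_queues: int) -> int:
    """One feedback step: demote on full quantum use, keep priority on early yield."""
    if ran_ms >= quantum_ms:
        # Ran to quantum expiration: CPU-bound signal, sink one level
        return min(priority + 1, num_queues - 1)
    # Yielded early (blocked for I/O): interactive signal, stay put
    return priority

# A process that always exhausts its 8ms quantum sinks to the bottom queue...
p = 0
for _ in range(5):
    p = adjust_priority(p, ran_ms=8, quantum_ms=8, num_queues=4)
print(p)  # 3 (bottom of a 4-queue system)

# ...while one that yields after 3ms keeps its high priority.
print(adjust_priority(0, ran_ms=3, quantum_ms=8, num_queues=4))  # 0
```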
The Feedback Mechanism:
MLFQ embodies a profound principle: optimize for the common case (short interactive bursts) while still handling the uncommon case (long CPU-bound work). By starting everyone at high priority, interactive processes get immediate responsiveness. CPU-bound processes naturally sink to where they belong without explicit classification.
MLFQ behavior is governed by a set of carefully designed rules. The canonical formulation involves the following:
Rule 1: Priority-Based Scheduling
If Priority(A) > Priority(B), then A runs (B doesn't)
Processes in higher-priority queues always run before those in lower queues. This is identical to fixed-priority MLQ.
Rule 2: Equal Priority Uses Round Robin
If Priority(A) = Priority(B), then A and B run in Round Robin within their queue
Within a priority level, fairness is maintained through Round Robin scheduling.
Rule 3: New Processes Start at Highest Priority
When a process enters the system, it is placed at the highest priority (topmost queue)
This benefits interactive processes (they need responsiveness) and gives CPU-bound processes an initial chance to run—they'll be demoted soon anyway.
Rule 4a: Priority Demotion on Full Quantum Usage
If a process uses its entire time quantum, its priority is reduced (moves down one queue)
This is the core feedback mechanism. Processes demonstrating CPU-bound behavior sink to lower priorities.
Rule 4b: Priority Preservation on Early Yield
If a process gives up the CPU before its quantum expires (I/O or blocking), it stays at its current priority
Interactive processes that yield quickly maintain their high priority.
```python
from collections import deque
from dataclasses import dataclass, field
from typing import List, Optional, Tuple
from enum import Enum


class ProcessState(Enum):
    READY = "ready"
    RUNNING = "running"
    BLOCKED = "blocked"
    COMPLETED = "completed"


@dataclass
class MLFQConfig:
    """Configuration for MLFQ scheduler."""
    num_queues: int = 5
    base_quantum: int = 8       # ms, doubles each level
    boost_interval: int = 1000  # ms, for priority boost


@dataclass
class Process:
    pid: int
    name: str
    arrival_time: int
    bursts: List[int]           # Alternating CPU and I/O bursts
    current_burst_idx: int = 0
    remaining_burst: int = 0
    priority: int = 0           # 0 = highest
    state: ProcessState = ProcessState.READY
    quantum_used: int = 0       # Track usage within current quantum allotment
    completion_time: int = 0

    def __post_init__(self):
        if self.bursts:
            self.remaining_burst = self.bursts[0]


class MLFQScheduler:
    """
    Multi-Level Feedback Queue Scheduler

    Implements dynamic priority adjustment based on behavior.
    """

    def __init__(self, config: MLFQConfig):
        self.config = config
        # Create queues: index 0 = highest priority
        self.queues = [deque() for _ in range(config.num_queues)]
        self.current_time = 0
        self.execution_log = []
        self.all_processes: List[Process] = []
        self.blocked_processes: List[Tuple[Process, int]] = []  # (process, io_end)

    def get_quantum(self, priority: int) -> int:
        """Time quantum increases exponentially with lower priority."""
        return self.config.base_quantum * (2 ** priority)

    def add_process(self, process: Process):
        """Add new process to highest priority queue (Rule 3)."""
        process.priority = 0
        process.quantum_used = 0
        self.queues[0].append(process)
        self.all_processes.append(process)
        self.log(f"P{process.pid} ({process.name}) arrives, added to Q0")

    def log(self, message: str):
        self.execution_log.append(f"[{self.current_time:4d}] {message}")

    def demote(self, process: Process):
        """Demote process to lower priority queue (Rule 4a)."""
        if process.priority < self.config.num_queues - 1:
            old_priority = process.priority
            process.priority += 1
            self.log(f"P{process.pid} demoted: Q{old_priority} -> Q{process.priority}")

    def get_next_process(self) -> Optional[Process]:
        """Get highest priority ready process (Rules 1 & 2)."""
        for priority, queue in enumerate(self.queues):
            if queue:
                return queue.popleft()
        return None

    def handle_blocked_processes(self):
        """Check for processes completing I/O."""
        still_blocked = []
        for process, io_end in self.blocked_processes:
            if io_end <= self.current_time:
                # I/O complete - move to next CPU burst
                process.current_burst_idx += 1
                if process.current_burst_idx < len(process.bursts):
                    process.remaining_burst = process.bursts[process.current_burst_idx]
                    process.state = ProcessState.READY
                    process.quantum_used = 0  # Reset quantum tracking
                    # Stay at same priority (Rule 4b - yielded early)
                    self.queues[process.priority].append(process)
                    self.log(f"P{process.pid} I/O complete, returns to Q{process.priority}")
                else:
                    process.state = ProcessState.COMPLETED
                    process.completion_time = self.current_time
                    self.log(f"P{process.pid} completed")
            else:
                still_blocked.append((process, io_end))
        self.blocked_processes = still_blocked

    def run(self) -> List[Process]:
        """Execute MLFQ scheduling simulation."""
        while True:
            self.handle_blocked_processes()
            process = self.get_next_process()

            if process is None:
                if self.blocked_processes:
                    # Jump to next I/O completion
                    next_io = min(end for _, end in self.blocked_processes)
                    self.current_time = next_io
                    continue
                else:
                    break  # All done

            # Determine execution time
            quantum = self.get_quantum(process.priority)
            remaining_quantum = quantum - process.quantum_used

            if process.remaining_burst <= remaining_quantum:
                # Burst completes within quantum
                exec_time = process.remaining_burst
                used_full_quantum = False
            else:
                # Quantum expires before burst completes
                exec_time = remaining_quantum
                used_full_quantum = True

            self.log(f"P{process.pid} runs for {exec_time}ms (Q{process.priority}, "
                     f"quantum={quantum}ms)")

            process.remaining_burst -= exec_time
            process.quantum_used += exec_time
            self.current_time += exec_time

            if process.remaining_burst == 0:
                # CPU burst complete
                process.current_burst_idx += 1
                if process.current_burst_idx < len(process.bursts):
                    # Start I/O burst
                    io_time = process.bursts[process.current_burst_idx]
                    process.state = ProcessState.BLOCKED
                    self.blocked_processes.append(
                        (process, self.current_time + io_time)
                    )
                    self.log(f"P{process.pid} blocks for I/O ({io_time}ms)")
                else:
                    process.state = ProcessState.COMPLETED
                    process.completion_time = self.current_time
                    self.log(f"P{process.pid} completed at time {self.current_time}")
            elif used_full_quantum:
                # Used full quantum - demote and re-queue
                self.demote(process)
                process.quantum_used = 0
                self.queues[process.priority].append(process)
            else:
                # Quantum not exhausted, preempted by higher priority
                self.queues[process.priority].append(process)

        return self.all_processes


# Example simulation
if __name__ == "__main__":
    config = MLFQConfig(num_queues=4, base_quantum=8)
    scheduler = MLFQScheduler(config)

    # Process with alternating CPU and I/O bursts
    # Interactive: short CPU bursts, frequent I/O
    interactive = Process(
        pid=1, name="editor", arrival_time=0,
        bursts=[5, 50, 3, 40, 4]  # CPU, I/O, CPU, I/O, CPU
    )

    # CPU-bound: long CPU bursts
    cpu_bound = Process(
        pid=2, name="compiler", arrival_time=0,
        bursts=[100]  # Single long CPU burst
    )

    scheduler.add_process(interactive)
    scheduler.add_process(cpu_bound)
    completed = scheduler.run()

    print("=" * 60)
    print("MLFQ Scheduling Simulation")
    print("=" * 60)
    for entry in scheduler.execution_log:
        print(entry)
    print("\n" + "-" * 60)
    print("Results:")
    for p in completed:
        print(f"  P{p.pid} ({p.name}): completed at {p.completion_time}ms")
```

Let's trace through a detailed example to see how MLFQ dynamically adjusts priorities.
System Configuration: three queues (Q0 = highest, Q2 = lowest) with quanta of 8ms, 16ms, and 32ms (doubling per level).
Processes:
| Process | Type | Behavior Pattern | CPU Burst |
|---|---|---|---|
| P1 (vim) | Interactive | 5ms CPU, then blocks for I/O | 5ms (repeating) |
| P2 (gcc) | CPU-bound | Continuous computation | 200ms total |
Execution Timeline:
| Time | Event | Queue State | Notes |
|---|---|---|---|
| 0 | P1, P2 arrive | Q0: [P1, P2] | Both start at highest priority |
| 0 | P1 runs | Q0: [P2] | P1 selected (first in Q0) |
| 5 | P1 blocks (I/O) | Q0: [P2] | P1 used only 5ms < 8ms quantum → stays in Q0 |
| 5 | P2 runs | Q0: [] | P2 starts with 8ms quantum |
| 13 | P2 quantum expires | Q1: [P2] | P2 used full 8ms quantum → demoted to Q1 |
| 13 | P1 returns from I/O | Q0: [P1], Q1: [P2] | P1 goes to Q0 (preserved priority) |
| 13 | P1 runs | Q0: [], Q1: [P2] | P1 in Q0 preempts P2 in Q1 |
| 18 | P1 blocks (I/O) | Q1: [P2] | P1 again yields early, stays Q0 |
| 18 | P2 runs | Q1: [] | P2 gets 16ms quantum in Q1 |
| 34 | P2 quantum expires | Q2: [P2] | P2 demoted to Q2 |
| 34 | P1 returns | Q0: [P1], Q2: [P2] | |
| 34 | P1 runs | Q2: [P2] | P1 preempts again |
| 39 | P1 blocks | Q2: [P2] | |
| 39 | P2 runs | Q2: [] | 32ms quantum, 176ms remaining |
| 71 | P2 quantum expires | Q2: [P2] | P2 at bottom, stays in Q2 |
| ... | Pattern continues | Q2: [P2] | P2 eventually completes |
Key Observations:
Without any explicit labeling, MLFQ correctly identified P1 as interactive (high priority) and P2 as CPU-bound (low priority). The feedback mechanism worked as designed—behavior revealed classification, and the scheduler adapted accordingly.
The basic MLFQ rules have two significant vulnerabilities that must be addressed for production use:
Problem 1: Starvation
If there are many interactive processes at high priority, CPU-bound processes at low priority may never execute. Consider a system with a steady stream of interactive processes in Q0 and a single CPU-bound process in the bottom queue: the CPU-bound process waits forever as the interactive processes continuously preempt it.
Problem 2: Gaming the Scheduler
A malicious or poorly-designed process can exploit Rule 4b by voluntarily yielding just before its quantum expires:
```c
while (1) {
    // Do CPU work for 7.9ms (just under 8ms quantum)
    busy_work();
    // Issue trivial I/O to yield without using full quantum
    read(fd, buf, 1);  // 'Yield' to avoid demotion
}
```
This process does mostly CPU work but maintains high priority by 'cheating' with minimal I/O operations. It monopolizes high-priority CPU time while preventing legitimate interactive processes from running.
Problem 3: Behavior Phase Changes
A process may change behavior during execution: a web browser, for example, is CPU-bound while loading and rendering a page, then becomes interactive once the user starts scrolling.
With basic MLFQ, a process demoted during a CPU-bound phase stays demoted even when it becomes interactive again. This violates the responsiveness guarantee for legitimately interactive work.
These problems are not edge cases—they represent fundamental flaws in the basic MLFQ design. Production schedulers MUST address all three issues. The solutions (priority boost and better accounting) add complexity but are essential for a robust system.
Modern MLFQ implementations add two additional rules to address the vulnerabilities:
Rule 5: Priority Boost (Solves Starvation and Phase Change)
After a fixed period S, move all processes to the highest priority queue
Periodically (typically every 1-2 seconds), every process gets boosted to Q0. This ensures that starved low-priority processes are guaranteed CPU time, and that processes whose behavior has shifted from CPU-bound back to interactive regain their responsiveness.
Rule 4 (Revised): Comprehensive Time Accounting (Solves Gaming)
Once a process uses its total time allotment at a priority level (regardless of how many times it gives up the CPU), its priority is reduced
Instead of tracking per-quantum usage, track cumulative CPU time at each priority level:
```c
if (total_time_at_current_priority >= allotment) {
    demote(process);
    total_time_at_current_priority = 0;
}
```
This defeats gaming because the malicious process still gets demoted after accumulating its allotment, even across many short bursts.
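The arithmetic behind this can be checked directly. A small sketch (the function name and the 20ms allotment are illustrative):

```python
def runs_until_demotion(burst_ms: float, allotment_ms: float) -> int:
    """Count bursts until cumulative CPU time at this level hits the allotment."""
    total, runs = 0.0, 0
    while total < allotment_ms:
        total += burst_ms
        runs += 1
    return runs

# A gamer yielding at 7.9ms (just under an 8ms quantum) still gets demoted:
print(runs_until_demotion(7.9, 20))  # 3 bursts: 7.9 * 3 = 23.7ms >= 20ms
# Under per-quantum accounting, the same process would never be demoted.
```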
```c
/* Improved MLFQ with priority boost and time accounting */

#include <stdint.h>
#include <stdbool.h>

#define NUM_QUEUES 4
#define BOOST_PERIOD_MS 1000  /* Priority boost every 1 second */

typedef struct {
    uint32_t pid;
    uint32_t priority;          /* Current queue (0 = highest) */
    uint64_t time_at_priority;  /* Cumulative CPU time at current priority */
    /* ... other fields ... */
} Process;

typedef struct {
    uint64_t allotment;  /* Time allotment before demotion (ms) */
    uint64_t quantum;    /* Time quantum (ms) */
} QueueConfig;

/* Queue configurations: increasing allotment and quantum */
static QueueConfig queue_configs[NUM_QUEUES] = {
    { .allotment = 20, .quantum = 8  },  /* Q0: responsive */
    { .allotment = 40, .quantum = 16 },  /* Q1 */
    { .allotment = 80, .quantum = 32 },  /* Q2 */
    { .allotment = 0,  .quantum = 64 },  /* Q3: no demotion (bottom) */
};

static uint64_t last_boost_time = 0;

/**
 * Update process after execution.
 * Handles demotion based on cumulative time accounting.
 */
void update_process_after_run(Process* p, uint64_t time_run) {
    p->time_at_priority += time_run;
    QueueConfig* config = &queue_configs[p->priority];

    /* Check for demotion (Rule 4 revised) */
    if (p->priority < NUM_QUEUES - 1 &&
        config->allotment > 0 &&
        p->time_at_priority >= config->allotment) {
        /* Demote to next level */
        p->priority++;
        p->time_at_priority = 0;  /* Reset for new level */
    }
}

/**
 * Priority boost - called periodically.
 * Moves all processes to highest priority (Rule 5).
 */
void priority_boost(Process** all_processes, uint32_t num_processes) {
    for (uint32_t i = 0; i < num_processes; i++) {
        Process* p = all_processes[i];
        p->priority = 0;          /* Boost to highest priority */
        p->time_at_priority = 0;  /* Reset time accounting */
    }
}

/**
 * Check if priority boost is due.
 * Called from scheduler main loop.
 */
void check_priority_boost(
    uint64_t current_time,
    Process** all_processes,
    uint32_t num_processes
) {
    if (current_time - last_boost_time >= BOOST_PERIOD_MS) {
        priority_boost(all_processes, num_processes);
        last_boost_time = current_time;
    }
}

/**
 * Get quantum for process's current priority level.
 */
uint64_t get_quantum(Process* p) {
    return queue_configs[p->priority].quantum;
}

/*
 * Example scheduling behavior with these rules:
 *
 * 1. Gaming process tries to yield at 7.9ms each time:
 *    - First few runs: maintains Q0
 *    - After ~3 runs: 7.9 * 3 = 23.7ms > 20ms allotment
 *    - DEMOTED to Q1 despite never using full quantum
 *    - Gaming defeated!
 *
 * 2. Starving process at Q3:
 *    - Every 1000ms: boosted to Q0
 *    - Gets guaranteed CPU access
 *    - If still CPU-bound, sinks back to Q3
 *    - Starvation prevented!
 *
 * 3. Browser behavior change (CPU-bound -> interactive):
 *    - While loading: demoted to Q2/Q3
 *    - After 1000ms: boosted to Q0
 *    - User scrolling: short bursts, stays in Q0
 *    - Responsiveness restored!
 */
```

The boost period S involves a tradeoff: smaller S means better starvation protection but allows CPU-bound processes to repeatedly disrupt high-priority work. Typical values are S = 1-2 seconds. This ensures no process waits more than a second or two for CPU access even under heavy interactive load.
With the solutions incorporated, here is the complete, production-ready MLFQ rule set:
Mathematical Formulation:
For a system with n priority levels (0 = highest):
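The rules can be stated compactly. This is a sketch in our own notation, consistent with the doubling quanta and allotments used throughout this page (Q_0 = base quantum, A_0 = base allotment, S = boost period, t_i = cumulative CPU time at level i):

```latex
\begin{aligned}
\text{quantum}(i)   &= Q_0 \cdot 2^{i}, \qquad i = 0, 1, \dots, n-1 \\
\text{allotment}(i) &= A_0 \cdot 2^{i}, \qquad \text{allotment}(n-1) = \infty \\
\text{demotion:}\quad & t_i \ge \text{allotment}(i) \;\Rightarrow\; i \leftarrow \min(i+1,\, n-1),\; t_i \leftarrow 0 \\
\text{boost:}\quad & \text{every } S \text{ ms:}\quad i \leftarrow 0,\; t_i \leftarrow 0
\end{aligned}
```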
Invariants:
Responsiveness bound: An interactive process's worst-case response time is bounded by the boost period S.
Throughput optimization: CPU-bound processes eventually run with large quanta, minimizing context switch overhead.
Fairness guarantee: Every process receives CPU time at least once per boost period.
Gaming resistance: Cumulative time accounting prevents priority manipulation through artificial yields.
| Queue | Priority | Quantum | Allotment | Typical Processes |
|---|---|---|---|---|
| Q0 | Highest | 8ms | 20ms | Interactive, newly arrived |
| Q1 | High | 16ms | 40ms | Mixed behavior |
| Q2 | Medium | 32ms | 80ms | Computation-heavy |
| Q3 | Low | 64ms | ∞ | CPU-bound (no further demotion) |
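With the allotments in this table, the journey of a purely CPU-bound process to the bottom queue can be computed directly (a small sketch, assuming no boost intervenes):

```python
# Allotments for Q0-Q2 from the configuration table; Q3 never demotes further.
allotments = [20, 40, 80]

# Total CPU time a CPU-bound process receives before settling in Q3
time_to_bottom = sum(allotments)
print(time_to_bottom)  # 140 ms
```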
MLFQ parameters (quanta, allotments, boost period, queue count) significantly affect behavior. Production systems often make these configurable: Linux's sched_latency, Windows' quantum tables, and Solaris' dispatch tables. Optimal values depend on workload characteristics.
Modern operating systems have evolved beyond classical MLFQ, but the core principles remain influential.
Linux: Beyond MLFQ with CFS
Linux's Completely Fair Scheduler (CFS) doesn't use explicit priority queues, but achieves MLFQ-like behavior through virtual runtime (vruntime): each task accumulates vruntime as it consumes CPU, and the scheduler always runs the task with the lowest vruntime. Tasks that sleep frequently accumulate little vruntime and are therefore favored when they wake.
This is conceptually similar to MLFQ but uses continuous virtual runtime instead of discrete queue levels. The effect is gradual, proportional priority adjustment rather than step-function demotions.
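A toy sketch of the vruntime idea (illustrative names and numbers, not kernel code): keep tasks in a min-heap keyed by vruntime, always run the smallest, and charge each task for the CPU time it actually consumed.

```python
import heapq

def pick_next(tasks):
    """Pop the task with the smallest vruntime (ties broken by name)."""
    return heapq.heappop(tasks)

tasks = [(0.0, "editor"), (0.0, "compiler")]
heapq.heapify(tasks)

# CPU actually consumed per pick: the editor mostly sleeps, the compiler
# runs a full slice, so they are charged very differently.
runtime_used = {"editor": 1.0, "compiler": 6.0}

order = []
for _ in range(6):
    vruntime, name = pick_next(tasks)
    order.append(name)
    heapq.heappush(tasks, (vruntime + runtime_used[name], name))

print(order)  # ['compiler', 'editor', 'editor', 'editor', 'editor', 'editor']
```

Because the editor is charged so little, its vruntime stays lowest and it wins nearly every pick, which is the continuous analogue of keeping interactive processes in Q0.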
Key CFS parameters with MLFQ parallels:
sched_latency: Similar to boost period—ensures all processes run within this period
sched_min_granularity: Minimum quantum—prevents excessive switching
Windows: Hybrid Priority Scheduling
Windows uses a 32-level priority system with MLFQ-like dynamic adjustments: threads receive temporary priority boosts after events such as I/O completion or foreground window focus, and boosted priorities decay back toward the thread's base priority as it consumes quanta.
This is essentially MLFQ with continuous boosting rather than periodic global boost, and decay rather than explicit demotion.
macOS/iOS: QoS-Based Scheduling
Apple systems use Quality of Service (QoS) classes (user-interactive, user-initiated, utility, and background, among others) that incorporate MLFQ concepts.
The system observes process behavior and can adjust QoS recommendations, implementing feedback similar to MLFQ.
| Concept | Classic MLFQ | Linux CFS | Windows |
|---|---|---|---|
| Priority levels | Discrete queues | Continuous (virtual runtime) | 32 levels |
| Demotion trigger | Quantum exhaustion | Runtime accumulation | Quantum use + decay |
| Boost mechanism | Periodic global boost | Low vruntime from I/O | Dynamic boost events |
| Anti-gaming | Time allotment | Runtime accounting | Quantum usage + monitoring |
| Quantum variation | Doubles per level | Based on weight/nice | Varies by foreground/class |
While no major OS uses textbook MLFQ exactly, every modern scheduler implements MLFQ's core insight: infer process behavior from CPU usage patterns and adjust priority accordingly. Corbató's 1960s invention remains the conceptual foundation of 21st-century scheduling.
MLFQ has multiple tunable parameters, each involving tradeoffs: the number of priority levels, the quantum growth factor, the allotment before demotion, and the boost period.
| Parameter | Small/Few | Large/Many | Recommendation |
|---|---|---|---|
| Queue count | Coarse priority | Fine-grained control | 4-8 queues |
| Base quantum | Responsive, high overhead | Efficient, slow response | 8-20ms |
| Quantum multiplier | Gradual throughput gain | Fast throughput, starvation risk | 2× |
| Allotment | Quick demotion | Tolerant of bursts | 2-3× quantum |
| Boost period | Anti-starvation priority, frequent disruption | Stable but starvation risk | 1-2 seconds |
Workload-Specific Tuning:
For interactive-heavy systems (desktops): favor smaller base quanta and a shorter boost period, trading some throughput for responsiveness.
For throughput-heavy systems (servers): favor larger quanta and allotments to minimize context-switch overhead.
For mixed workloads: start from moderate defaults (such as those in the table above) and tune based on measured behavior.
MLFQ parameters always involve tradeoffs. Optimizing for one metric (e.g., interactive latency) may harm another (e.g., batch throughput). Production systems typically profile actual workloads and tune parameters based on measurements rather than theory alone.
We have comprehensively explored the Multi-Level Feedback Queue—the self-learning scheduler that has underpinned interactive computing for over 60 years.
Looking Ahead:
The final page of this module explores queue movement policies in detail—the specific rules and algorithms that govern how processes transition between priority levels in MLFQ and similar schedulers. We'll examine advanced techniques for aging, decay, and hybrid approaches used in production systems.
You now possess deep understanding of MLFQ—its design, rules, problems, solutions, and influence on modern scheduling. This knowledge is essential for understanding how any contemporary operating system manages CPU resources.