Having understood why single-threaded execution creates bottlenecks, we must now articulate precisely what we want from concurrent systems. What are we trying to achieve? What does success look like?
Concurrency serves two distinct but related goals: responsiveness and throughput. While they often go together, they represent fundamentally different concerns, optimize for different user experiences, and sometimes require different technical approaches.
Understanding these goals deeply isn't merely academic. The distinction shapes architectural decisions, guides technology selection, and determines how we measure success. A system optimized purely for throughput might feel sluggish to users. A system optimized purely for responsiveness might waste resources and cost more. Excellent systems achieve both.
This page dissects these goals with precision, establishing the vocabulary and mental models you'll need for all subsequent concurrency discussions.
By the end of this page, you will understand the distinction between responsiveness and throughput, how each is measured, the user experiences they create, and how concurrency enables both. You'll also learn when these goals conflict and how to balance them.
Responsiveness is the perceived speed of a system's reaction to user actions. It's not about how much work a system completes, but about how quickly the system acknowledges and begins responding to user input.
The psychology of responsiveness:
Humans perceive delays in specific thresholds. Research by Jakob Nielsen, Robert B. Miller, and others has established well-documented response time limits:
Responsiveness is about staying within these perceptual thresholds—especially for the most frequent user interactions.
| Delay | User Perception | Recommended Action |
|---|---|---|
| 0-100ms | Instantaneous | Direct feedback, no indication needed |
| 100-300ms | Slight lag | Subtle visual acknowledgment |
| 300ms-1s | Noticeable wait | Progress indicator recommended |
| 1-3s | Attention breaks | Progress bar or spinner required |
| 3-10s | Frustrating delay | Detailed progress + keep user engaged |
| > 10s | Unacceptable | Background processing + notification |
Responsiveness vs actual speed:
Critically, responsiveness is about perception, not objective speed. A system can be responsive while still being slow in absolute terms. Consider these two implementations of the same file upload:
Implementation A (non-responsive) gives the user no feedback until the upload finishes. Implementation B (responsive) acknowledges the action immediately and streams progress while the same upload runs.
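A minimal sketch of the two approaches, assuming a hypothetical `uploadFile` API plus `showMessage`/`showProgressBar` UI helpers (none of these are standard library calls):

```typescript
// Implementation A (non-responsive): no feedback until the whole upload finishes
async function uploadBlockingUX(file: File): Promise<void> {
  const result = await uploadFile(file); // ~10 seconds with no visible acknowledgment
  showMessage(`Uploaded ${result.name}`);
}

// Implementation B (responsive): acknowledge instantly, show continuous progress
async function uploadResponsiveUX(file: File): Promise<void> {
  showMessage(`Uploading ${file.name}…`); // immediate acknowledgment (< 100ms)
  showProgressBar(0);

  await uploadFile(file, {
    onProgress: (percent: number) => showProgressBar(percent) // continuous feedback
  });

  showProgressBar(100);
  showMessage(`Uploaded ${file.name}`); // same ~10 seconds total, but it feels fast
}
```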
Both implementations take the same 10 seconds. But the responsive version feels dramatically faster because the user receives immediate, continuous feedback.
Responsiveness is not about doing work faster—it's about never blocking the user's ability to see, interact with, and control the application. The UI must remain live and reactive even while heavy work proceeds in the background.
To improve responsiveness, we must first measure it. Several key metrics capture different aspects of response behavior:
Latency metrics:
Time to First Byte (TTFB): How long until the first byte of the response arrives. Most relevant for network requests.
First Contentful Paint (FCP): For web applications, when the first content appears on screen.
Time to Interactive (TTI): When the application becomes fully responsive to user input.
Input Delay: Time between user action (click, keystroke) and visible response.
Frame Time: For animations and games, time to render each frame (target: 16.67ms for 60fps).
```typescript
// Measuring input delay in a web application
class ResponsivenessMonitor {
  private inputTimes: Map<string, number> = new Map();

  // Called when user interaction starts
  onInputStart(eventId: string) {
    this.inputTimes.set(eventId, performance.now());
  }

  // Called when visible response occurs
  onResponseRendered(eventId: string) {
    const startTime = this.inputTimes.get(eventId);
    if (startTime) {
      const inputDelay = performance.now() - startTime;

      // Classify the response
      if (inputDelay < 100) {
        console.log(`✅ Instant response: ${inputDelay.toFixed(1)}ms`);
      } else if (inputDelay < 300) {
        console.log(`⚠️ Slight lag: ${inputDelay.toFixed(1)}ms`);
      } else {
        console.log(`❌ Noticeable delay: ${inputDelay.toFixed(1)}ms`);
      }

      // Report to analytics
      this.reportMetric('input_delay', inputDelay);
    }
  }

  // Stub: forward the measurement to whatever analytics pipeline you use
  private reportMetric(name: string, valueMs: number): void {
    // e.g. send { name, valueMs } to an analytics endpoint
  }
}

// Web Vitals measurements for responsiveness
interface WebVitalsMetrics {
  FCP: number; // First Contentful Paint (target: < 1.8s)
  LCP: number; // Largest Contentful Paint (target: < 2.5s)
  FID: number; // First Input Delay (target: < 100ms)
  INP: number; // Interaction to Next Paint (target: < 200ms)
}
```

Percentile-based measurement:
Responsiveness metrics should always be reported as percentiles, not averages. Average latency hides the long tail of slow responses that ruin user experience.
For example, here is how latency percentiles might compare for a non-concurrent versus a concurrent implementation of the same service, alongside reasonable targets:
| Percentile | Non-Concurrent System | Concurrent System | Target |
|---|---|---|---|
| P50 | 120ms | 45ms | < 100ms |
| P75 | 350ms | 80ms | < 150ms |
| P95 | 1,200ms | 150ms | < 300ms |
| P99 | 3,500ms | 280ms | < 1,000ms |
| P99.9 | 8,000ms | 450ms | < 3,000ms |
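As a rough illustration of how such numbers are produced, here is a minimal percentile calculation over collected latency samples (nearest-rank method; the sample values are made up):

```typescript
// Nearest-rank percentile over raw latency samples (in milliseconds)
function percentile(samplesMs: number[], p: number): number {
  const sorted = [...samplesMs].sort((a, b) => a - b);
  const rank = Math.ceil((p / 100) * sorted.length) - 1;
  return sorted[Math.max(0, rank)];
}

const latencies = [42, 51, 48, 95, 210, 47, 1800, 52, 60, 44];
console.log(`P50: ${percentile(latencies, 50)}ms`); // typical experience (~51ms)
console.log(`P95: ${percentile(latencies, 95)}ms`); // the slow tail (1800ms)

// The mean (~245ms here) describes no real user: most saw ~50ms, one saw 1.8s
const mean = latencies.reduce((a, b) => a + b, 0) / latencies.length;
console.log(`Mean: ${mean.toFixed(0)}ms`);
```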
Never use average latency as your primary metric. An average of 100ms might hide the fact that 5% of users experience 2+ second delays. Those users will leave, complain, or stop using your product—and the average won't tell you why.
Throughput is the rate at which a system completes work—typically measured as operations per unit time. Unlike responsiveness (which focuses on individual interactions), throughput is about aggregate capacity.
Throughput metrics by domain: web servers measure requests per second (RPS), databases measure queries or transactions per second, message queues measure messages per second, and batch pipelines measure records (or bytes) processed per hour.
Throughput answers a fundamentally different question than responsiveness: not "How quickly do we respond?" but "How much work can we do?"
```typescript
// Simple throughput measurement
class ThroughputMonitor {
  private completedOperations: number = 0;
  private startTime: number = Date.now();

  recordCompletion() {
    this.completedOperations++;
  }

  getOperationsPerSecond(): number {
    const elapsedSeconds = (Date.now() - this.startTime) / 1000;
    return this.completedOperations / elapsedSeconds;
  }
}

// Real-world throughput example: order processing system
interface ThroughputMetrics {
  ordersProcessedPerSecond: number; // Primary metric
  peakOrdersPerSecond: number;      // Capacity planning
  averageOrdersPerHour: number;     // Business reporting
  ordersInBacklog: number;          // Health indicator

  // Derived metrics
  processingCapacityUsed: number;   // e.g., 75% of theoretical max
  headroomRemainingPercent: number; // 25% capacity remaining
}

// Result shape for batch throughput calculations
interface BatchThroughput {
  recordsPerSecond: number;
  recordsPerHour: number;
  totalDuration: number;  // seconds
  effectiveRate: number;  // records per second
}

// Throughput calculation for batch processing
function calculateBatchThroughput(
  recordsProcessed: number,
  startTime: Date,
  endTime: Date
): BatchThroughput {
  const durationSeconds = (endTime.getTime() - startTime.getTime()) / 1000;
  const durationHours = durationSeconds / 3600;

  return {
    recordsPerSecond: recordsProcessed / durationSeconds,
    recordsPerHour: recordsProcessed / durationHours,
    totalDuration: durationSeconds,
    effectiveRate: recordsProcessed / durationSeconds
  };
}
```

The throughput bottleneck:
Every system has a maximum throughput determined by its slowest component—the bottleneck. Identifying and eliminating bottlenecks is the core of throughput optimization.
Common bottleneck locations: a single thread doing all the work; slow database queries; disk and network I/O; external API rate limits; and locks that serialize access to shared state.
Concurrency specifically addresses the single-threaded bottleneck by allowing work to proceed in parallel.
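To make the bottleneck idea concrete, here is a small back-of-the-envelope sketch (the stage names and timings are invented) showing how overall throughput is capped by the slowest stage, and how adding concurrency at that stage moves the cap:

```typescript
// Hypothetical three-stage pipeline; per-item processing time in milliseconds
const stageTimesMs = { parse: 2, enrich: 10, write: 5 };

// With one worker per stage, the slowest stage (enrich, 10ms) is the bottleneck
const bottleneckMs = Math.max(...Object.values(stageTimesMs));
console.log(`Max throughput: ${1000 / bottleneckMs} items/sec`); // 100 items/sec

// Run 4 concurrent workers on the bottleneck stage: its effective time drops to 2.5ms,
// and the cap shifts to the next-slowest stage (write, 5ms)
const effectiveEnrichMs = stageTimesMs.enrich / 4;
const newBottleneckMs = Math.max(stageTimesMs.parse, effectiveEnrichMs, stageTimesMs.write);
console.log(`New max throughput: ${1000 / newBottleneckMs} items/sec`); // 200 items/sec
```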
High throughput means getting more done with the same resources. A system that processes 1,000 RPS on one server is more efficient than one that requires 10 servers for the same work. Concurrency is often the key to unlocking this efficiency.
Responsiveness and throughput are related but distinct. Understanding their relationship helps us make better architectural decisions.
How they correlate:
✅ Often complementary: The same techniques that improve throughput (parallel processing, non-blocking I/O) also improve responsiveness by freeing the main thread.
✅ Improved throughput reduces queuing: Higher throughput means shorter queues, which means lower wait times and better responsiveness under load.
❌ Sometimes in tension: Optimizing purely for throughput can hurt individual request latency. Batching increases throughput but delays individual items.
❌ Resource competition: Background batch processing might compete for CPU with real-time user requests.
```typescript
// Example: Database writes - batching trade-off
// (`database` is assumed to be an injected client exposing insert/batchInsert)

// Approach 1: Optimize for responsiveness
// Each write executes immediately
async function writeImmediate(record: Record): Promise<void> {
  await database.insert(record); // 5ms per write
}
// Latency: 5ms per record ✅
// Throughput: 200 records/second (1000ms / 5ms)

// Approach 2: Optimize for throughput
// Batch writes for efficiency
class BatchWriter {
  private buffer: Record[] = [];
  private flushInterval = 100; // ms

  async write(record: Record): Promise<void> {
    this.buffer.push(record);
    // Record won't be written until batch flushes
  }

  async flush(): Promise<void> {
    if (this.buffer.length > 0) {
      // Batch insert: 10ms for up to 100 records
      await database.batchInsert(this.buffer);
      this.buffer = [];
    }
  }
}
// Latency: 0-100ms (wait for batch) + 10ms = up to 110ms ❌
// Throughput: 1000 records/second (10x improvement) ✅

// Approach 3: Balanced solution
// Small batches with short max wait
class BalancedWriter {
  private buffer: Record[] = [];
  private maxWait = 20; // ms
  private maxBatchSize = 20;

  async write(record: Record): Promise<void> {
    this.buffer.push(record);
    if (this.buffer.length >= this.maxBatchSize) {
      await this.flush(); // Immediate flush if batch full
    }
    // Otherwise, flush in background after maxWait
  }

  private async flush(): Promise<void> {
    const batch = this.buffer;
    this.buffer = [];
    await database.batchInsert(batch);
  }
}
// Latency: 0-20ms + 8ms = up to 28ms (acceptable)
// Throughput: 500 records/second (2.5x improvement)
```

Little's Law: The mathematical relationship
Little's Law provides a precise relationship between throughput, latency, and concurrency:
L = λ × W
Where:
- L = Average number of items in system (concurrency level)
- λ = Throughput (arrival rate / completion rate)
- W = Average time in system (latency)
Rearranging for practical use: the concurrency you need is L = λ × W, that is, target throughput multiplied by average per-request latency.
This law tells us that to improve throughput while maintaining latency, we must increase concurrency—exactly what concurrent programming provides.
| Target Throughput | Current Latency | Required Concurrency |
|---|---|---|
| 100 RPS | 50ms | 5 concurrent requests |
| 1,000 RPS | 50ms | 50 concurrent requests |
| 10,000 RPS | 50ms | 500 concurrent requests |
| 1,000 RPS | 200ms | 200 concurrent requests |
| 10,000 RPS | 200ms | 2,000 concurrent requests |
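A quick sanity check of the table above in code (a sketch; "concurrency" here simply means the average number of requests in flight):

```typescript
// Little's Law: L = λ × W  (in-flight requests = throughput × latency)
function requiredConcurrency(throughputRps: number, latencyMs: number): number {
  return throughputRps * (latencyMs / 1000);
}

console.log(requiredConcurrency(100, 50));     // 5 concurrent requests
console.log(requiredConcurrency(10_000, 50));  // 500 concurrent requests
console.log(requiredConcurrency(10_000, 200)); // 2,000 concurrent requests
```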
Little's Law makes it mathematically clear: to achieve high throughput with acceptable latency, you MUST process work concurrently. There is no alternative. Single-threaded systems cannot scale because they limit concurrency to 1.
Let's see exactly how concurrency solves the responsiveness problem. The fundamental technique is separating work from interaction—allowing heavy computation or I/O to proceed without blocking the user-facing components.
Pattern 1: Background thread for heavy work
```typescript
// BEFORE: Single-threaded, UI blocks during heavy work
class ImageEditorBlocking {
  applyFilter(image: Image, filter: Filter): void {
    // 3 seconds of CPU work - UI completely frozen!
    const result = this.processPixels(image, filter);
    this.displayResult(result);
  }
}

// AFTER: Concurrent, UI remains responsive
class ImageEditorConcurrent {
  async applyFilter(image: Image, filter: Filter): Promise<void> {
    // Show immediate feedback
    this.showProgressIndicator();
    this.disableFilterButton(); // Prevent double-click

    // Move heavy work to background thread
    const result = await this.runInWorker(() => {
      return this.processPixels(image, filter);
    });

    // Update UI on main thread
    this.hideProgressIndicator();
    this.displayResult(result);
    this.enableFilterButton();
  }

  private runInWorker<T>(task: () => T): Promise<T> {
    return new Promise((resolve) => {
      const worker = new Worker('image-processor.js');
      worker.onmessage = (e) => resolve(e.data);
      worker.postMessage({ task: task.toString() });
    });
  }
}

// User experience comparison:
// Blocking:   Click → Freeze 3s → Result
// Concurrent: Click → Progress bar → Can cancel → Result
```

Pattern 2: Asynchronous I/O allows overlapping waits
```typescript
// BEFORE: Sequential blocking I/O
function fetchDashboardDataBlocking(): DashboardData {
  const user = database.fetchUserSync(userId);        // 50ms wait
  const orders = database.fetchOrdersSync(userId);    // 80ms wait
  const recommendations = api.fetchRecsSync(userId);  // 100ms wait
  // Total: 230ms, UI blocked entire time
  return { user, orders, recommendations };
}

// AFTER: Parallel async I/O
async function fetchDashboardDataAsync(): Promise<DashboardData> {
  // Start all fetches simultaneously
  const [user, orders, recommendations] = await Promise.all([
    database.fetchUser(userId),        // 50ms
    database.fetchOrders(userId),      // 80ms
    api.fetchRecommendations(userId)   // 100ms
  ]);
  // Total: max(50, 80, 100) = 100ms
  // 56% faster! And we can show partial results even earlier
  return { user, orders, recommendations };
}

// EVEN BETTER: Progressive rendering
async function fetchDashboardDataProgressive(): Promise<void> {
  // Show skeleton immediately
  renderSkeleton();

  // Fetch and render as data arrives
  database.fetchUser(userId).then(user => {
    renderUserSection(user);       // Shows in ~50ms
  });

  database.fetchOrders(userId).then(orders => {
    renderOrdersSection(orders);   // Shows in ~80ms
  });

  api.fetchRecommendations(userId).then(recs => {
    renderRecommendations(recs);   // Shows in ~100ms
  });

  // User sees content progressively, feels much faster!
}
```

With concurrency, the same work takes the same amount of total time, but the user experience is transformed. The UI stays responsive, progress is visible, and the system feels fast even when doing heavy work.
Concurrency fundamentally transforms throughput by allowing multiple operations to proceed simultaneously. This increase can be dramatic—often 10x, 100x, or more improvement over single-threaded processing.
The throughput multiplier effect:
Consider a simple web server handling requests that take 50ms each (40ms I/O wait + 10ms CPU work):
Single-threaded: one request at a time. Each request occupies the thread for the full 50ms, so roughly 1000ms / 50ms = 20 requests/second.

Multi-threaded with 10 threads: up to 10 requests in flight at once, so roughly 10 × 20 = 200 requests/second.

Multi-threaded with async I/O and thread pool: the 40ms I/O waits overlap instead of holding threads, so throughput is limited mainly by the 10ms of CPU work, on the order of 100 requests/second per core and several hundred across a multi-core machine.
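The back-of-the-envelope arithmetic behind those figures (a sketch that ignores scheduling and queuing overhead):

```typescript
// Hypothetical request profile: 40ms waiting on I/O + 10ms of CPU work
const ioMs = 40, cpuMs = 10, totalMs = ioMs + cpuMs;

// Single-threaded: one request at a time
console.log(1000 / totalMs);        // 20 requests/second

// 10 blocking threads: up to 10 requests in flight
console.log(10 * (1000 / totalMs)); // 200 requests/second

// Async I/O: waits overlap, so a core is limited only by CPU time per request
console.log(1000 / cpuMs);          // ~100 requests/second per core
```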
```typescript
// Sequential processing: one at a time
async function processOrdersSequential(orders: Order[]): Promise<void> {
  for (const order of orders) {
    await validateOrder(order);    // 10ms
    await checkInventory(order);   // 20ms
    await chargePayment(order);    // 50ms
    await sendConfirmation(order); // 30ms
  }
  // 100 orders × 110ms = 11 seconds
  // Throughput: 9 orders/second
}

// Concurrent processing: many at once
async function processOrdersConcurrent(orders: Order[]): Promise<void> {
  // Process all orders concurrently
  await Promise.all(orders.map(async (order) => {
    await validateOrder(order);
    await checkInventory(order);
    await chargePayment(order);
    await sendConfirmation(order);
  }));
  // 100 orders, I/O overlapped
  // Total time: ~200ms (limited by external API capacity)
  // Throughput: 500 orders/second (55x improvement!)
}

// Controlled concurrency: bounded parallelism
async function processOrdersControlled(
  orders: Order[],
  concurrencyLimit: number = 20
): Promise<void> {
  const semaphore = new Semaphore(concurrencyLimit);

  await Promise.all(orders.map(async (order) => {
    await semaphore.acquire();
    try {
      await validateOrder(order);
      await checkInventory(order);
      await chargePayment(order);
      await sendConfirmation(order);
    } finally {
      semaphore.release();
    }
  }));
  // Balanced: high throughput without overwhelming external services
  // Throughput: ~180 orders/second with 20 concurrent
}
```
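The Semaphore used in the bounded version above is not a JavaScript built-in; a minimal promise-based sketch of what it might look like:

```typescript
// Minimal counting semaphore for bounding concurrent async work
class Semaphore {
  private available: number;
  private waiters: Array<() => void> = [];

  constructor(permits: number) {
    this.available = permits;
  }

  async acquire(): Promise<void> {
    if (this.available > 0) {
      this.available--;
      return;
    }
    // No permit free: park until a release() wakes us
    await new Promise<void>((resolve) => this.waiters.push(resolve));
  }

  release(): void {
    const next = this.waiters.shift();
    if (next) {
      next(); // hand the permit directly to the next waiter
    } else {
      this.available++;
    }
  }
}
```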
Understanding throughput gains:

Throughput improvement from concurrency depends on the nature of the work:
The key insight: most application workloads spend the majority of time waiting for I/O. Concurrency lets us do useful work during that wait time instead of idling.
| Workload Type | Single-threaded | 8 Threads | Async I/O | Improvement |
|---|---|---|---|---|
| Pure I/O (API calls) | 20 RPS | 160 RPS | 2,000+ RPS | 100x |
| Mixed (70% I/O) | 50 RPS | 200 RPS | 800 RPS | 16x |
| Balanced (50% I/O) | 100 RPS | 300 RPS | 500 RPS | 5x |
| CPU-heavy (20% I/O) | 200 RPS | 400 RPS | 450 RPS | 2.25x |
| Pure CPU | 500 RPS | 4,000 RPS | 4,000 RPS | 8x (core limit) |
Most real-world applications are I/O-bound. This is good news! It means concurrency can deliver massive throughput gains without expensive hardware upgrades. The improvement comes from using existing hardware more efficiently.
In practice, we need both responsiveness and throughput. The challenge is balancing them when they conflict. Here are battle-tested strategies:
Strategy 1: Priority queues / Quality of Service
Separate work into priority classes. User-facing requests get higher priority than background batch work.
```typescript
enum WorkPriority {
  REALTIME = 0,    // User is actively waiting (< 100ms target)
  INTERACTIVE = 1, // User-triggered, visible (< 500ms target)
  BACKGROUND = 2,  // Async work (< 30s target)
  BATCH = 3        // Bulk processing (no strict deadline)
}

class PriorityWorkQueue {
  private queues: Map<WorkPriority, WorkItem[]> = new Map();
  private workers: Worker[] = [];

  constructor() {
    // One queue per priority class
    for (const priority of [
      WorkPriority.REALTIME,
      WorkPriority.INTERACTIVE,
      WorkPriority.BACKGROUND,
      WorkPriority.BATCH
    ]) {
      this.queues.set(priority, []);
    }
  }

  submit(work: WorkItem, priority: WorkPriority): void {
    this.queues.get(priority)?.push(work);
  }

  // Higher priority work is always processed first
  private getNextWork(): WorkItem | null {
    for (const priority of [
      WorkPriority.REALTIME,
      WorkPriority.INTERACTIVE,
      WorkPriority.BACKGROUND,
      WorkPriority.BATCH
    ]) {
      const queue = this.queues.get(priority);
      if (queue && queue.length > 0) {
        return queue.shift()!;
      }
    }
    return null;
  }
}

// Usage: User actions get priority over batch jobs
const workQueue = new PriorityWorkQueue();
workQueue.submit(userSearchQuery, WorkPriority.REALTIME);
workQueue.submit(analyticsUpdate, WorkPriority.BACKGROUND);
workQueue.submit(dailyReportGeneration, WorkPriority.BATCH);
```

Strategy 2: Separate resource pools
Dedicate separate threads/connections for different workload types. User requests can't be blocked by batch processing because they use different resources.
```typescript
// Database connection pools separated by use case
// (`Pool` is assumed to be the connection-pool class of your database client)
const connectionPools = {
  // User requests: fast pool with aggressive timeouts
  interactive: new Pool({
    size: 10,
    maxWait: 100,      // Fail fast if no connection available
    queryTimeout: 5000
  }),

  // Analytics queries: smaller pool, can wait longer
  analytics: new Pool({
    size: 5,
    maxWait: 5000,
    queryTimeout: 30000
  }),

  // Batch operations: a couple of connections, won't starve the others
  batch: new Pool({
    size: 2,
    maxWait: 60000,
    queryTimeout: 300000 // 5 minute timeout for batch
  })
};

// Usage: Route work to appropriate pool
async function handleUserRequest(req: Request) {
  // Uses interactive pool - guaranteed fast connection
  return connectionPools.interactive.query(req.sql);
}

async function runDailyReport() {
  // Uses batch pool - won't affect user request pool
  return connectionPools.batch.query(reportQuery);
}
```

Strategy 3: Work chunking with yields
Break large batch operations into small chunks that periodically yield to higher-priority work.
```typescript
// Small helper: resolve after the given number of milliseconds
const sleep = (ms: number) => new Promise<void>((resolve) => setTimeout(resolve, ms));

// Process large dataset without blocking responsive work
async function processLargeDataset(records: Record[]): Promise<void> {
  const CHUNK_SIZE = 100;
  const YIELD_INTERVAL_MS = 10; // Check for higher-priority work

  for (let i = 0; i < records.length; i += CHUNK_SIZE) {
    const chunk = records.slice(i, i + CHUNK_SIZE);

    // Process this chunk
    await processChunk(chunk);

    // Yield to event loop - allows responsive work to run
    await sleep(YIELD_INTERVAL_MS);

    // Report progress for visibility
    console.log(`Processed ${i + chunk.length} / ${records.length}`);
  }
}

// In browser context, use requestIdleCallback for even better behavior
function processIdleWork(records: Record[]): void {
  let index = 0;

  function processNext(deadline: IdleDeadline) {
    // Process records while we have idle time
    while (index < records.length && deadline.timeRemaining() > 1) {
      processRecord(records[index++]);
    }

    if (index < records.length) {
      // More work to do, schedule next idle callback
      requestIdleCallback(processNext);
    }
  }

  requestIdleCallback(processNext);
}
```

When responsiveness and throughput conflict, responsiveness usually wins. Users forgive slow background processing, but they don't forgive frozen interfaces. Optimize for throughput only when it doesn't hurt responsiveness, or explicitly inform users of the trade-off.
We've explored the dual objectives that motivate concurrent programming. Let's consolidate the key insights:

- Responsiveness is about perceived reaction time: keep user-facing interactions within human perceptual thresholds (ideally under 100ms) and never freeze the UI, even while heavy work runs in the background.
- Throughput is about aggregate capacity: the rate at which the system completes work with the resources it has.
- Measure responsiveness with percentiles (P95, P99), never averages; measure throughput in operations per second and find the bottleneck that caps it.
- Little's Law (L = λ × W) makes the connection precise: high throughput at acceptable latency requires processing work concurrently.
- The two goals usually reinforce each other, but when they conflict, user-facing responsiveness generally wins; protect it with priority queues, separate resource pools, and chunked background work.
What's next:
We've established why we need concurrency (single-threaded limitations) and what we're trying to achieve (responsiveness and throughput). The next page explores how modern hardware enables concurrency through multi-core architecture. Understanding CPU cores, cache hierarchies, and memory models will ground our concurrent programming techniques in the physical reality of modern computers.
You now understand the twin goals of concurrent programming. Every concurrency technique we'll learn serves one or both of these objectives: keeping systems responsive to users and maximizing the work we can complete with available resources.