Why do airlines overbook flights? Why do banks not keep cash equal to all deposits? Why do cellular networks sell more capacity than they have?
The answer in every case is the same fundamental insight: most resources are never used at full capacity simultaneously. By carefully analyzing usage patterns and accepting small probabilities of congestion, shared resources can serve far more users than dedicated alternatives at dramatically lower cost per user.
This principle—statistical multiplexing gain—is the quantitative foundation of modern communications. Understanding it explains why Internet access costs dollars per month instead of thousands, why mobile phones work despite limited spectrum, and why cloud computing is economically viable.
This page develops the mathematics and engineering behind efficiency gains, giving you tools to analyze and optimize shared communication systems.
By the end of this page, you will understand: the mathematical basis of statistical multiplexing gain, how to calculate efficiency for different traffic types, the tradeoff between efficiency and quality of service, real-world examples of efficiency optimization, and how to reason about capacity planning for shared systems.
Utilization is the fundamental efficiency metric: the fraction of channel capacity actually carrying useful data. Let's compare dedicated and shared approaches.
Dedicated Channel Utilization
In a dedicated channel system, each user receives exclusive capacity whether they're using it or not. Consider voice calls: each direction of a conversation carries actual speech only about 35% of the time (the rest is listening and pauses), so a dedicated 64 kbps voice circuit sits idle roughly 65% of the time.

For bursty data applications, the situation is worse: a web user actively transfers data only 1-5% of the time, leaving a dedicated link idle more than 95% of the time.
Shared Channel Utilization
When multiple sources share a channel, their independent utilization patterns combine favorably:
| Scenario | Dedicated System | Shared System | Improvement |
|---|---|---|---|
| Voice (35% activity factor) | 35% per line | 80%+ aggregate | 2.3x capacity |
| Office workstation (8% active) | 8% per port | 70% aggregate | 8.7x capacity |
| Web browsing (1% active transfer) | 1% per user | 60% aggregate | 60x capacity |
| IoT sensors (0.1% duty cycle) | 0.1% per device | 50% aggregate | 500x capacity |
The activity factor is the fraction of time a source is actively using capacity. Lower activity factors mean greater potential for multiplexing gain. Voice has ~35% activity; interactive data has ~1-5%; IoT sensors often have <1%. Applications with very low activity factors benefit most dramatically from statistical multiplexing.
The Mathematics of Aggregation
Let's formalize why aggregation improves utilization. Consider n independent sources, each with:

- Peak rate R
- Activity factor a (the fraction of time the source is active)
- Mean load μ = a × R and standard deviation σ

Dedicated System: must provision for every peak simultaneously, so capacity is n × R.

Shared System: must provision only for the statistically likely aggregate maximum, close to n × μ plus headroom.
The shared system doesn't need capacity for all peaks simultaneously—only enough for the statistically likely maximum. As n grows large, this maximum becomes increasingly predictable.
Statistical multiplexing gain is the ratio of users a shared system can support compared to a dedicated system with the same total capacity. This gain derives from probability theory and grows with the number of sources.
Derivation Using Central Limit Theorem
Consider n independent, identically distributed sources, each with mean μ and variance σ². The aggregate load X is the sum:
X = X₁ + X₂ + ... + Xₙ
By the Central Limit Theorem, as n grows the aggregate load approaches a normal distribution with mean E[X] = n × μ and standard deviation StdDev[X] = √n × σ.
The coefficient of variation (relative variability) is: CV = StdDev[X] / E[X] = (√n × σ) / (n × μ) = σ / (μ × √n)
Crucially, relative variability decreases with √n. With 100 sources, CV is 10× lower than with 1 source. The aggregate becomes increasingly smooth and predictable.
The coefficient of variation decreasing as 1/√n is sometimes called the 'square root rule.' It means that to reduce relative variability by half, you need to aggregate 4× more sources. This explains why larger networks achieve better statistical multiplexing gains than smaller ones—more sources to average.
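The square-root rule is easy to verify numerically. The sketch below is illustrative, assuming independent on-off sources with the voice-like values used later in this page (activity factor 0.35, peak 64 kbps); it estimates the coefficient of variation of the aggregate load by Monte Carlo, and quadrupling the number of sources roughly halves the CV:

```python
import random
import statistics

def aggregate_cv(n_sources, activity=0.35, peak=64.0, trials=2000, rng=None):
    """Estimate the coefficient of variation of the aggregate load of
    n_sources independent on-off sources (each transmits at `peak` kbps
    with probability `activity`, else sends nothing)."""
    rng = rng or random.Random(42)  # fixed seed for reproducibility
    loads = []
    for _ in range(trials):
        active = sum(1 for _ in range(n_sources) if rng.random() < activity)
        loads.append(active * peak)
    return statistics.stdev(loads) / statistics.mean(loads)

# CV shrinks roughly as 1/sqrt(n): 4x more sources, half the CV
for n in (1, 4, 16, 64, 256):
    print(n, round(aggregate_cv(n), 3))
```

The theoretical value for on-off sources is CV = √((1 − a)/a) / √n, about 1.36 for a single 35%-active source.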
Calculating Required Capacity
To support n sources with shared capacity C, we need the probability of aggregate demand exceeding C to be below some acceptable threshold ε:
P(X > C) < ε
Using the normal approximation:
C = n × μ + z_ε × √n × σ
Where z_ε is the z-score corresponding to probability ε (e.g., z = 2.33 for ε = 1%).
Multiplexing Gain Calculation
The multiplexing gain G is the ratio of dedicated to shared capacity:
G = (n × R) / (n × μ + z_ε × √n × σ)
As n → ∞, the z_ε × √n × σ term becomes negligible compared to n × μ, and:
G → R / μ = 1 / a (the inverse of activity factor)
Example: 1000 Voice Sources
Assume each source has peak rate R = 64 kbps, mean load μ = 22.4 kbps (35% activity), and standard deviation σ = 25 kbps, with ε = 1% (z = 2.33):

Dedicated: 1000 × 64 = 64,000 kbps = 64 Mbps

Shared: 1000 × 22.4 + 2.33 × √1000 × 25 = 22,400 + 1,842 = 24,242 kbps ≈ 24.2 Mbps
Gain: 64 / 24.2 ≈ 2.6×
The shared system needs only 38% of the dedicated system's capacity while achieving 99% service probability.
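The sizing rule is simple enough to capture in a few lines. This sketch reproduces the 1000-voice-source arithmetic from the text (units are kbps):

```python
import math

def required_capacity(n, mu, sigma, z=2.33):
    """Shared capacity so that P(aggregate demand > C) ~ epsilon, using the
    normal approximation: C = n*mu + z_eps * sqrt(n) * sigma."""
    return n * mu + z * math.sqrt(n) * sigma

def multiplexing_gain(n, peak, mu, sigma, z=2.33):
    """Ratio of dedicated capacity (n * peak) to shared capacity."""
    return (n * peak) / required_capacity(n, mu, sigma, z)

# The 1000-voice-source example from the text:
C = required_capacity(1000, mu=22.4, sigma=25.0)
G = multiplexing_gain(1000, peak=64.0, mu=22.4, sigma=25.0)
print(round(C), round(G, 2))  # ≈ 24242 kbps, gain ≈ 2.64x
```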
| Number of Sources | Dedicated Capacity | Shared Capacity | Multiplexing Gain |
|---|---|---|---|
| 10 | 640 kbps | 408 kbps | 1.6× |
| 100 | 6.4 Mbps | 2.8 Mbps | 2.3× |
| 1,000 | 64 Mbps | 24.2 Mbps | 2.6× |
| 10,000 | 640 Mbps | 230 Mbps | 2.8× |
| 100,000 | 6.4 Gbps | 2.27 Gbps | 2.8× |
For circuit-switched systems like traditional telephony, the Erlang model provides precise tools for dimensioning shared resources. Developed by Agner Erlang in the early 1900s, these formulas remain fundamental to telecommunications engineering.
Traffic Load Measurement: The Erlang
The Erlang is the standard unit of telecommunications traffic intensity:
Traffic A (in Erlangs) = λ × h, where:

- λ = call arrival rate (calls per unit time)
- h = average call holding time (in the same time unit)
Example: 600 calls per hour, average duration 3 minutes: A = 600 × (3/60) hours = 30 Erlangs.
Another unit for traffic is CCS (centum call seconds): 100 call-seconds. Since an hour = 3600 seconds, 1 Erlang = 36 CCS. CCS is common in North American telephony; Erlangs are standard internationally. Both measure the same thing: occupancy.
Erlang B Formula: Blocking Probability
For systems where blocked calls are cleared (caller hangs up and may retry later), the Erlang B formula gives the probability of blocking:
P(blocking) = B(A, n) = (Aⁿ/n!) / Σₖ₌₀ⁿ (Aᵏ/k!)
Where:

- A = offered traffic in Erlangs
- n = number of circuits
This formula assumes:

- Calls arrive as a Poisson process from a large user population
- Blocked calls are cleared and do not immediately retry
- Holding times have a finite mean (Erlang B is insensitive to the distribution's shape)
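In software, Erlang B is usually computed with a numerically stable recursion rather than the factorial form above, which overflows for large n. A minimal sketch (published tables may differ from this by a circuit or so depending on rounding conventions):

```python
def erlang_b(traffic, circuits):
    """Blocking probability B(A, n), via the stable recursion
    B(A, 0) = 1;  B(A, k) = A*B(A, k-1) / (k + A*B(A, k-1))."""
    b = 1.0
    for k in range(1, circuits + 1):
        b = traffic * b / (k + traffic * b)
    return b

def circuits_for(traffic, target_blocking=0.02):
    """Smallest number of circuits meeting the blocking target."""
    n = 1
    while erlang_b(traffic, n) > target_blocking:
        n += 1
    return n

print(circuits_for(50))               # circuits for 50 Erlangs at <=2% blocking
print(round(erlang_b(50, 62), 4))     # blocking with 62 circuits: well under 2%
```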
Erlang C Formula: Waiting Probability
For systems where blocked calls wait (call center queues), the Erlang C formula gives the probability of waiting:
P(waiting > 0) = C(A, n) = [n × B(A, n)] / [n - A × (1 - B(A, n))]
This is used for dimensioning systems with queues, like call centers or packet networks.
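A self-contained sketch of Erlang C, computing B internally with the same stable recursion; it is only valid when offered traffic A is below the number of servers n (otherwise the queue is unstable):

```python
def erlang_c(traffic, servers):
    """Probability an arriving call must wait:
    C(A, n) = n*B / (n - A*(1 - B)), valid for A < n."""
    if traffic >= servers:
        raise ValueError("offered traffic must be below server count")
    b = 1.0  # Erlang B via the stable recursion
    for k in range(1, servers + 1):
        b = traffic * b / (k + traffic * b)
    return servers * b / (servers - traffic * (1.0 - b))

# A call center offered 50 Erlangs: more agents, less waiting
print(round(erlang_c(50, 55), 3), round(erlang_c(50, 62), 3))
```

Note that C(A, n) is always at least B(A, n): a call that would have been blocked now waits instead.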
Using Erlang Tables for Capacity Planning
Erlang tables (or calculators) determine required circuits for given traffic and blocking targets:
Example: Support 50 Erlangs with ≤2% blocking: the tables call for about 62 circuits, i.e. 0.81 Erlangs carried per circuit.
The gain increases with traffic volume. Higher Erlang loads allow proportionally fewer circuits per Erlang.
| Offered Traffic (Erlangs) | Circuits Required | Erlangs per Circuit | Gain vs. 1:1 |
|---|---|---|---|
| 10 | 17 | 0.59 | 1.7× |
| 30 | 42 | 0.71 | 2.1× |
| 50 | 62 | 0.81 | 2.5× |
| 100 | 115 | 0.87 | 2.9× |
| 500 | 527 | 0.95 | 3.2× |
| 1,000 | 1,030 | 0.97 | 3.4× |
Packet switching takes statistical multiplexing further than circuit switching by sharing capacity at the packet level rather than the call level. This enables dramatically higher efficiency for bursty traffic.
Circuit vs. Packet Efficiency
In circuit switching:

- Capacity is reserved for the entire call, including silent and idle periods
- Efficiency is capped at the activity factor of the conversation

In packet switching:

- Capacity is consumed only while packets are actually in flight
- Idle periods are immediately available to other users' packets
The Bursty Data Advantage
Data applications are far more bursty than voice: web browsing actively transfers perhaps 2% of the time, email and messaging around 0.5%, and IoT sensors as little as 0.1%, versus roughly 35% for voice.
Packet switching handles this burstiness naturally. A 100 Mbps link serving 1000 users at 1% average activity can let any user burst to 100 Mbps when idle users aren't transmitting.
| Traffic Type | Activity Factor | Circuit Efficiency | Packet Efficiency | Packet Advantage |
|---|---|---|---|---|
| Voice call | 35% | 35% | 35%* | 1× |
| Video conference | 50% | 50% | 50%* | 1× |
| Web browsing | 2% | 2% | 70% | 35× |
| Email/messaging | 0.5% | 0.5% | 60% | 120× |
| IoT sensor | 0.1% | 0.1% | 50% | 500× |

*Constant-bit-rate voice and video gain little from packetization unless silence suppression is used.
Voice over IP can achieve higher efficiency than circuit-switched voice by using silence suppression—not sending packets during silence periods. This can push voice efficiency from 35% to 50-60%, at the cost of potential clipping if voice activity detection makes mistakes.
Queuing Theory for Packet Networks
Packet network efficiency analysis uses queuing theory. The key relationship is the utilization factor:
ρ = λ / μ
Where:

- λ = average packet arrival rate (packets per second)
- μ = service rate (packets per second the link can transmit)
For stability, ρ must be < 1 (arrivals must be slower than service on average).
Wait Time and Utilization Tradeoff
For M/M/1 queues (Poisson arrivals, exponential service, single server), the average time W a packet spends in the system (queueing plus transmission) is:

W = 1 / (μ - λ) = (1/μ) / (1 - ρ)
As ρ approaches 1, the (1 - ρ) denominator shrinks toward zero and W grows without bound: relative to ρ = 0.5, delay is 5× higher at ρ = 0.9 and 50× higher at ρ = 0.99.
This nonlinear relationship explains why packet networks can't run at 100% utilization—queuing delays would become unbounded. Practical targets are 70-80% utilization to balance efficiency against latency.
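The delay blow-up near full utilization is easy to see numerically. A minimal sketch of the M/M/1 relationship, with an illustrative link serving 10,000 packets per second:

```python
def mm1_wait(service_rate, utilization):
    """Average time in an M/M/1 system: W = (1/mu) / (1 - rho)."""
    if not 0 <= utilization < 1:
        raise ValueError("queue is unstable for rho >= 1")
    return (1.0 / service_rate) / (1.0 - utilization)

# Delay vs utilization for a 10,000 packet/s link (seconds)
for rho in (0.5, 0.8, 0.9, 0.99):
    print(rho, mm1_wait(10_000, rho))
```

At ρ = 0.5 the average stay is 0.2 ms; at ρ = 0.99 it is 10 ms, a 50× increase for the last half of utilization.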
Statistical Multiplexing in Practice
Consider a 1 Gbps link serving 10,000 users, each with (say) a 1 Mbps peak rate and roughly 10% activity: average aggregate demand is near 1 Gbps, while the theoretical simultaneous peak is 10 Gbps.
If all users transmitted simultaneously (all 10% activity aligned), the link would be overwhelmed. But statistically, this never happens—the law of large numbers ensures aggregate demand stays near the 1 Gbps average with high probability.
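A quick Monte Carlo makes "this never happens" concrete. Assuming, illustratively, 1 Mbps peak per user and 10% independent activity, the aggregate has mean 1 Gbps but a standard deviation of only about 30 Mbps, so snapshots cluster tightly around the mean. (These illustrative numbers put the mean right at the link rate; a real design would keep the mean comfortably below capacity.)

```python
import random

def aggregate_demand_gbps(n_users=10_000, activity=0.10, peak_mbps=1.0, rng=None):
    """One snapshot of total offered load: each user is independently
    active with probability `activity`, sending at `peak_mbps` if active."""
    rng = rng or random
    active = sum(1 for _ in range(n_users) if rng.random() < activity)
    return active * peak_mbps / 1000.0

rng = random.Random(7)
samples = [aggregate_demand_gbps(rng=rng) for _ in range(200)]
print(round(min(samples), 3), round(max(samples), 3))  # tight around 1.0 Gbps
```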
Efficiency gains translate directly to cost savings. Understanding this relationship helps justify investment in multiplexing infrastructure and informs pricing models.
Capital Cost Savings
Multiplexing reduces infrastructure investment proportionally to multiplexing gain:
Example: Metropolitan Fiber Network
| Configuration | Fiber Strands | Equipment Cost | Total Cost |
|---|---|---|---|
| Dedicated 10G per customer (1000 customers) | 1,000 | $50M | $75M |
| Shared 100G with 10:1 mux (1000 customers) | 10 | $5M | $8M |
| Savings | 99% | 90% | 89% |
The multiplexed approach costs 1/9th as much while serving the same customers equally well (for typical usage patterns).
| Cost Component | Dedicated Approach | Multiplexed Approach | Reduction |
|---|---|---|---|
| Physical infrastructure | $500M | $50M | 90% |
| Network equipment | $200M | $30M | 85% |
| Power/cooling (annual) | $10M | $2M | 80% |
| Maintenance (annual) | $5M | $1M | 80% |
| Total 5-year TCO | $775M | $95M | 88% |
Pricing Implications
Multiplexing efficiency enables consumption-based and shared pricing models:
Committed Information Rate (CIR): Customers purchase guaranteed capacity less than their peak access rate, paying premium only for firm guarantees.
Burst Pricing: Customers pay base rate for normal usage, premium for occasional bursts—matching cost to actual resource consumption.
Best-Effort/Oversubscription: Residential Internet is highly oversubscribed (often 20-50:1 on aggregation links) because most users are inactive most of the time.
Cloud Computing Economics
Cloud providers apply multiplexing principles to compute, storage, and networking:

- Compute: many VMs share physical cores, since most VMs idle most of the time
- Storage: volumes are thin-provisioned, with physical blocks allocated only as data is written
- Networking: data center fabrics are oversubscribed, counting on most VMs being low-bandwidth
Aggressive oversubscription increases risk. If usage patterns change (everyone works from home, major event causes simultaneous demand), the statistical assumptions fail. The 2020 pandemic revealed oversubscription limits as residential networks, designed for 5% concurrent use, faced 50% concurrent use. Responsible engineering includes margins for unexpected load.
Higher efficiency comes at the cost of statistical guarantees. Understanding this tradeoff is essential for system design that balances cost against user experience.
The Fundamental Tradeoff
Shared resources achieve efficiency by pooling unused capacity from idle users. But this means:

- When unusually many users are active at once, there is not enough capacity for everyone
- Guarantees become statistical ("available 99% of the time") rather than absolute
- Heavy aggregate demand degrades every user's experience simultaneously
Quantifying the Tradeoff
For a given multiplexing gain G and number of users n, there's an associated congestion probability P_c = P(X > C). Under the normal approximation used earlier, provisioning C = n × μ + z_ε × √n × σ makes P_c ≈ ε; raising G means trimming the √n headroom term, which raises P_c.
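Under the normal approximation from the derivation earlier, congestion probability for a given capacity follows directly from the Gaussian tail. A sketch, checked against the 1000-voice-source example (C ≈ 24,242 kbps should give P_c ≈ 1%):

```python
import math

def congestion_probability(capacity, n, mu, sigma):
    """P(aggregate demand > capacity) under the normal approximation:
    P_c = 1 - Phi((C - n*mu) / (sqrt(n)*sigma)), with Phi built from erf."""
    z = (capacity - n * mu) / (math.sqrt(n) * sigma)
    return 0.5 * (1.0 - math.erf(z / math.sqrt(2.0)))

# The 1000-voice-source system (kbps): expect roughly 0.01
print(round(congestion_probability(24_242, 1000, 22.4, 25.0), 4))
```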
Service Level Targets
Different applications tolerate different congestion levels: voice dimensioning typically targets 1-2% blocking, interactive applications tolerate only brief queuing delay, and bulk transfers tolerate substantial congestion as long as they eventually complete.
Quality of Service Mechanisms
To serve multiple service levels on shared infrastructure, QoS mechanisms provide differentiated treatment:
Priority Queuing: Critical traffic (voice, video) gets served before bulk traffic (downloads, backups). During congestion, low-priority traffic waits while high-priority traffic proceeds.
Weighted Fair Queuing: Each class gets proportional share of capacity. Business traffic might get 70% weight, residential 30%—so business users experience less congestion.
Rate Limiting: Each user's traffic is constrained to purchased capacity despite sharing the aggregate link. Prevents any user from monopolizing shared resources.
Admission Control: For circuit-like services (voice), new requests are rejected if accepting them would overload the network. Protects existing sessions at the cost of blocking new ones.
Often, 20% of users generate 80% of traffic. QoS mechanisms can constrain heavy users during congestion, reserving capacity for the majority. This makes high oversubscription ratios practical—the few heavy users are rate-limited, the many light users experience no impact.
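Rate limiting as described above is commonly implemented with a token bucket. A minimal illustrative sketch (class and parameter names are hypothetical): tokens accrue at the sustained rate up to a burst depth, and traffic passes only while tokens remain.

```python
class TokenBucket:
    """Illustrative token-bucket rate limiter: tokens accrue at `rate`
    per second up to `burst`; a packet of `size` passes only if that
    many tokens are available."""

    def __init__(self, rate, burst):
        self.rate = rate      # sustained tokens (e.g. bytes) per second
        self.burst = burst    # maximum bucket depth (burst allowance)
        self.tokens = burst   # start full: an initial burst is allowed
        self.last = 0.0       # timestamp of the previous check

    def allow(self, size, now):
        # Refill for elapsed time, capped at the bucket depth.
        self.tokens = min(self.burst, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= size:
            self.tokens -= size
            return True
        return False
```

Per-user buckets cap sustained use at the purchased rate while still permitting short bursts, which is exactly the behavior oversubscribed links need from their heaviest users.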
Let's examine how these efficiency principles apply in actual systems.
Example 1: Residential Internet Service
A cable ISP serving 100,000 homes:

- Each home buys a plan with a peak rate far above its average use
- Aggregation links are oversubscribed roughly 250:1 relative to the sum of peak rates

This works because:

- Only a small fraction of homes are transferring at any instant
- Individual peak moments are staggered across users and hours
- Heavy users can be rate-limited during the busiest periods

Result: Infrastructure costs roughly 1/250th of the dedicated approach, sufficient for 99%+ of situations.
Example 2: Cellular Network
A 5G cell sector serving 1,000 active devices:

- Each device could momentarily demand tens of Mbps, but duty cycles are low
- The scheduler reallocates the shared spectrum among devices millisecond by millisecond

Multiplexing gain: 100×. This enables mobile broadband at consumer prices despite spectrum scarcity.
Example 3: Cloud Data Center
A cloud provider with 10,000 VMs on 1,000 servers:
Most VMs are idle or low-bandwidth. The few high-bandwidth VMs statistically spread across servers. Total efficiency allows 10,000 × 1 Gbps (10 Tbps) of virtual capacity on ~250 Gbps of physical backbone.
| System | Peak-to-Actual Ratio | Achieved Utilization | Cost Reduction |
|---|---|---|---|
| Residential Cable ISP | 250:1 | ~75% | 99.6% |
| Enterprise WAN | 10:1 | ~60% | 90% |
| 5G Cellular Sector | 100:1 | ~50% | 99% |
| Cloud Data Center Network | 40:1 | ~40% | 97.5% |
| Submarine Cable System | 5:1 | ~85% | 80% |
Submarine cables have modest oversubscription because capacity is extremely expensive ($200-400M per cable) and demand is aggregated from entire continents. Traffic is already smoothed across millions of users before reaching the cable. High utilization (85%+) is essential for ROI, and backup cables handle failures rather than statistical headroom.
We've developed the quantitative framework for understanding multiplexing efficiency. Let's consolidate the key insights:

- Multiplexing gain approaches 1/a, the inverse of the activity factor, as the number of sources grows
- Relative variability of the aggregate load falls as 1/√n: larger pools are smoother and more predictable
- Erlang B and C dimension circuit-style systems; utilization ρ and M/M/1 delay dimension packet systems
- Efficiency trades against congestion probability, and QoS mechanisms manage that tradeoff per service class
- These gains translate directly into order-of-magnitude capital and operating cost reductions
What's Next:
With the efficiency gains of multiplexing established, the next page explores the types of multiplexing—the specific techniques (FDM, TDM, WDM, CDM, statistical) that implement channel sharing. Each technique offers different tradeoffs suitable for different applications.
You now understand the quantitative framework for multiplexing efficiency—the mathematical and economic foundations that make modern communications affordable. This knowledge lets you reason about capacity planning, pricing, and the tradeoffs between efficiency and quality of service.