
Traffic Shaping

Level: Advanced

Duration: 75 mins

Topic: Traffic Shaping


Traffic Shaping Concept

The Art of Controlling Data Flow

Imagine a highway during rush hour. Without traffic signals, lane mergers become chaos—aggressive drivers cut ahead while others wait indefinitely. Some reach their destination quickly while others are stuck for hours. Now imagine the same highway with intelligent traffic management: vehicles enter at controlled rates, high-priority emergency vehicles get dedicated lanes, and everyone's journey becomes predictable.

Traffic shaping in computer networks works on the same principle. It's the sophisticated art of controlling when, how fast, and in what order data packets traverse a network. Without traffic shaping, bursty applications can monopolize bandwidth, latency becomes unpredictable, and network fairness collapses. With proper traffic shaping, networks become predictable, fair, and capable of delivering differentiated service quality.

What You Will Learn

By the end of this page, you will understand what traffic shaping is, why it's essential for modern networks, how it differs from related concepts like traffic policing, and the fundamental principles that underpin all traffic shaping algorithms. You will gain the conceptual foundation necessary to understand specific algorithms like leaky bucket and token bucket covered in subsequent pages.

Defining Traffic Shaping

Traffic shaping (also known as packet shaping or bandwidth shaping) is a network traffic management technique that deliberately delays some or all packets to bring them into compliance with a desired traffic profile or contract. It's a proactive mechanism that smooths out bursty traffic into a more predictable, steady stream.

Formally, traffic shaping is defined as:

A bandwidth management technique that delays some packets to meet a desired traffic profile, ensuring outgoing traffic conforms to a traffic contract and smoothing bursty flows into more regular patterns.

The key insight is that traffic shaping doesn't discard packets—it delays them. This is fundamentally different from traffic policing, which drops or marks excess traffic. Traffic shaping works by buffering packets and releasing them at a controlled rate.
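The buffer-and-release behavior can be sketched in a few lines. This is a minimal illustration, assuming an unbounded FIFO buffer and a fixed output rate; the function name `shape` and its parameters are made up for this example and do not come from any standard API.

```python
def shape(arrivals, rate_bps):
    """Return the time at which each packet finishes leaving the shaper.

    arrivals: list of (arrival_time_s, size_bytes), sorted by arrival time.
    rate_bps: shaper output rate in bits per second.
    Assumes an unbounded FIFO buffer, so no packet is ever dropped.
    """
    departures = []
    link_free_at = 0.0  # time at which the previous packet finishes sending
    for arrival_time, size_bytes in arrivals:
        start = max(arrival_time, link_free_at)  # wait in the buffer if busy
        link_free_at = start + size_bytes * 8 / rate_bps
        departures.append(link_free_at)
    return departures


# Four 1250-byte packets arrive at the same instant; the shaper runs at 1 Mbps.
burst = [(0.0, 1250)] * 4
print(shape(burst, 1_000_000))
```

Each 1250-byte packet takes 10 ms to send at 1 Mbps, so the burst leaves as an evenly spaced stream: nothing is lost, but later packets in the burst wait longer.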

Traffic Shaping Core Characteristics
Characteristic	Description	Implication
Delay-Based	Packets are queued and released at controlled times	Zero packet loss from shaping itself (though buffer overflow is possible)
Proactive	Operates before congestion occurs	Prevents problems rather than reacting to them
Sender-Side	Typically applied at the traffic source or on egress toward a constrained link	Source controls its own traffic behavior
Contract Compliance	Ensures traffic meets agreed parameters (rate, burst)	Enables Service Level Agreements (SLAs)
Smoothing	Converts bursty traffic to steady streams	Improves network predictability and fairness

The Shaping Mindset

Think of traffic shaping as a 'pace car' for network traffic. Just as a pace car controls the speed of vehicles entering a race, traffic shaping controls the rate at which packets enter the network. The goal isn't to slow things down—it's to ensure everyone can proceed smoothly and predictably.

Why Traffic Shaping Exists

Traffic shaping emerged as a solution to several fundamental problems in computer networks. Understanding these problems is essential to appreciating the elegance and necessity of traffic shaping mechanisms.

The Bursty Traffic Problem:

Real network traffic is inherently bursty—it comes in irregular, unpredictable bursts rather than smooth, constant flows. Consider these examples:

A web browser requests a page: nothing happens for seconds, then suddenly 50 HTTP requests fire simultaneously
A video encoder completes a frame: large chunks of data burst out at irregular intervals
A backup job starts: gigabytes of data suddenly flood the network
A user clicks 'refresh': multiple API calls trigger simultaneously

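This burstiness creates instantaneous overload even when average utilization is low. A 100 Mbps link might be offered traffic at 500 Mbps for a few milliseconds, for example when several faster upstream ports send at once. The link cannot carry more than 100 Mbps, so the excess must be queued or dropped.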
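The gap between average and instantaneous rate is easy to measure. The sketch below is illustrative only: the function name `rates_mbps`, the 10 ms measurement window, and the synthetic packet trace are assumptions chosen to reproduce the 500 Mbps figure above.

```python
def rates_mbps(packets, window_s, duration_s):
    """Return (average, peak) offered rate in Mbps.

    packets: list of (arrival_time_s, size_bytes).
    window_s: width of the window used to measure the peak.
    duration_s: total observation time.
    """
    n_windows = round(duration_s / window_s)
    bits_per_window = [0] * n_windows
    total_bits = 0
    for t, size_bytes in packets:
        idx = min(int(t / window_s), n_windows - 1)
        bits_per_window[idx] += size_bytes * 8
        total_bits += size_bytes * 8
    average = total_bits / duration_s / 1e6
    peak = max(bits_per_window) / window_s / 1e6
    return average, peak


# One burst of 500 packets (1250 bytes each) inside a single 10 ms window,
# observed over a full second.
trace = [(0.2005 + i * 1e-5, 1250) for i in range(500)]
average, peak = rates_mbps(trace, window_s=0.01, duration_s=1.0)
print(f"average {average:.1f} Mbps, peak {peak:.1f} Mbps")
```

Averaged over the second, this trace uses only about 5% of a 100 Mbps link, yet within its 10 ms window it arrives at roughly five times the link rate.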

Problems Caused by Uncontrolled Bursts

•Buffer Overflow — Routers and switches have finite buffer space. Sudden bursts can overflow these buffers, causing packet loss even when average load is sustainable.
•Congestion Collapse — When multiple sources burst simultaneously, queues fill everywhere, retransmissions multiply, and useful throughput can approach zero.
•Latency Spikes — Bursts create queue buildup, causing dramatic latency increases that devastate interactive applications like voice, video, and gaming.
•Unfairness — Bursty sources can 'grab' more than their fair share of bandwidth, starving well-behaved flows.
•Unpredictability — Without shaping, network behavior becomes chaotic and impossible to guarantee for critical applications.

The Network Economics Problem:

Networks are shared resources with finite capacity. Service providers must:

Guarantee capacity to customers who pay for it
Prevent abuse from customers who try to exceed their allocation
Enable tiered service where higher-paying customers get better performance
Ensure fairness so no single user degrades everyone's experience

Traffic shaping is the enforcement mechanism that makes these guarantees possible. Without it, the customer paying for 10 Mbps can burst to 100 Mbps, degrading service for everyone else.

The Trust Problem

In a shared network, you cannot trust all endpoints to be well-behaved. A single misbehaving source—intentional or accidental—can degrade the entire network. Traffic shaping provides network operators with a 'trust but verify' mechanism that enforces contracted behavior regardless of endpoint behavior.

Traffic Shaping vs Traffic Policing

Two closely related but fundamentally different mechanisms exist for enforcing traffic contracts: shaping and policing. Understanding the distinction is critical for network design and troubleshooting.

Traffic Shaping:

Delays packets by buffering them
Releases packets at a controlled rate
Introduces latency but preserves packets
Typically used at the sender/source
'Gentle' enforcement—cooperates with transport protocols

Traffic Policing:

Drops or marks excess packets immediately
No buffering or delay
Preserves latency but causes packet loss
Typically used at network boundaries (ingress)
'Strict' enforcement—can trigger TCP congestion response

Traffic Shaping

•Mechanism: Buffer and delay packets
•Packet fate: Preserved (unless buffer overflows)
•Latency impact: Increased (buffering delay)
•Packet loss: Minimal (only on buffer overflow)
•Typical location: Source/egress point
•TCP interaction: Cooperates with TCP congestion control (added delay paces the sender)
•Use case: WAN edge, enterprise egress

Traffic Policing

•Mechanism: Drop or mark excess packets
•Packet fate: Dropped or marked
•Latency impact: None (no buffering)
•Packet loss: High during overages
•Typical location: Provider ingress point
•TCP interaction: Triggers congestion response
•Use case: Provider network ingress, DDoS protection

When to Use Each:

Use Traffic Shaping when:

You control the sender and want to smooth outgoing traffic
Preserving packets is more important than minimizing latency
You want to be 'friendly' to TCP and avoid triggering retransmissions
You need to meter traffic before it enters a constrained link

Use Traffic Policing when:

You're at a trust boundary (e.g., provider ingress)
You need strict enforcement with no buffering resources
Excess traffic must be stopped immediately
You want to signal congestion back to the sender via packet loss

Policing Can Hurt TCP Performance

When a policer drops packets from a TCP flow, TCP interprets the loss as congestion and reduces its sending rate, potentially far below what the policer would allow. This creates a paradox where the user gets significantly less throughput than the contracted rate. Traffic shaping largely avoids the problem: queued packets arrive later, which slows TCP's ACK clock and paces the sender without loss, provided the shaping buffer does not overflow.
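To make the contrast concrete, the sketch below runs the same burst through a policer and a shaper that share one rate and burst allowance. Both functions are simplified illustrations (the names `police` and `shape_with_bucket` are hypothetical), and the shaper assumes an unbounded buffer and packets no larger than the burst allowance.

```python
def police(packets, rate_bps, burst_bytes):
    """Token-bucket policer: forward conforming packets, drop the rest."""
    tokens = float(burst_bytes)  # bucket starts full
    last = 0.0
    forwarded, dropped = [], 0
    for t, size in packets:
        tokens = min(burst_bytes, tokens + (t - last) * rate_bps / 8)
        last = t
        if size <= tokens:
            tokens -= size
            forwarded.append(t)  # sent immediately, no added delay
        else:
            dropped += 1
    return forwarded, dropped


def shape_with_bucket(packets, rate_bps, burst_bytes):
    """Token-bucket shaper: delay non-conforming packets until tokens exist."""
    tokens = float(burst_bytes)
    last = 0.0
    departures = []
    for t, size in packets:
        now = max(t, last)  # FIFO: cannot leave before the previous packet
        tokens = min(burst_bytes, tokens + (now - last) * rate_bps / 8)
        if size > tokens:
            now += (size - tokens) * 8 / rate_bps  # wait for the deficit
            tokens = size  # exactly enough tokens have accumulated
        tokens -= size
        last = now
        departures.append(now)
    return departures


# Ten 1500-byte packets arrive together; contract is 1.2 Mbps with a 3000-byte burst.
burst = [(0.0, 1500)] * 10
forwarded, dropped = police(burst, 1_200_000, 3000)
departures = shape_with_bucket(burst, 1_200_000, 3000)
print(len(forwarded), "forwarded,", dropped, "dropped by the policer")
print(len(departures), "delivered by the shaper, last at", round(departures[-1], 3), "s")
```

The policer forwards only the packets covered by the burst allowance and drops the rest, while the shaper delivers every packet at the cost of added delay for those at the back of the burst.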

Traffic Contracts and Parameters

Traffic shaping operates according to a traffic contract—a set of parameters that define the allowable traffic profile. Understanding these parameters is essential for configuring and analyzing traffic shapers.

Core Traffic Parameters:

Every traffic contract specifies some combination of the following parameters:

Traffic Contract Parameters
Parameter	Symbol	Definition	Typical Units
Committed Information Rate (CIR)	r	The guaranteed average rate of traffic	bits per second (bps)
Peak Information Rate (PIR)	P	The maximum instantaneous rate allowed	bits per second (bps)
Committed Burst Size (CBS)	Bc	Largest burst that may be sent at once while still conforming to the CIR	bits or bytes
Excess Burst Size (EBS)	Be	Additional burst allowed beyond the CBS, usually marked as drop-eligible	bits or bytes
Time Interval (Tc)	Tc	Measurement interval for rate calculation	milliseconds or seconds

Understanding CIR and Burst:

The relationship between rate and burst is subtle but critical. Consider a 10 Mbps CIR with a 100 KB burst:

Over 1 second of sustained sending, you can send about 10 Mb (10 Mbps × 1 s), plus at most one burst allowance if you started with full credit
But you don't have to send at a constant 10 Mbps
You can burst: send 100 KB instantly, then wait while 'credits' accumulate
The burst size defines how much you can send immediately without waiting
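The arithmetic behind this example can be checked directly. The helper names below are illustrative, and decimal units (100 KB = 100,000 bytes) are assumed.

```python
CIR_BPS = 10_000_000      # 10 Mbps committed rate
CBS_BYTES = 100_000       # 100 KB burst allowance (decimal kilobytes assumed)


def max_bytes(interval_s, cir_bps=CIR_BPS, cbs_bytes=CBS_BYTES):
    """Upper bound on bytes a conforming source can send in any interval."""
    return cbs_bytes + cir_bps * interval_s / 8


def refill_time_s(cir_bps=CIR_BPS, cbs_bytes=CBS_BYTES):
    """Time for an exhausted burst allowance to refill completely at the CIR."""
    return cbs_bytes * 8 / cir_bps


print(max_bytes(1.0))     # bound over one second, in bytes
print(refill_time_s())    # idle time needed to regain a full burst, in seconds
```

With these numbers, a full burst is regained after 80 ms of idle time. This is the same relationship many vendors express as Tc = Bc / CIR.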

The Bucket Intuition:

Imagine a bucket that fills with 'tokens' at the CIR rate:

Each token represents permission to send one bit (or byte)
The bucket size equals the burst size (CBS)
To send data, you consume tokens from the bucket
If the bucket is empty, you must wait for tokens to accumulate
If the bucket is full, newly arriving tokens are discarded, so unused credit never exceeds the CBS
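The bucket rules above translate almost line for line into code. The following is a minimal sketch rather than a production implementation: the class and method names are illustrative, one token stands for one byte, and the caller passes the current time explicitly so the behavior is deterministic.

```python
class TokenBucket:
    """Token bucket where one token grants permission to send one byte."""

    def __init__(self, rate_bps, burst_bytes):
        self.rate_bytes_per_s = rate_bps / 8
        self.capacity = burst_bytes
        self.tokens = float(burst_bytes)  # start with a full bucket
        self.last_update = 0.0

    def _refill(self, now):
        elapsed = now - self.last_update
        # Tokens beyond the capacity are discarded, not saved.
        self.tokens = min(self.capacity,
                          self.tokens + elapsed * self.rate_bytes_per_s)
        self.last_update = now

    def try_send(self, now, size_bytes):
        """Consume tokens and return True if the packet conforms at time now."""
        self._refill(now)
        if size_bytes <= self.tokens:
            self.tokens -= size_bytes
            return True
        return False

    def wait_time(self, now, size_bytes):
        """Seconds until size_bytes tokens are available (0.0 if available now).

        Only meaningful for packets no larger than the bucket capacity.
        """
        self._refill(now)
        deficit = size_bytes - self.tokens
        return max(0.0, deficit / self.rate_bytes_per_s)


bucket = TokenBucket(rate_bps=10_000_000, burst_bytes=100_000)
print(bucket.try_send(0.0, 100_000))   # the whole burst allowance is available
print(bucket.try_send(0.0, 1_500))     # the bucket is now empty
print(bucket.wait_time(0.0, 1_500))    # time for 1500 bytes of credit to accrue
```

A policer would drop the packet when `try_send` returns False, whereas a shaper would hold it for `wait_time` seconds and then send it.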

Burst Size Determines Responsiveness

A larger burst size allows more traffic to be sent immediately, improving responsiveness for bursty applications. However, it also allows larger queue buildups downstream. The art of configuration is balancing burst size against downstream buffer capacity and latency requirements.

Real-World Contract Example:

A typical enterprise WAN circuit might have:

CIR: 100 Mbps (guaranteed average rate)
PIR: 150 Mbps (peak allowed during bursts)
CBS: 1 MB (committed burst size)
EBS: 500 KB (excess burst, marked as lower priority)

This contract says:

You're guaranteed 100 Mbps average throughput
You can burst up to 150 Mbps if the network has capacity
You can send 1 MB immediately without delay
After exhausting CBS, you can send another 500 KB (marked for potential drop)
Traffic beyond this is shaped (delayed) or policed (dropped)
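One way to read this contract is as two buckets: a committed bucket of size CBS and an excess bucket of size EBS, both filled at the CIR. The sketch below follows that single-rate, two-bucket idea. It deliberately ignores the PIR, assumes decimal units (1 MB = 1,000,000 bytes), and uses hypothetical names throughout.

```python
from collections import Counter


class TwoBucketMeter:
    """Colors packets green/yellow/red using a committed and an excess bucket."""

    def __init__(self, cir_bps, cbs_bytes, ebs_bytes):
        self.rate = cir_bps / 8          # token fill rate in bytes per second
        self.cbs, self.ebs = cbs_bytes, ebs_bytes
        self.tc, self.te = float(cbs_bytes), float(ebs_bytes)  # both start full
        self.last = 0.0

    def color(self, now, size_bytes):
        new_tokens = (now - self.last) * self.rate
        self.last = now
        # Fill the committed bucket first; only its overflow feeds the excess bucket.
        spill = max(0.0, self.tc + new_tokens - self.cbs)
        self.tc = min(self.cbs, self.tc + new_tokens)
        self.te = min(self.ebs, self.te + spill)
        if size_bytes <= self.tc:
            self.tc -= size_bytes
            return "green"    # within the committed burst
        if size_bytes <= self.te:
            self.te -= size_bytes
            return "yellow"   # excess burst: forwarded but drop-eligible
        return "red"          # beyond the contract: delay (shape) or drop (police)


meter = TwoBucketMeter(cir_bps=100_000_000, cbs_bytes=1_000_000, ebs_bytes=500_000)
colors = Counter(meter.color(0.0, 1500) for _ in range(1100))
print(colors)
```

For a back-to-back burst of 1500-byte packets, the first 1 MB is green, the next 500 KB is yellow, and everything after that is red until tokens accumulate again.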

The Traffic Shaping Buffer

The shaping buffer is a critical component of any traffic shaper. It's where packets wait when they arrive faster than the shaper's configured output rate. Understanding buffer dynamics is essential for both configuration and troubleshooting.

Buffer Mechanics:


When traffic arrives faster than the configured rate:

Packets enter the shaping buffer (queue)
The shaper releases packets at the configured rate
Buffer fills during burst periods
Buffer drains during quiet periods
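The fill-and-drain cycle can be traced with a coarse, tick-based model. This is a sketch under simplifying assumptions: traffic is treated as a fluid measured in bytes per tick, overflow is tail-dropped, and the function name `simulate_backlog` is illustrative.

```python
def simulate_backlog(arrivals_bytes, rate_bytes_per_tick, buffer_bytes):
    """Track shaper queue depth tick by tick.

    arrivals_bytes: bytes arriving in each tick.
    Returns (backlog_per_tick, dropped_bytes). Bytes that do not fit in the
    buffer are tail-dropped.
    """
    backlog, dropped, history = 0, 0, []
    for arrived in arrivals_bytes:
        backlog += arrived
        if backlog > buffer_bytes:           # buffer overflow
            dropped += backlog - buffer_bytes
            backlog = buffer_bytes
        backlog = max(0, backlog - rate_bytes_per_tick)  # drain at shaped rate
        history.append(backlog)
    return history, dropped


# Two ticks of burst followed by silence; shaper drains 2000 bytes per tick.
history, dropped = simulate_backlog([5000, 5000, 0, 0, 0, 0], 2000, 6000)
print(history, dropped)
```

The backlog grows while arrivals exceed the drain rate, is capped by the buffer size (with the excess dropped), and then empties during the quiet ticks.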

Buffer Sizing Considerations:

Buffer size involves critical tradeoffs:

Buffer Size Tradeoffs
Buffer Size	Advantages	Disadvantages
Too Small	Low latency, fast feedback to sender	High packet loss during bursts, poor throughput
Too Large	No packet loss, smooth throughput	High latency (bufferbloat), poor interactive performance
Optimal	Balanced latency and throughput	Requires careful tuning to traffic patterns

The Bufferbloat Problem:

Excessively large buffers create bufferbloat—a pathological condition where packets experience enormous delays in overloaded buffers. This is particularly harmful because:

TCP doesn't receive loss signals: Packets are queued, not dropped, so loss-based TCP keeps increasing its rate
Latency balloons: A full 1 MB buffer draining at 10 Mbps adds 800 ms of queuing delay
Interactive applications suffer: VoIP, video conferencing, and gaming become unusable
Retransmissions become spurious: A retransmission timer can expire while the original packet is still sitting in the queue, so the sender resends data that was never lost

Buffer Sizing Rules of Thumb:

For traditional TCP traffic:

Buffer Size = Bandwidth × RTT

For a 100 Mbps link with 20ms RTT:

Buffer = 100 Mbps × 0.020s = 2 Mb = 250 KB

Modern recommendations (with many flows) suggest even smaller buffers:

Buffer Size = (Bandwidth × RTT) / √n

where n is the number of concurrent flows.
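Both rules of thumb, together with the queuing delay implied by a given buffer, reduce to one-line calculations. The function names below are illustrative, and the flow count of 100 is an assumed value for the example.

```python
import math


def bdp_bytes(bandwidth_bps, rtt_s):
    """Bandwidth-delay product in bytes."""
    return bandwidth_bps * rtt_s / 8


def small_buffer_bytes(bandwidth_bps, rtt_s, n_flows):
    """BDP divided by sqrt(n) for n concurrent flows."""
    return bdp_bytes(bandwidth_bps, rtt_s) / math.sqrt(n_flows)


def max_queue_delay_s(buffer_bytes, rate_bps):
    """Worst-case time a packet waits behind a full buffer."""
    return buffer_bytes * 8 / rate_bps


print(bdp_bytes(100_000_000, 0.020))                 # 100 Mbps, 20 ms RTT
print(small_buffer_bytes(100_000_000, 0.020, 100))   # same link, 100 flows (assumed)
print(max_queue_delay_s(1_000_000, 10_000_000))      # 1 MB buffer at 10 Mbps
```

With 100 concurrent flows the suggested buffer shrinks to a tenth of the bandwidth-delay product, which also cuts the worst-case queuing delay by the same factor.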

Active Queue Management

Modern traffic shapers often implement Active Queue Management (AQM) algorithms like CoDel or PIE that proactively drop packets before the buffer fills. This provides early congestion signals to TCP while maintaining low latency. AQM is particularly important for combating bufferbloat in shaping buffers.

Where Traffic Shaping Occurs

Traffic shaping can be implemented at various points in the network, each with different characteristics and use cases.

End-Host Shaping:

Traffic shaping at the source (end host) is the most cooperative form:

Location: Application, OS network stack, or NIC
Advantages: Controls traffic before it enters the network; no downstream impact
Examples: Application-level rate limiting, tc (traffic control) on Linux, Windows QoS
Use case: Cloud egress throttling, application bandwidth management

Edge Shaping:

Traffic shaping at the network edge (first-hop router/switch):

Location: Access switches, home routers, enterprise WAN routers
Advantages: Shapes traffic before bottleneck links; customer-controlled
Examples: Router QoS policies, SD-WAN shapers
Use case: WAN optimization, ISP access rate enforcement

Common Traffic Shaping Deployment Points

•Application Layer — Rate limiting in web servers (nginx, HAProxy), API gateways, microservices
•Operating System — Linux traffic control (tc), Windows QoS Policy, macOS pf
•Network Interface — Smart NICs with hardware traffic shaping for line-rate performance
•Customer Premises Equipment — Home routers, enterprise WAN routers, SD-WAN appliances
•Provider Edge — ISP access routers shaping customer traffic to contracted rates
•Data Center — Hypervisor virtual switches, container networking, cloud provider gateways

Ingress vs Egress Shaping:

Egress (Outbound) Shaping:

Most common form of traffic shaping
Shapes traffic leaving an interface
Full control over transmission timing
Can implement sophisticated queuing disciplines

Ingress (Inbound) Shaping:

Less common and more limited
Can only delay/drop already-received packets
Limited queuing options (packets already in memory)
Often policing is preferred for inbound traffic

Shape Where You Control

The general principle is to shape traffic as close to the source as possible. Once traffic enters the network, you cannot 'un-send' it. Upstream congestion cannot be fixed by downstream shaping—the damage (packet drops, queuing delays) has already occurred.

Classification and Marking

Traffic shaping rarely applies uniformly to all traffic. Instead, traffic is classified into different categories, each with its own shaping policy. This classification is fundamental to Quality of Service (QoS) implementations.

Classification Methods:

Traffic Classification Techniques
Classification Method	Layer	Description	Example
Layer 2 (Data Link)	L2	VLAN tag, MAC address, CoS (802.1p)	Shape all VLAN 100 traffic to 50 Mbps
Layer 3 (Network)	L3	IP address, DSCP/ToS, protocol	Shape all traffic to 10.0.0.0/8 to 10 Mbps
Layer 4 (Transport)	L4	Port numbers, TCP/UDP	Shape port 80/443 traffic separately
Layer 7 (Application)	L7	Application identification via DPI	Shape YouTube to 5 Mbps per user
Flow-Based	Multi	5-tuple (src/dst IP, src/dst port, protocol)	Per-flow fair queuing
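A classifier built from the header-based rows of this table might look like the sketch below. The `Packet` type, the class names returned, and the first-match rule order are all assumptions made for illustration; real devices express such rules in their own configuration language. Layer 7 classification is omitted because it requires payload inspection.

```python
import ipaddress
from typing import NamedTuple, Optional


class Packet(NamedTuple):
    src_ip: str
    dst_ip: str
    src_port: int
    dst_port: int
    protocol: str               # "tcp" or "udp"
    vlan: Optional[int] = None


def classify(pkt: Packet) -> str:
    """Return a hypothetical class name; the first matching rule wins."""
    if pkt.vlan == 100:                                     # Layer 2: VLAN tag
        return "vlan100"
    if ipaddress.ip_address(pkt.dst_ip) in ipaddress.ip_network("10.0.0.0/8"):
        return "internal"                                   # Layer 3: destination prefix
    if pkt.protocol == "tcp" and pkt.dst_port in (80, 443):
        return "web"                                        # Layer 4: port numbers
    return "default"


def flow_key(pkt: Packet) -> tuple:
    """5-tuple that identifies a flow for per-flow queuing."""
    return (pkt.src_ip, pkt.dst_ip, pkt.src_port, pkt.dst_port, pkt.protocol)


pkt = Packet("192.0.2.10", "198.51.100.7", 51000, 443, "tcp")
print(classify(pkt), flow_key(pkt))
```

Each class name returned here would map to its own shaper, while the flow key lets a fair-queuing scheduler keep separate state per flow.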

DSCP (Differentiated Services Code Point):

DSCP is the primary marking mechanism in modern IP networks. It uses the upper 6 bits of the IPv4 header's former ToS (Type of Service) byte, now called the DS field (the Traffic Class field in IPv6), to indicate traffic class:

EF (Expedited Forwarding, DSCP 46): Premium, low-latency traffic (VoIP)
AF (Assured Forwarding, DSCP 10-38): Four classes with three drop precedences each
CS (Class Selector, DSCP 0, 8, 16, ..., 56): Eight code points in multiples of 8, backward compatible with the old IP precedence bits
BE (Best Effort, DSCP 0): Default, no special treatment
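The code point values listed above follow a simple bit layout, which the short sketch below reproduces. The helper names are illustrative; the formulas come from the standard AF and CS numbering.

```python
def af_dscp(af_class, drop_precedence):
    """DSCP value of AFxy, where x is the class (1-4) and y the drop precedence (1-3)."""
    return 8 * af_class + 2 * drop_precedence


def cs_dscp(precedence):
    """DSCP value of Class Selector CSn (n = 0-7)."""
    return 8 * precedence


def tos_byte(dscp, ecn=0):
    """Place the 6-bit DSCP in the upper bits of the 8-bit field; ECN uses the lower 2."""
    return (dscp << 2) | ecn


EF = 46
print(af_dscp(4, 1), af_dscp(2, 1), cs_dscp(7), hex(tos_byte(EF)))
# prints: 34 18 56 0xb8
```

The shifted value (0xb8 for EF) is what appears in the full 8-bit field, which is why packet captures and socket options often show 184 rather than 46 for voice traffic.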

Typical Class-Based Shaping Policy:

Class: Voice (EF, DSCP 46)
  - Priority: Strict priority queuing
  - Max Rate: 5 Mbps (prevents starvation of other classes)
  - Burst: Low (latency-sensitive)

Class: Video (AF41, DSCP 34)
  - Priority: Second priority
  - Min Rate: 20 Mbps (guaranteed)
  - Max Rate: 50 Mbps (ceiling)
  - Burst: Medium

Class: Business Data (AF21, DSCP 18)
  - Priority: Normal
  - Min Rate: 30 Mbps (guaranteed)
  - Burst: Large (tolerant of delay)

Class: Best Effort (DSCP 0)
  - Priority: Lowest
  - Rate: Whatever remains

Trust but Verify

In practice, traffic markings from untrusted sources (e.g., internet-facing interfaces) are typically reset to a default value at trust boundaries. Trusting external DSCP markings would allow attackers to gain priority by simply marking their traffic as high-priority.

Real-World Traffic Shaping Applications

Traffic shaping is ubiquitous in modern networking, appearing in contexts from home routers to global cloud infrastructure. Understanding these applications helps cement the conceptual understanding.

ISP Customer Rate Limiting:

Your home internet connection's speed is typically enforced by traffic shaping or policing at the provider:

ISP shapes your traffic to contracted rate (e.g., 100 Mbps)
Prevents you from exceeding your plan's speed
Ensures fair bandwidth allocation across customers
Often implemented at the provider edge router

Traffic Shaping in Modern Infrastructure

•Cloud Provider Egress — AWS, GCP, Azure shape outbound traffic from VMs to prevent network saturation and ensure fair multi-tenant resource allocation.
•SD-WAN — Software-Defined WAN appliances shape traffic across multiple WAN links, prioritizing critical applications and ensuring SLA compliance.
•API Rate Limiting — API gateways shape request rates to prevent abuse, ensure fair usage, and protect backend services from overload.
•CDN Edge Servers — Content delivery networks shape egress to balance bandwidth costs, client experience, and origin server load.
•Video Streaming — Streaming services shape bitrate delivery to match client bandwidth, prevent buffering, and optimize for cost.
•Enterprise WAN — Enterprises shape WAN traffic to prioritize VoIP and video over bulk data transfers across limited WAN links.
•Gaming Networks — Game servers shape update distribution across millions of clients to prevent network congestion during major releases.
•Container Orchestration — Kubernetes network plugins shape inter-pod traffic to enforce bandwidth limits and prevent noisy neighbors.

Case Study: Video Streaming Traffic Shaping

Netflix and similar services use sophisticated traffic shaping:

Server-side shaping: Servers shape egress to match client's measured bandwidth
Adaptive bitrate: Video quality adjusts based on available bandwidth
Buffer management: Pre-buffering during low-activity periods
Multi-CDN shaping: Distribution shaped across multiple CDN providers for cost and performance optimization

This behavior is why a stream often starts at lower quality and then improves: the player estimates the available bandwidth and requests higher bitrates as its estimate grows, while the server paces delivery to match.

Shaping is Everywhere

Almost every network service you use involves traffic shaping somewhere. The fact that you don't notice it is a testament to its effectiveness—good traffic shaping is invisible, creating the illusion of abundant, fair, predictable network resources.

Summary: Traffic Shaping Fundamentals

We've established the foundational understanding of traffic shaping—the critical network mechanism that transforms chaotic, bursty traffic into predictable, manageable flows.

Key Takeaways

•Traffic shaping delays packets to conform traffic to a desired profile—it preserves packets rather than dropping them.
•Bursty traffic causes congestion problems including buffer overflow, latency spikes, and unfairness that traffic shaping addresses.
•Shaping differs from policing — shaping delays while policing drops. Shaping is 'friendly' to TCP; policing can hurt performance.
•Traffic contracts define parameters — CIR, PIR, burst sizes, and time intervals specify the allowed traffic profile.
•Buffer sizing is critical — too small causes loss, too large causes bufferbloat. Optimal sizing depends on bandwidth-delay product.
•Classification enables differentiated shaping — different traffic classes can receive different treatment based on priority.
•Traffic shaping is ubiquitous — from ISP rate limiting to cloud egress to API gateways, shaping appears throughout modern infrastructure.

What's Next:

Now that we understand the concept and purpose of traffic shaping, we'll dive into the specific algorithms that implement it. The next page explores the Leaky Bucket Algorithm—one of the two fundamental traffic shaping mechanisms that provides a beautifully simple model for enforcing constant output rates.

Page Complete

You now understand what traffic shaping is, why it exists, and how it fits into the broader network QoS landscape. With this foundation, you're ready to explore the specific algorithms—leaky bucket and token bucket—that make traffic shaping a reality.


Computer NetworksTraffic Shaping

Traffic Shaping

LevelAdvanced

Duration75 mins

TopicTraffic Shaping

1 / 5

Traffic Shaping Concept

The Art of Controlling Data Flow

What You Will Learn

Defining Traffic Shaping

Formally, traffic shaping is defined as:

A bandwidth management technique that delays some packets to meet a desired traffic profile, ensuring outgoing traffic conforms to a traffic contract and smoothing bursty flows into more regular patterns.

Traffic Shaping Core Characteristics
Characteristic	Description	Implication
Delay-Based	Packets are queued and released at controlled times	Zero packet loss from shaping itself (though buffer overflow is possible)
Proactive	Operates before congestion occurs	Prevents problems rather than reacting to them
Sender-Side	Typically applied at the traffic source or ingress point	Source controls its own traffic behavior
Contract Compliance	Ensures traffic meets agreed parameters (rate, burst)	Enables Service Level Agreements (SLAs)
Smoothing	Converts bursty traffic to steady streams	Improves network predictability and fairness

The Shaping Mindset

Why Traffic Shaping Exists

The Bursty Traffic Problem:

Real network traffic is inherently bursty—it comes in irregular, unpredictable bursts rather than smooth, constant flows. Consider these examples:

A web browser requests a page: nothing happens for seconds, then suddenly 50 HTTP requests fire simultaneously
A video encoder completes a frame: large chunks of data burst out at irregular intervals
A backup job starts: gigabytes of data suddenly flood the network
A user clicks 'refresh': multiple API calls trigger simultaneously

This burstiness creates instantaneous overload even when average utilization is low. A 100 Mbps link might see bursts of 500 Mbps lasting milliseconds—far exceeding capacity.

Problems Caused by Uncontrolled Bursts

•Buffer Overflow — Routers and switches have finite buffer space. Sudden bursts can overflow these buffers, causing packet loss even when average load is sustainable.
•Congestion Collapse — When multiple sources burst simultaneously, queues fill everywhere, retransmissions multiply, and useful throughput can approach zero.
•Latency Spikes — Bursts create queue buildup, causing dramatic latency increases that devastate interactive applications like voice, video, and gaming.
•Unfairness — Bursty sources can 'grab' more than their fair share of bandwidth, starving well-behaved flows.
•Unpredictability — Without shaping, network behavior becomes chaotic and impossible to guarantee for critical applications.

The Network Economics Problem:

Networks are shared resources with finite capacity. Service providers must:

Guarantee capacity to customers who pay for it
Prevent abuse from customers who try to exceed their allocation
Enable tiered service where higher-paying customers get better performance
Ensure fairness so no single user degrades everyone's experience

Traffic shaping is the enforcement mechanism that makes these guarantees possible. Without it, the customer paying for 10 Mbps can burst to 100 Mbps, degrading service for everyone else.

The Trust Problem

Traffic Shaping vs Traffic Policing

Traffic Shaping:

Delays packets by buffering them
Releases packets at a controlled rate
Introduces latency but preserves packets
Typically used at the sender/source
'Gentle' enforcement—cooperates with transport protocols

Traffic Policing:

Drops or marks excess packets immediately
No buffering or delay
Preserves latency but causes packet loss
Typically used at network boundaries (ingress)
'Strict' enforcement—can trigger TCP congestion response

Traffic Shaping

•Mechanism: Buffer and delay packets
•Packet fate: Preserved (unless buffer overflows)
•Latency impact: Increased (buffering delay)
•Packet loss: Minimal (only on buffer overflow)
•Typical location: Source/egress point
•TCP interaction: Cooperates with TCP flow control
•Use case: WAN edge, enterprise egress

Traffic Policing

•Mechanism: Drop or mark excess packets
•Packet fate: Dropped or marked
•Latency impact: None (no buffering)
•Packet loss: High during overages
•Typical location: Provider ingress point
•TCP interaction: Triggers congestion response
•Use case: Provider network ingress, DDoS protection

When to Use Each:

Use Traffic Shaping when:

You control the sender and want to smooth outgoing traffic
Preserving packets is more important than minimizing latency
You want to be 'friendly' to TCP and avoid triggering retransmissions
You need to meter traffic before it enters a constrained link

Use Traffic Policing when:

You're at a trust boundary (e.g., provider ingress)
You need strict enforcement with no buffering resources
Excess traffic must be stopped immediately
You want to signal congestion back to the sender via packet loss

Policing Can Hurt TCP Performance

Traffic Contracts and Parameters

Core Traffic Parameters:

Every traffic contract specifies some combination of the following parameters:

Traffic Contract Parameters
Parameter	Symbol	Definition	Typical Units
Committed Information Rate (CIR)	r	The guaranteed average rate of traffic	bits per second (bps)
Peak Information Rate (PIR)	P	The maximum instantaneous rate allowed	bits per second (bps)
Committed Burst Size (CBS)	Bc	Maximum burst size at CIR	bits or bytes
Excess Burst Size (EBS)	Be	Additional burst allowed above CIR	bits or bytes
Time Interval (Tc)	Tc	Measurement interval for rate calculation	milliseconds or seconds

Understanding CIR and Burst:

The relationship between rate and burst is subtle but critical. Consider a 10 Mbps CIR with a 100 KB burst:

Over 1 second, you can send exactly 10 Mb (10 Mbps × 1s)
But you don't have to send at a constant 10 Mbps
You can burst: send 100 KB instantly, then wait while 'credits' accumulate
The burst size defines how much you can send immediately without waiting

The Bucket Intuition:

Imagine a bucket that fills with 'tokens' at the CIR rate:

Each token represents permission to send one bit (or byte)
The bucket size equals the burst size (CBS)
To send data, you consume tokens from the bucket
If the bucket is empty, you must wait for tokens to accumulate
If the bucket is full, excess tokens are lost (the bucket doesn't overflow)

Burst Size Determines Responsiveness

Real-World Contract Example:

A typical enterprise WAN circuit might have:

CIR: 100 Mbps (guaranteed average rate)
PIR: 150 Mbps (peak allowed during bursts)
CBS: 1 MB (committed burst size)
EBS: 500 KB (excess burst, marked as lower priority)

This contract says:

You're guaranteed 100 Mbps average throughput
You can burst up to 150 Mbps if the network has capacity
You can send 1 MB immediately without delay
After exhausting CBS, you can send another 500 KB (marked for potential drop)
Traffic beyond this is shaped (delayed) or policed (dropped)

The Traffic Shaping Buffer

Buffer Mechanics:

Converting Mermaid diagram...

When traffic arrives faster than the configured rate:

Packets enter the shaping buffer (queue)
The shaper releases packets at the configured rate
Buffer fills during burst periods
Buffer drains during quiet periods

Buffer Sizing Considerations:

Buffer size involves critical tradeoffs:

Buffer Size Tradeoffs
Buffer Size	Advantages	Disadvantages
Too Small	Low latency, fast feedback to sender	High packet loss during bursts, poor throughput
Too Large	No packet loss, smooth throughput	High latency (bufferbloat), poor interactive performance
Optimal	Balanced latency and throughput	Requires careful tuning to traffic patterns

The Bufferbloat Problem:

Excessively large buffers create bufferbloat—a pathological condition where packets experience enormous delays in overloaded buffers. This is particularly harmful because:

TCP doesn't receive loss signals — Packets are queued, not dropped, so TCP doesn't reduce its rate
Latency balloons — A 1 MB buffer at 10 Mbps introduces 800ms of latency
Interactive applications suffer — VoIP, video conferencing, and gaming become unusable
Retransmissions arrive too late — By the time a retransmission arrives, the original might have made it through

Buffer Sizing Rules of Thumb:

For traditional TCP traffic:

Buffer Size = Bandwidth × RTT

For a 100 Mbps link with 20ms RTT:

Buffer = 100 Mbps × 0.020s = 2 Mb = 250 KB

Modern recommendations (with many flows) suggest even smaller buffers:

Buffer Size = (Bandwidth × RTT) / √n

where n is the number of concurrent flows.

Active Queue Management

Where Traffic Shaping Occurs

Traffic shaping can be implemented at various points in the network, each with different characteristics and use cases.

End-Host Shaping:

Traffic shaping at the source (end host) is the most cooperative form:

Location: Application, OS network stack, or NIC
Advantages: Controls traffic before it enters the network; no downstream impact
Examples: Application-level rate limiting, tc (traffic control) on Linux, Windows QoS
Use case: Cloud egress throttling, application bandwidth management

Edge Shaping:

Traffic shaping at the network edge (first-hop router/switch):

Location: Access switches, home routers, enterprise WAN routers
Advantages: Shapes traffic before bottleneck links; customer-controlled
Examples: Router QoS policies, SD-WAN shapers
Use case: WAN optimization, ISP access rate enforcement

Common Traffic Shaping Deployment Points

•Application Layer — Rate limiting in web servers (nginx, HAProxy), API gateways, microservices
•Operating System — Linux traffic control (tc), Windows QoS Policy, macOS pf
•Network Interface — Smart NICs with hardware traffic shaping for line-rate performance
•Customer Premises Equipment — Home routers, enterprise WAN routers, SD-WAN appliances
•Provider Edge — ISP access routers shaping customer traffic to contracted rates
•Data Center — Hypervisor virtual switches, container networking, cloud provider gateways

Ingress vs Egress Shaping:

Egress (Outbound) Shaping:

Most common form of traffic shaping
Shapes traffic leaving an interface
Full control over transmission timing
Can implement sophisticated queuing disciplines

Ingress (Inbound) Shaping:

Less common and more limited
Can only delay/drop already-received packets
Limited queuing options (packets already in memory)
Often policing is preferred for inbound traffic

Shape Where You Control

Classification and Marking

Classification Methods:

Traffic Classification Techniques
Classification Method	Layer	Description	Example
Layer 2 (Data Link)	L2	VLAN tag, MAC address, CoS (802.1p)	Shape all VLAN 100 traffic to 50 Mbps
Layer 3 (Network)	L3	IP address, DSCP/ToS, protocol	Shape all traffic to 10.0.0.0/8 to 10 Mbps
Layer 4 (Transport)	L4	Port numbers, TCP/UDP	Shape port 80/443 traffic separately
Layer 7 (Application)	L7	Application identification via DPI	Shape YouTube to 5 Mbps per user
Flow-Based	Multi	5-tuple (src/dst IP, src/dst port, protocol)	Per-flow fair queuing
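The classification methods in the table can be sketched as an ordered rule list where the first match wins. This is a simplified illustration: the `Packet` fields, the class names, and the rule order are assumptions chosen to mirror the table's examples, not the behavior of any specific device.

```python
from dataclasses import dataclass
from ipaddress import ip_address, ip_network
from typing import Optional


@dataclass(frozen=True)
class Packet:
    src_ip: str
    dst_ip: str
    protocol: str            # "tcp" or "udp"
    src_port: int
    dst_port: int
    dscp: int = 0
    vlan: Optional[int] = None


def classify(pkt: Packet) -> str:
    """Return a traffic class name; rules are checked top to bottom."""
    if pkt.vlan == 100:                                   # Layer 2: VLAN tag
        return "vlan100"
    if pkt.dscp == 46:                                    # Layer 3: DSCP marking
        return "voice"
    if ip_address(pkt.dst_ip) in ip_network("10.0.0.0/8"):  # Layer 3: IP prefix
        return "internal"
    if pkt.protocol == "tcp" and pkt.dst_port in (80, 443):  # Layer 4: ports
        return "web"
    return "best-effort"


def flow_key(pkt: Packet) -> tuple:
    """5-tuple used for flow-based (per-flow) treatment."""
    return (pkt.src_ip, pkt.dst_ip, pkt.src_port, pkt.dst_port, pkt.protocol)
```

Because the first match wins, rule order matters: here a packet to 10.1.2.3 on port 443 is classified as "internal" rather than "web". Layer 7 classification is omitted because it requires payload inspection.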

DSCP (Differentiated Services Code Point):

DSCP is the primary marking mechanism in modern IP networks. It uses the upper 6 bits of the IP header's former ToS (Type of Service) byte, now called the DS field, to indicate traffic class; the remaining 2 bits carry ECN:

EF (Expedited Forwarding, DSCP 46): Premium, low-latency traffic (VoIP)
AF (Assured Forwarding, AF11 through AF43, DSCP 10-38): Four classes with three drop precedences each
CS (Class Selector, CS0 through CS7, DSCP multiples of 8 from 0 to 56): Backward compatible with the older IP Precedence field
BE (Best Effort, DSCP 0): Default, no special treatment
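The code point values above follow a regular pattern, which a short Python sketch can make explicit. The helper names (`af_dscp`, `cs_dscp`, `tos_byte`) are illustrative; the arithmetic follows the standard DiffServ layout, where the 6-bit DSCP sits above the 2 ECN bits.

```python
EF = 46  # Expedited Forwarding


def af_dscp(af_class: int, drop_precedence: int) -> int:
    """DSCP for AFxy: x is the class (1-4), y is the drop precedence (1-3)."""
    if not (1 <= af_class <= 4 and 1 <= drop_precedence <= 3):
        raise ValueError("AF class must be 1-4 and drop precedence 1-3")
    return 8 * af_class + 2 * drop_precedence


def cs_dscp(n: int) -> int:
    """DSCP for Class Selector CSn (n in 0-7)."""
    if not 0 <= n <= 7:
        raise ValueError("Class Selector must be 0-7")
    return 8 * n


def tos_byte(dscp: int, ecn: int = 0) -> int:
    """Full 8-bit DS field: DSCP in the upper 6 bits, ECN in the lower 2."""
    return (dscp << 2) | ecn


print(af_dscp(4, 1))   # AF41 -> 34
print(af_dscp(2, 1))   # AF21 -> 18
print(tos_byte(EF))    # 184 (0xB8)
```

On platforms that expose the `IP_TOS` socket option, the value returned by `tos_byte` is what an application would typically pass to mark its own traffic; whether the network honors that marking depends on the trust policy at the edge.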

Typical Class-Based Shaping Policy:

Class: Voice (EF, DSCP 46)
  - Priority: Strict priority queuing
  - Max Rate: 5 Mbps (prevents starvation of other classes)
  - Burst: Low (latency-sensitive)

Class: Video (AF41, DSCP 34)
  - Priority: Second priority
  - Min Rate: 20 Mbps (guaranteed)
  - Max Rate: 50 Mbps (ceiling)
  - Burst: Medium

Class: Business Data (AF21, DSCP 18)
  - Priority: Normal
  - Min Rate: 30 Mbps (guaranteed)
  - Burst: Large (tolerant of delay)

Class: Best Effort (DSCP 0)
  - Priority: Lowest
  - Rate: Whatever remains
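One way to read this policy is as a three-step allocation: serve voice first up to its cap, then satisfy each guarantee, then hand out leftover capacity in priority order up to each ceiling. The Python sketch below follows that reading. It is a simplification under stated assumptions: the 100 Mbps link rate is hypothetical (the policy does not state one), and real schedulers share excess bandwidth in more nuanced ways.

```python
def allocate(link_mbps, demand):
    """Allocate link capacity (Mbps) across the four classes in the policy."""
    remaining = link_mbps
    alloc = {}

    # Step 1: strict priority for voice, capped at 5 Mbps to avoid starvation.
    alloc["voice"] = min(demand.get("voice", 0), 5, remaining)
    remaining -= alloc["voice"]

    # Step 2: guaranteed minimum rates.
    for name, guaranteed in (("video", 20), ("business", 30)):
        alloc[name] = min(demand.get(name, 0), guaranteed, remaining)
        remaining -= alloc[name]

    # Step 3: leftover capacity in priority order, up to each ceiling.
    # Only video has an explicit ceiling; the others are bounded by the link.
    ceilings = {"video": 50, "business": link_mbps, "best_effort": link_mbps}
    for name in ("video", "business", "best_effort"):
        current = alloc.get(name, 0)
        extra = min(demand.get(name, 0) - current,
                    ceilings[name] - current,
                    remaining)
        extra = max(extra, 0)
        alloc[name] = current + extra
        remaining -= extra
    return alloc


# Every class wants more than its share on an assumed 100 Mbps link.
print(allocate(100, {"voice": 8, "video": 60, "business": 40, "best_effort": 100}))
# {'voice': 5, 'video': 50, 'business': 40, 'best_effort': 5}
```

In this overloaded case voice is held to its cap, video reaches its ceiling, business gets its full demand, and best effort receives only what remains.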

Trust but Verify

Real-World Traffic Shaping Applications

Traffic shaping is ubiquitous in modern networking, appearing in contexts from home routers to global cloud infrastructure. Understanding these applications helps cement the conceptual understanding.

ISP Customer Rate Limiting:

Your home internet connection's speed is typically enforced by traffic shaping (or, on some networks, by policing):

ISP shapes your traffic to contracted rate (e.g., 100 Mbps)
Prevents you from exceeding your plan's speed
Ensures fair bandwidth allocation across customers
Often implemented at the provider edge router
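The effect of shaping to a contracted rate can be sketched by computing when each packet is allowed to leave. The Python example below is a minimal model, assuming a single FIFO queue with unlimited buffer space and integer microsecond timestamps; the function name `shape` is illustrative.

```python
def shape(packets, rate_bps):
    """Return departure start times (microseconds) for shaped packets.

    packets: list of (arrival_us, size_bytes), sorted by arrival time.
    Each packet waits until the previous one has finished transmitting
    at the contracted rate. Transmission times are truncated to whole
    microseconds.
    """
    departures = []
    free_at = 0  # time at which the shaper may send the next packet
    for arrival, size in packets:
        start = max(arrival, free_at)
        departures.append(start)
        free_at = start + (size * 8 * 1_000_000) // rate_bps
    return departures


# Three 1500-byte packets arrive as a burst at t=0 on a 100 Mbps contract.
# Each takes 120 us to send, so the burst is spread out; the packet
# arriving at t=1000 us finds the shaper idle and leaves immediately.
print(shape([(0, 1500), (0, 1500), (0, 1500), (1000, 1500)], 100_000_000))
# [0, 120, 240, 1000]
```

No packet is dropped in this model: the burst is delayed, not discarded, which is the defining difference from policing.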

Traffic Shaping in Modern Infrastructure

•Cloud Provider Egress — AWS, GCP, Azure shape outbound traffic from VMs to prevent network saturation and ensure fair multi-tenant resource allocation.
•SD-WAN — Software-Defined WAN appliances shape traffic across multiple WAN links, prioritizing critical applications and ensuring SLA compliance.
•API Rate Limiting — API gateways shape request rates to prevent abuse, ensure fair usage, and protect backend services from overload.
•CDN Edge Servers — Content delivery networks shape egress to balance bandwidth costs, client experience, and origin server load.
•Video Streaming — Streaming services shape bitrate delivery to match client bandwidth, prevent buffering, and optimize for cost.
•Enterprise WAN — Enterprises shape WAN traffic to prioritize VoIP and video over bulk data transfers across limited WAN links.
•Gaming Networks — Game servers shape update distribution across millions of clients to prevent network congestion during major releases.
•Container Orchestration — Kubernetes network plugins shape inter-pod traffic to enforce bandwidth limits and prevent noisy neighbors.

Case Study: Video Streaming Traffic Shaping

Netflix and similar services use sophisticated traffic shaping:

Server-side shaping: Servers shape egress to match client's measured bandwidth
Adaptive bitrate: Video quality adjusts based on available bandwidth
Buffer management: Pre-buffering during low-activity periods
Multi-CDN shaping: Distribution shaped across multiple CDN providers for cost and performance optimization

This is why a Netflix stream often starts at lower quality and then improves: the player is probing the available bandwidth, and delivery is shaped to match what it measures.
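The adaptive bitrate step can be sketched as picking the highest rung of a bitrate ladder that fits within a safety margin of the measured throughput. This is a generic illustration, not Netflix's actual algorithm; the ladder values and the 0.8 safety factor are hypothetical.

```python
def pick_bitrate(ladder_kbps, measured_kbps, safety=0.8):
    """Choose the highest bitrate that fits within safety * measured throughput.

    Falls back to the lowest rung when even that exceeds the budget.
    """
    budget = measured_kbps * safety
    candidates = [b for b in ladder_kbps if b <= budget]
    return max(candidates) if candidates else min(ladder_kbps)


ladder = [235, 750, 1750, 3000, 5800]  # hypothetical bitrate ladder (kbps)
print(pick_bitrate(ladder, 1000))    # 750
print(pick_bitrate(ladder, 10000))   # 5800
print(pick_bitrate(ladder, 100))     # 235
```

As throughput measurements rise over the first few segments, repeated calls return higher rungs, which matches the observed pattern of quality improving after playback starts.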

Shaping is Everywhere

Summary: Traffic Shaping Fundamentals

We've established the foundations of traffic shaping: the network mechanism that smooths bursty traffic into predictable, manageable flows.

Key Takeaways

•Traffic shaping delays packets to conform traffic to a desired profile; it buffers excess packets rather than dropping them, as long as the shaping queue has room.
•Bursty traffic causes congestion problems including buffer overflow, latency spikes, and unfairness that traffic shaping addresses.
•Shaping differs from policing — shaping delays while policing drops. Shaping is 'friendly' to TCP; policing can hurt performance.
•Traffic contracts define parameters — CIR, PIR, burst sizes, and time intervals specify the allowed traffic profile.
•Buffer sizing is critical — too small causes loss, too large causes bufferbloat. Optimal sizing depends on bandwidth-delay product.
•Classification enables differentiated shaping — different traffic classes can receive different treatment based on priority.
•Traffic shaping is ubiquitous — from ISP rate limiting to cloud egress to API gateways, shaping appears throughout modern infrastructure.
