Computer NetworksNetwork Layer Services

Network Layer Services

LevelIntermediate

Duration75 mins

TopicNetwork Layer Services

4 / 5

Quality of Service

When Best-Effort Isn't Good Enough

A surgeon is performing a remote-assisted operation. The robotic instruments respond to her movements in real-time—a 200ms delay could be dangerous, packet loss could cause erratic movements, and jitter could make precise control impossible. Meanwhile, someone on the same network is downloading a large file.\n\nShould these two applications receive the same treatment from the network? Best-effort says yes—all traffic competes equally. But clearly, life-critical surgery should take precedence over a file download.\n\nThis is the domain of Quality of Service (QoS): mechanisms that differentiate traffic, prioritize critical applications, reserve resources, and provide performance guarantees beyond best-effort. QoS doesn't eliminate the challenges of shared networks; it provides tools to manage them intelligently.

What You Will Learn

By the end of this page, you will understand QoS comprehensively—traffic characteristics and requirements, classification and marking mechanisms, queuing disciplines and scheduling algorithms, IntServ and DiffServ architectures, and practical considerations for implementing QoS in real networks.

What is Quality of Service?

Quality of Service (QoS) refers to the ability of a network to provide differentiated performance levels to various traffic types. Rather than treating all traffic equally (best-effort), QoS-enabled networks can prioritize, reserve resources for, or guarantee performance to specific traffic flows.\n\nThe Core Problem:\n\nNetwork resources—bandwidth, buffer space, processor cycles—are finite. When demand exceeds capacity, something must give. Without QoS, all traffic suffers equally during congestion. With QoS, less important traffic can suffer while critical traffic maintains acceptable performance.\n\nQoS Performance Metrics:

Key QoS Parameters

•Bandwidth — Data rate available to a flow (bits per second). Determines how fast data can flow.
•Latency (Delay) — Time for a packet to travel from source to destination. Key for interactive applications.
•Jitter (Delay Variation) — Variation in latency between packets. Critical for real-time media.
•Packet Loss — Percentage of packets that don't arrive. Affects all applications differently.
•Availability — Percentage of time the service is accessible. Matters for critical infrastructure.

Application QoS Requirements
Application	Bandwidth Needs	Latency Tolerance	Jitter Tolerance	Loss Tolerance
VoIP	Low (64-128 kbps)	Very Low (<150ms)	Very Low (<30ms)	Low (1-3%)
Video Conferencing	Medium (1-6 Mbps)	Low (<200ms)	Low (<50ms)	Low (1%)
Live Streaming	Medium (3-20 Mbps)	Medium (<1s)	Medium	Very Low
Web Browsing	Varies	Medium	Flexible	Very Low
File Transfer	High (maximize)	Flexible	Not Critical	Zero
Email	Low	Flexible	Not Critical	Zero
Telemedicine	Medium-High	Very Low (<50ms)	Very Low	Very Low

QoS is Not About Adding Bandwidth

QoS doesn't create bandwidth—it manages scarcity. If a link has 100 Mbps capacity and 200 Mbps of traffic, QoS determines which traffic gets through smoothly and which suffers. The only way to serve all 200 Mbps is to add capacity. QoS is a management tool, not a magic solution.

Traffic Classification and Marking

Before traffic can be treated differently, it must be identified. Classification identifies traffic types, and marking tags packets for consistent treatment throughout the network.\n\nClassification Methods:

How Traffic is Classified

•Port Numbers — Well-known ports indicate applications (TCP 80 = HTTP, UDP 5060 = SIP). Simple but circumventable.
•IP Addresses — Source/destination addresses identify endpoints. Works for known servers.
•Protocol Number — IP protocol field (6 = TCP, 17 = UDP, 1 = ICMP). Coarse classification.
•Deep Packet Inspection (DPI) — Examine payload to identify applications. Accurate but computationally expensive.
•DSCP/TOS Values — Trust existing packet markings (may be set by endpoints or upstream devices).
•Access Control Lists (ACLs) — Complex rules matching multiple header fields.

Packet Marking:\n\nOnce classified, packets are marked so subsequent network devices know how to treat them without re-classification.\n\nIPv4 Type of Service (TOS) / Differentiated Services Field:

dscp_field.txt
IPv4 Header - Differentiated Services Field (8 bits)
Previously called Type of Service (TOS)
 
+-------+-------+-------+-------+-------+-------+-------+-------+
|  DS5  |  DS4  |  DS3  |  DS2  |  DS1  |  DS0  |  ECN  |  ECN  |
+-------+-------+-------+-------+-------+-------+-------+-------+
|<------- DSCP (6 bits) ------->|<-- ECN (2 bits) -->|
 
DSCP (Differentiated Services Code Point):
- 6 bits = 64 possible values
- Defines Per-Hop Behavior (PHB) class
- Common values:
  - EF (Expedited Forwarding): 46 (101110) - Premium, low-latency
  - AF11-AF43: Assured Forwarding classes (4 classes × 3 drop precedences)
  - CS0-CS7: Class Selectors (backward compatible with IP Precedence)
  - BE (Best Effort): 0 (000000) - Default
 
ECN (Explicit Congestion Notification):
- 2 bits for congestion signaling
- Allows routers to mark (not drop) congested packets

Trust Boundaries:\n\nA critical QoS concept is the trust boundary—where the network trusts markings vs. where it reclassifies.\n\n- Untrusted Edge: User devices may mark packets incorrectly (intentionally or not). Edge devices should reclassify.\n- Trusted Core: Within an administrative domain, markings are trusted for performance.\n- Peering Points: When traffic crosses administrative boundaries, markings may be remapped or ignored.

The Marking Trust Problem

If endpoints could mark their own traffic as high-priority, everyone would do so, and QoS would be meaningless. Networks must classify at ingress based on authorization (who is this user?) and traffic type (what is this application?), not blindly trust endpoint markings. Most enterprise/ISP networks rewrite DSCP at the edge.

Queuing and Scheduling

Once traffic is classified and marked, routers must decide how to handle packets when congestion occurs. Queuing buffers packets; scheduling decides which to transmit next.\n\nQueuing Disciplines:

Queue Management Techniques

•FIFO (First-In, First-Out) — Single queue, packets served in arrival order. Simple but no differentiation—best-effort.
•Priority Queuing (PQ) — Multiple queues with strict priority. High-priority is always served first. Risk: low-priority starvation.
•Weighted Fair Queuing (WFQ) — Each flow gets a queue. Scheduler interleaves based on weights, ensuring fairness.
•Class-Based WFQ (CBWFQ) — Like WFQ but queues represent classes, not flows. Simpler to configure at scale.
•Low-Latency Queuing (LLQ) — CBWFQ with a strict priority queue for real-time traffic. Priority traffic gets immediate service; others get weighted shares.

Converting Mermaid diagram...

Scheduling Algorithms:\n\nThe scheduler is the policy enforcer, deciding which queued packet to transmit next.

Scheduling Algorithm Comparison
Algorithm	Mechanism	Pros	Cons
Strict Priority	Always serve highest-priority first	Lowest latency for priority traffic	Lower priorities can starve
Round Robin	Serve one from each queue in turn	Fair to all queues	Large packets delay small packets
Weighted Round Robin	Serve proportional to weights	Weighted fairness	Doesn't account for packet sizes
Deficit Round Robin	Credits accumulate; accounts for sizes	Fair, size-aware	More complex to implement
LLQ (Cisco)	Priority + CBWFQ hybrid	Low latency + fairness	Priority queue needs policing

The Starvation Problem

With strict priority, if high-priority traffic saturates the link, low-priority traffic gets zero bandwidth. This is called starvation. Modern designs (like LLQ) police the priority queue—limiting voice to, say, 33% of bandwidth—ensuring other traffic gets served even when voice traffic spikes.

Integrated Services (IntServ)

Integrated Services (IntServ) is a QoS framework that provides per-flow guarantees using explicit resource reservation. It attempts to bring connection-oriented guarantees to the IP network.\n\nIntServ Philosophy:\n\nIntServ treats each traffic flow individually. Before sending, an application signals its requirements (bandwidth, latency). Routers along the path reserve resources. If reservation succeeds, the flow gets guaranteed service.\n\nKey Components:

IntServ Architecture

•RSVP (Resource Reservation Protocol) — Signaling protocol that carries reservation requests along the path. Sets up and maintains per-flow reservation state.
•Admission Control — Each router decides if it can meet the requested QoS. If not, reservation fails.
•Classifier — Identifies packets belonging to reserved flows.
•Packet Scheduler — Serves flows according to their reservations.

IntServ Service Classes:

IntServ Service Classes
Class	Description	Guarantees
Guaranteed Service	Worst-case bounded delay, no loss (if compliant)	Hard delay bound, zero congestion loss
Controlled Load	Approximates unloaded network conditions	Low delay, low loss (like light-load network)
Best Effort	Default IP service	None

rsvp_reservation.pseudo
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
// Conceptual RSVP reservation flow
 
// Step 1: Sender sends PATH message toward receiver
// This establishes state about the sender at each router
PATH_message = {
    sender_address: source_ip,
    traffic_spec: {
        peak_rate: 1 Mbps,
        bucket_size: 10 KB,
        min_policed_unit: 64 bytes
    },
    ad_spec: accumulated_path_properties
}
send_to_receiver(PATH_message)
 
// Step 2: Receiver responds with RESV message back toward sender
// This requests actual resource reservation
RESV_message = {
    flow_spec: {
        requested_rate: 1 Mbps,
        slack_term: 0  // No additional delay acceptable
    },
    filter_spec: {  // Identifies traffic to receive this QoS
        source_ip: sender_ip,
        source_port: 5004
    }
}
send_to_sender(RESV_message)
 
// At each router along the path:
function process_RESV(message):
    if can_reserve_resources(message.flow_spec):
        install_reservation(message.flow_spec)
        forward_RESV_upstream(message)
    else:
        send_RESV_rejection(message)

Why IntServ Didn't Scale

IntServ maintains per-flow state in every router. With millions of flows crossing a backbone router, this state explodes. Additionally, RSVP signaling is complex and adds overhead. IntServ is used in some controlled environments (enterprise, service provider edges) but not in the Internet core.

Differentiated Services (DiffServ)

Differentiated Services (DiffServ) takes a fundamentally different approach: instead of per-flow reservations, traffic is grouped into a small number of classes with defined behaviors. This dramatically reduces complexity.\n\nDiffServ Philosophy:\n\n- Traffic is marked at the network edge with a DSCP value\n- Core routers treat all packets with the same DSCP identically\n- No per-flow state in the core\n- Aggregated, class-based treatment\n\nKey Concepts:

DiffServ Architecture

•DSCP (Differentiated Services Code Point) — 6-bit field in IP header that indicates the class. 64 possible values.
•Per-Hop Behavior (PHB) — How a router treats packets of a given class. Standardized behaviors enable interoperability.
•Traffic Conditioning — At the edge: classification, marking, policing/shaping to conform traffic to agreed profiles.
•Service Level Agreements (SLAs) — Contracts specifying how much traffic of each class a customer can send and what treatment it receives.

Standard Per-Hop Behaviors:

DiffServ Per-Hop Behaviors (PHBs)
PHB	DSCP Value	Purpose	Treatment
EF (Expedited Forwarding)	46 (101110)	Premium, low-latency service	Priority queuing, minimal delay/jitter. Like a 'virtual wire'.
AF11-AF43	Various	Assured Forwarding (4 classes)	Weighted bandwidth guarantees. 3 drop precedences per class.
CS0-CS7	0-56 (multiples of 8)	Class Selectors (legacy compatible)	Backward compatible with IP Precedence bits.
BE (Default)	0	Best Effort	Standard treatment, no priority.

Assured Forwarding in Detail:\n\nAF defines 4 classes (AF1-AF4), each with 3 drop precedences:\n\n| Class | Low Drop (1) | Medium Drop (2) | High Drop (3) |\n|-------|--------------|-----------------|---------------|\n| AF1 | AF11 (10) | AF12 (12) | AF13 (14) |\n| AF2 | AF21 (18) | AF22 (20) | AF23 (22) |\n| AF3 | AF31 (26) | AF32 (28) | AF33 (30) |\n| AF4 | AF41 (34) | AF42 (36) | AF43 (38) |\n\nWithin a class, higher drop precedence packets are dropped first during congestion. Traffic can be marked with higher drop precedence if it exceeds its profile (out-of-profile marking).

DiffServ is the Practical Choice

DiffServ is the dominant QoS mechanism in real networks. It scales well (no per-flow state), is relatively simple to deploy, and provides meaningful differentiation. Most enterprise and service provider networks implement some form of DiffServ-based QoS.

Traffic Policing and Shaping

QoS requires controlling traffic rates. Policing enforces limits by dropping or remarking excess traffic. Shaping smooths traffic by delaying excess packets.\n\nToken Bucket Algorithm:\n\nBoth policing and shaping commonly use the token bucket algorithm to measure and control traffic rates.

token_bucket.pseudo
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
// Token Bucket Algorithm
// Used for traffic metering, policing, and shaping
 
class TokenBucket:
    bucket_size    // Maximum tokens (burst size)
    token_rate     // Tokens added per second (committed rate)
    current_tokens // Current token count
    
    function init(rate, burst):
        token_rate = rate
        bucket_size = burst
        current_tokens = burst  // Start full
    
    function add_tokens(elapsed_time):
        // Add tokens based on elapsed time
        new_tokens = elapsed_time * token_rate
        current_tokens = min(bucket_size, current_tokens + new_tokens)
    
    function check_packet(packet):
        packet_size = len(packet)
        
        if current_tokens >= packet_size:
            // Conforming: enough tokens
            current_tokens -= packet_size
            return CONFORM
        else:
            // Non-conforming: insufficient tokens
            return EXCEED
 
// Policing: drop or remark non-conforming
function police(packet):
    result = token_bucket.check_packet(packet)
    if result == CONFORM:
        forward(packet)
    else:
        drop(packet)  // Or remark to lower priority
 
// Shaping: delay non-conforming until tokens available
function shape(packet):
    result = token_bucket.check_packet(packet)
    if result == CONFORM:
        forward(packet)
    else:
        buffer_until_tokens_available(packet)

Policing

•Drops excess traffic immediately
•No buffering delay
•Can cause packet loss
•Typically at ingress (edge)
•Enforces contracted rates
•Uses: Rate limiting at ISP edge

Shaping

•Delays excess traffic
•Buffers during bursts
•Adds latency for bursts
•Typically at egress
•Smooths traffic patterns
•Uses: Conforming to SLAs

CIR, PIR, and Traffic Contracts

In service provider networks, customers purchase specific rates: CIR (Committed Information Rate) is guaranteed bandwidth; PIR (Peak Information Rate) is the maximum burst rate allowed. Policing ensures traffic above PIR is dropped, traffic between CIR and PIR may be marked for potential dropping during congestion, and traffic at or below CIR gets reliable forwarding.

QoS in Practice

Implementing QoS in real networks requires careful planning and ongoing management. Here are practical considerations:\n\nWhere QoS is Applied:

QoS Deployment Points

•Enterprise Edge — Classification at access layer based on port, VLAN, or deep inspection. Trust phones on voice VLAN; don't trust user PCs.
•WAN Edge — Shaping and queuing before slower WAN links where congestion is most likely.
•ISP Ingress — Policing customer traffic to contracted rates. Remarking for the ISP's internal classes.
•ISP Core — Typically overprovisioned; simple priority queuing based on DSCP markings.
•Cloud Ingress — Classification by application, tenant, or service tier.

Common QoS Design Patterns:

Typical Enterprise QoS Policy
Traffic Class	DSCP	Queue	Bandwidth	Comments
Voice	EF (46)	Priority	Up to 33%	Policed to prevent starvation
Video Conferencing	AF41 (34)	Queue 1	25%	Low latency, moderate bandwidth
Business Apps	AF21 (18)	Queue 2	25%	ERP, CRM critical apps
Default	0	Queue 3	15%	General traffic
Scavenger	CS1 (8)	Queue 4	2%	Low-priority (updates, backups)

QoS is End-to-End

QoS only works if applied consistently along the path. A single link without QoS can destroy quality even if all other links are QoS-enabled. This is especially challenging when traffic crosses the public Internet, where no end-to-end QoS is guaranteed.

QoS Across the Internet:\n\nThe public Internet is fundamentally best-effort. Cross-ISP QoS is rare because:\n- No universal trust for markings (ISP A doesn't trust ISP B's markings)\n- No universal billing for QoS (who pays for priority?)\n- Business model conflicts (prioritizing one customer may hurt another)\n\nSolutions for Critical Traffic:\n- Private MPLS networks with end-to-end QoS\n- SD-WAN with application-aware path selection\n- Overprovisioned 'premium' Internet connections\n- Application-layer adaptation (adjust quality to conditions)

Summary: Quality of Service

Quality of Service provides tools to manage network resources for diverse application needs. Let's consolidate the essential understanding:

Key Takeaways

•QoS = Differentiated Treatment — Not all traffic treated equally; critical applications get priority or guarantees.
•Classification and Marking — Traffic is identified and tagged (DSCP) for consistent treatment.
•Queuing and Scheduling — Multiple queues with scheduling algorithms determine transmission order.
•IntServ = Per-Flow Reservation — RSVP signals individual flow requirements; doesn't scale to Internet size.
•DiffServ = Per-Class Treatment — Traffic grouped into classes; scalable and widely deployed.
•Policing and Shaping — Token bucket controls traffic rates; policing drops, shaping delays.
•End-to-End Challenge — QoS must be consistent along the entire path to be effective.

What's Next:\n\nHaving covered all service models—connectionless, connection-oriented, best-effort, and QoS—we'll conclude with a comprehensive comparison of these services. Understanding when to use each model and the tradeoffs involved completes your mastery of network layer services.

Page Complete

You now understand Quality of Service comprehensively. You can explain classification, marking, queuing, scheduling, IntServ, DiffServ, and practical QoS deployment. This knowledge is essential for network engineers, architects, and anyone designing systems requiring performance guarantees.

4 / 5

Loading learning content...

Computer NetworksNetwork Layer Services

Network Layer Services

LevelIntermediate

Duration75 mins

TopicNetwork Layer Services

4 / 5

Quality of Service

When Best-Effort Isn't Good Enough

What You Will Learn

What is Quality of Service?

Key QoS Parameters

•Bandwidth — Data rate available to a flow (bits per second). Determines how fast data can flow.
•Latency (Delay) — Time for a packet to travel from source to destination. Key for interactive applications.
•Jitter (Delay Variation) — Variation in latency between packets. Critical for real-time media.
•Packet Loss — Percentage of packets that don't arrive. Affects all applications differently.
•Availability — Percentage of time the service is accessible. Matters for critical infrastructure.

Application QoS Requirements
Application	Bandwidth Needs	Latency Tolerance	Jitter Tolerance	Loss Tolerance
VoIP	Low (64-128 kbps)	Very Low (<150ms)	Very Low (<30ms)	Low (1-3%)
Video Conferencing	Medium (1-6 Mbps)	Low (<200ms)	Low (<50ms)	Low (1%)
Live Streaming	Medium (3-20 Mbps)	Medium (<1s)	Medium	Very Low
Web Browsing	Varies	Medium	Flexible	Very Low
File Transfer	High (maximize)	Flexible	Not Critical	Zero
Email	Low	Flexible	Not Critical	Zero
Telemedicine	Medium-High	Very Low (<50ms)	Very Low	Very Low

QoS is Not About Adding Bandwidth

Traffic Classification and Marking

How Traffic is Classified

•Port Numbers — Well-known ports indicate applications (TCP 80 = HTTP, UDP 5060 = SIP). Simple but circumventable.
•IP Addresses — Source/destination addresses identify endpoints. Works for known servers.
•Protocol Number — IP protocol field (6 = TCP, 17 = UDP, 1 = ICMP). Coarse classification.
•Deep Packet Inspection (DPI) — Examine payload to identify applications. Accurate but computationally expensive.
•DSCP/TOS Values — Trust existing packet markings (may be set by endpoints or upstream devices).
•Access Control Lists (ACLs) — Complex rules matching multiple header fields.

dscp_field.txt
IPv4 Header - Differentiated Services Field (8 bits)
Previously called Type of Service (TOS)
 
+-------+-------+-------+-------+-------+-------+-------+-------+
|  DS5  |  DS4  |  DS3  |  DS2  |  DS1  |  DS0  |  ECN  |  ECN  |
+-------+-------+-------+-------+-------+-------+-------+-------+
|<------- DSCP (6 bits) ------->|<-- ECN (2 bits) -->|
 
DSCP (Differentiated Services Code Point):
- 6 bits = 64 possible values
- Defines Per-Hop Behavior (PHB) class
- Common values:
  - EF (Expedited Forwarding): 46 (101110) - Premium, low-latency
  - AF11-AF43: Assured Forwarding classes (4 classes × 3 drop precedences)
  - CS0-CS7: Class Selectors (backward compatible with IP Precedence)
  - BE (Best Effort): 0 (000000) - Default
 
ECN (Explicit Congestion Notification):
- 2 bits for congestion signaling
- Allows routers to mark (not drop) congested packets

The Marking Trust Problem

Queuing and Scheduling

Queue Management Techniques

•FIFO (First-In, First-Out) — Single queue, packets served in arrival order. Simple but no differentiation—best-effort.
•Priority Queuing (PQ) — Multiple queues with strict priority. High-priority is always served first. Risk: low-priority starvation.
•Weighted Fair Queuing (WFQ) — Each flow gets a queue. Scheduler interleaves based on weights, ensuring fairness.
•Class-Based WFQ (CBWFQ) — Like WFQ but queues represent classes, not flows. Simpler to configure at scale.
•Low-Latency Queuing (LLQ) — CBWFQ with a strict priority queue for real-time traffic. Priority traffic gets immediate service; others get weighted shares.

Converting Mermaid diagram...

Scheduling Algorithms:\n\nThe scheduler is the policy enforcer, deciding which queued packet to transmit next.

Scheduling Algorithm Comparison
Algorithm	Mechanism	Pros	Cons
Strict Priority	Always serve highest-priority first	Lowest latency for priority traffic	Lower priorities can starve
Round Robin	Serve one from each queue in turn	Fair to all queues	Large packets delay small packets
Weighted Round Robin	Serve proportional to weights	Weighted fairness	Doesn't account for packet sizes
Deficit Round Robin	Credits accumulate; accounts for sizes	Fair, size-aware	More complex to implement
LLQ (Cisco)	Priority + CBWFQ hybrid	Low latency + fairness	Priority queue needs policing

The Starvation Problem

Integrated Services (IntServ)

IntServ Architecture

•RSVP (Resource Reservation Protocol) — Signaling protocol that carries reservation requests along the path. Sets up and maintains per-flow reservation state.
•Admission Control — Each router decides if it can meet the requested QoS. If not, reservation fails.
•Classifier — Identifies packets belonging to reserved flows.
•Packet Scheduler — Serves flows according to their reservations.

IntServ Service Classes:

IntServ Service Classes
Class	Description	Guarantees
Guaranteed Service	Worst-case bounded delay, no loss (if compliant)	Hard delay bound, zero congestion loss
Controlled Load	Approximates unloaded network conditions	Low delay, low loss (like light-load network)
Best Effort	Default IP service	None

rsvp_reservation.pseudo
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
// Conceptual RSVP reservation flow
 
// Step 1: Sender sends PATH message toward receiver
// This establishes state about the sender at each router
PATH_message = {
    sender_address: source_ip,
    traffic_spec: {
        peak_rate: 1 Mbps,
        bucket_size: 10 KB,
        min_policed_unit: 64 bytes
    },
    ad_spec: accumulated_path_properties
}
send_to_receiver(PATH_message)
 
// Step 2: Receiver responds with RESV message back toward sender
// This requests actual resource reservation
RESV_message = {
    flow_spec: {
        requested_rate: 1 Mbps,
        slack_term: 0  // No additional delay acceptable
    },
    filter_spec: {  // Identifies traffic to receive this QoS
        source_ip: sender_ip,
        source_port: 5004
    }
}
send_to_sender(RESV_message)
 
// At each router along the path:
function process_RESV(message):
    if can_reserve_resources(message.flow_spec):
        install_reservation(message.flow_spec)
        forward_RESV_upstream(message)
    else:
        send_RESV_rejection(message)

Why IntServ Didn't Scale

Differentiated Services (DiffServ)

DiffServ Architecture

•DSCP (Differentiated Services Code Point) — 6-bit field in IP header that indicates the class. 64 possible values.
•Per-Hop Behavior (PHB) — How a router treats packets of a given class. Standardized behaviors enable interoperability.
•Traffic Conditioning — At the edge: classification, marking, policing/shaping to conform traffic to agreed profiles.
•Service Level Agreements (SLAs) — Contracts specifying how much traffic of each class a customer can send and what treatment it receives.

Standard Per-Hop Behaviors:

DiffServ Per-Hop Behaviors (PHBs)
PHB	DSCP Value	Purpose	Treatment
EF (Expedited Forwarding)	46 (101110)	Premium, low-latency service	Priority queuing, minimal delay/jitter. Like a 'virtual wire'.
AF11-AF43	Various	Assured Forwarding (4 classes)	Weighted bandwidth guarantees. 3 drop precedences per class.
CS0-CS7	0-56 (multiples of 8)	Class Selectors (legacy compatible)	Backward compatible with IP Precedence bits.
BE (Default)	0	Best Effort	Standard treatment, no priority.

DiffServ is the Practical Choice

Traffic Policing and Shaping

token_bucket.pseudo
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
// Token Bucket Algorithm
// Used for traffic metering, policing, and shaping
 
class TokenBucket:
    bucket_size    // Maximum tokens (burst size)
    token_rate     // Tokens added per second (committed rate)
    current_tokens // Current token count
    
    function init(rate, burst):
        token_rate = rate
        bucket_size = burst
        current_tokens = burst  // Start full
    
    function add_tokens(elapsed_time):
        // Add tokens based on elapsed time
        new_tokens = elapsed_time * token_rate
        current_tokens = min(bucket_size, current_tokens + new_tokens)
    
    function check_packet(packet):
        packet_size = len(packet)
        
        if current_tokens >= packet_size:
            // Conforming: enough tokens
            current_tokens -= packet_size
            return CONFORM
        else:
            // Non-conforming: insufficient tokens
            return EXCEED
 
// Policing: drop or remark non-conforming
function police(packet):
    result = token_bucket.check_packet(packet)
    if result == CONFORM:
        forward(packet)
    else:
        drop(packet)  // Or remark to lower priority
 
// Shaping: delay non-conforming until tokens available
function shape(packet):
    result = token_bucket.check_packet(packet)
    if result == CONFORM:
        forward(packet)
    else:
        buffer_until_tokens_available(packet)

Policing

•Drops excess traffic immediately
•No buffering delay
•Can cause packet loss
•Typically at ingress (edge)
•Enforces contracted rates
•Uses: Rate limiting at ISP edge

Shaping

•Delays excess traffic
•Buffers during bursts
•Adds latency for bursts
•Typically at egress
•Smooths traffic patterns
•Uses: Conforming to SLAs

CIR, PIR, and Traffic Contracts

QoS in Practice

Implementing QoS in real networks requires careful planning and ongoing management. Here are practical considerations:\n\nWhere QoS is Applied:

QoS Deployment Points

•Enterprise Edge — Classification at access layer based on port, VLAN, or deep inspection. Trust phones on voice VLAN; don't trust user PCs.
•WAN Edge — Shaping and queuing before slower WAN links where congestion is most likely.
•ISP Ingress — Policing customer traffic to contracted rates. Remarking for the ISP's internal classes.
•ISP Core — Typically overprovisioned; simple priority queuing based on DSCP markings.
•Cloud Ingress — Classification by application, tenant, or service tier.

Common QoS Design Patterns:

Typical Enterprise QoS Policy
Traffic Class	DSCP	Queue	Bandwidth	Comments
Voice	EF (46)	Priority	Up to 33%	Policed to prevent starvation
Video Conferencing	AF41 (34)	Queue 1	25%	Low latency, moderate bandwidth
Business Apps	AF21 (18)	Queue 2	25%	ERP, CRM critical apps
Default	0	Queue 3	15%	General traffic
Scavenger	CS1 (8)	Queue 4	2%	Low-priority (updates, backups)

QoS is End-to-End

Summary: Quality of Service

Quality of Service provides tools to manage network resources for diverse application needs. Let's consolidate the essential understanding:

Key Takeaways

•QoS = Differentiated Treatment — Not all traffic treated equally; critical applications get priority or guarantees.
•Classification and Marking — Traffic is identified and tagged (DSCP) for consistent treatment.
•Queuing and Scheduling — Multiple queues with scheduling algorithms determine transmission order.
•IntServ = Per-Flow Reservation — RSVP signals individual flow requirements; doesn't scale to Internet size.
•DiffServ = Per-Class Treatment — Traffic grouped into classes; scalable and widely deployed.
•Policing and Shaping — Token bucket controls traffic rates; policing drops, shaping delays.
•End-to-End Challenge — QoS must be consistent along the entire path to be effective.

Page Complete

4 / 5