When users complain that "the network is slow," what they experience is the cumulative performance characteristics of the underlying topology. Every click, every video stream, every real-time collaboration traverses a path determined by topology decisions made months or years earlier. Performance isn't just a technical metric—it's the tangible manifestation of network architecture in everyday user experience.
Network performance encompasses multiple dimensions: how fast data travels (latency), how much data can flow (throughput), how consistent the experience is (jitter), and how the network behaves under load (congestion response). Each topology creates a unique performance profile with distinct strengths and limitations.
Understanding topology performance characteristics enables network architects to make informed decisions that directly impact application responsiveness, user productivity, and ultimately business outcomes. A video conferencing application that works flawlessly on one topology may suffer crippling quality issues on another—not due to bandwidth limitations, but due to fundamental topology-induced latency or jitter characteristics.
This page provides a comprehensive analysis of performance across network topologies. We will examine theoretical performance models, measure practical behavior under different traffic conditions, and develop optimization strategies tailored to each topology's characteristics.
By the end of this page, you will be able to: (1) Analyze latency characteristics including propagation, transmission, queuing, and processing delays, (2) Calculate theoretical throughput for each topology type, (3) Understand jitter sources and their topology-specific behaviors, (4) Predict congestion behavior and oversubscription impacts, (5) Apply Quality of Service (QoS) principles within topology constraints, and (6) Optimize performance for specific workloads and topologies.
Latency is the time elapsed between sending data and receiving confirmation or response. It is the most perceptible performance metric—users notice latency as "slowness" or "responsiveness." Understanding latency requires decomposing it into fundamental components and analyzing how each topology affects them.
Latency Components
1. Propagation Delay (d_prop) Time for signal to travel through physical media:
d_prop = Distance / Propagation Speed
• Copper: ~2.1 × 10⁸ m/s (0.7c) • Fiber: ~2.0 × 10⁸ m/s (0.67c) • Air (wireless): ~3.0 × 10⁸ m/s (c)
For 100m copper cable: 100 / (2.1 × 10⁸) = 0.48 μs
2. Transmission Delay (d_trans) Time to push all bits onto the medium:
d_trans = Packet Size / Link Bandwidth
• 1,500 bytes at 1 Gbps: 12 μs • 1,500 bytes at 10 Gbps: 1.2 μs • 1,500 bytes at 100 Gbps: 0.12 μs
3. Processing Delay (d_proc) Time for network devices to process packets: • Simple switch forwarding: 1-5 μs (cut-through) • Store-and-forward switch: 10-50 μs • Router with ACL checking: 50-200 μs • Firewall with deep inspection: 100-1000 μs
4. Queuing Delay (d_queue) Time waiting in device queues: • Light load: Negligible • Moderate load: Microseconds to milliseconds • Heavy load: Milliseconds to tens of milliseconds • Overloaded: Queue drops (packet loss)
Total Latency
d_total ≈ (d_prop + d_trans + d_proc + d_queue) × Number of Hops
(This approximation assumes similar delays at each hop; strictly, total latency is the sum of the four components at every hop along the path.)
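To see how these components combine, here is a minimal sketch that sums per-hop delay for a hypothetical path; the distances, packet size, and device delays are illustrative assumptions, not measurements from any particular product.

```python
# Minimal per-hop latency model (illustrative values, not vendor specifications).
PROPAGATION_SPEED_FIBER = 2.0e8  # m/s, roughly 0.67c

def hop_latency_us(distance_m, packet_bytes, link_gbps, proc_us, queue_us):
    """Sum propagation, transmission, processing, and queuing delay for one hop (microseconds)."""
    d_prop = distance_m / PROPAGATION_SPEED_FIBER * 1e6
    d_trans = (packet_bytes * 8) / (link_gbps * 1e9) * 1e6
    return d_prop + d_trans + proc_us + queue_us

# Example: a 1,500-byte packet crossing three store-and-forward switches,
# each reached over 100 m of fiber at 10 Gbps, with light queuing.
per_hop = hop_latency_us(100, 1500, 10, proc_us=30, queue_us=5)
print(f"Per-hop latency: {per_hop:.2f} us, 3-hop path total: {3 * per_hop:.2f} us")
```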
| Topology | Typical Hops | Base Latency | Worst Case | Latency Variance |
|---|---|---|---|---|
| Bus | 0-1 | < 10 μs | Collision backoff: ms | High (contention) |
| Star | 2 | 10-50 μs | 100 μs | Low (switched) |
| Ring | n/2 average | Varies with ring size | n × node delay | Moderate (deterministic) |
| Full Mesh | 1-2 | < 20 μs | 50 μs | Very Low |
| Partial Mesh | 2-4 | 20-100 μs | 200 μs | Low-Moderate |
| Two-Tier | 4 | 40-100 μs | 300 μs | Low |
| Three-Tier | 6-8 | 60-200 μs | 500 μs | Low |
| Spine-Leaf | 3-4 | 20-60 μs | 100 μs | Very Low |
Topology-Specific Latency Behaviors
Bus Topology: Contention-Dominated Latency
Bus latency is unpredictable due to CSMA/CD contention: • Best case (no contention): < 10 μs • Light load: 20-50 μs average • Moderate load: 100-500 μs (collision backoffs) • Heavy load: Milliseconds, with significant variance
The collision backoff algorithm introduces exponential variance—latency can range from microseconds to tens of milliseconds in the same conversation.
Star Topology: Consistent, Switch-Dependent
Star latency is highly predictable: • Two hops for any intra-network communication • Latency dominated by switch processing time • Cut-through switches: 5-10 μs added latency • Store-and-forward: 20-50 μs added latency
Variance is minimal—makes star excellent for latency-sensitive applications.
Ring Topology: Distance-Dependent
Ring latency depends on position: • Adjacent nodes: 1 hop, minimal latency • Opposite side of ring: n/2 hops • Token passing rings add token wait time (0.5 × token rotation time average)
For large rings, maximum latency becomes significant: 100-node ring = 50-hop worst case.
Full Mesh: Optimal Latency
Full mesh provides minimum possible latency: • Direct path between any two nodes • 1 hop (typically) for peer communication • No intermediate forwarding delays • Latency bounded by propagation + single switch processing
Hierarchical: Tiered, Predictable
Hierarchy trades latency for scalability: • Intra-access-switch: 2 hops (10-30 μs) • Inter-access, same distribution: 4 hops (40-80 μs) • Inter-distribution: 6+ hops (60-150 μs)
Latency is predictable based on topology distance—excellent for capacity planning.
Different applications have different latency tolerances. Voice: < 150ms one-way. Video conferencing: < 400ms. Real-time gaming: < 50ms. Financial trading: < 1ms. When selecting topology, sum worst-case topology latency + WAN latency + application processing and verify it meets requirements.
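One way to apply this check is a simple budget sum, as in the sketch below; the budget and the individual contributions are placeholder assumptions for a hypothetical video-conferencing path.

```python
# Hypothetical worst-case one-way latency budget check (all values are assumptions).
budget_ms = 400.0  # video-conferencing one-way target from the guidance above

contributions_ms = {
    "lan_topology_worst_case": 0.5,   # e.g., three-tier worst case (~500 us)
    "wan_path": 80.0,                 # assumed figure for the WAN segment
    "application_processing": 120.0,  # encode/decode and playout buffering
}

total = sum(contributions_ms.values())
verdict = "within" if total <= budget_ms else "exceeds"
print(f"Total one-way latency: {total:.1f} ms ({verdict} the {budget_ms:.0f} ms budget)")
```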
Throughput is the actual rate of successful data delivery across the network. Unlike raw bandwidth (link capacity), throughput accounts for protocol overhead, contention, and topology-induced constraints. Understanding how topologies affect throughput is essential for capacity planning.
Throughput vs. Bandwidth
• Bandwidth: Maximum theoretical capacity (e.g., 1 Gbps port) • Throughput: Actual delivered bits per second (e.g., 940 Mbps) • Efficiency: Throughput / Bandwidth (typically 70-95%)
Topology Throughput Characteristics
| Topology (50 nodes) | Link Speed | Theoretical Max | Practical Max | Limiting Factor |
|---|---|---|---|---|
| Bus | 10 Mbps | 10 Mbps | ~4 Mbps | Collision overhead (60-70%) |
| Star (1G switch) | 1 Gbps | ~24 Gbps (backplane) | ~20 Gbps | Switch backplane, uplinks |
| Ring (FDDI) | 100 Mbps | 100 Mbps | ~80 Mbps | Token hold time, ring sharing |
| Full Mesh (1G) | 1 Gbps/link | ~1.2 Tbps | ~1 Tbps | Node processing, NIC limits |
| Two-Tier (10G uplink) | 1G access, 10G dist | ~200 Gbps | ~150 Gbps | Distribution uplinks |
| Spine-Leaf (10G) | 10 Gbps | ~500 Gbps | ~400 Gbps | Oversubscription ratio |
Bus Topology: Throughput Collapse Under Load
Bus throughput degrades non-linearly as utilization increases:
Efficiency = 1 / (1 + 5a)
where a = propagation_time / packet_transmission_time
At high utilization, Ethernet bus efficiency drops to 30-40% due to collisions. A 10 Mbps network might deliver only 3-4 Mbps under heavy load.
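The sketch below evaluates this approximation for a worst-case shared segment (minimum-size frames on a long collision domain); the cable length and frame size are illustrative assumptions chosen to show why efficiency collapses.

```python
# CSMA/CD efficiency approximation: 1 / (1 + 5a), with a = t_prop / t_trans.
# Illustrative worst case: 64-byte frames on a 2,500 m copper collision domain at 10 Mbps.
prop_speed = 2.1e8                 # m/s in copper
t_prop = 2500 / prop_speed         # one-way propagation time (s)
t_trans = (64 * 8) / 10e6          # minimum frame transmission time at 10 Mbps (s)

a = t_prop / t_trans
efficiency = 1 / (1 + 5 * a)
print(f"a = {a:.3f}, efficiency ~ {efficiency:.0%}, "
      f"usable throughput ~ {10 * efficiency:.1f} Mbps")
```

With large frames on a short segment the same formula predicts efficiency above 90%, which is why the practical figure depends so heavily on traffic mix.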
Star Topology: Switched Throughput
Modern star networks use switches that provide: • Full wire-speed per port (non-blocking) • Aggregate throughput = sum of simultaneous conversations • Backplane capacity typically 2× sum of port capacities
A 48-port 1GbE switch with 100 Gbps backplane can forward at wire speed for typical traffic patterns.
Full Mesh: Maximum Aggregate Throughput
Full mesh provides theoretical maximum throughput: • Every node can transmit simultaneously to different destinations • No shared links (except at node interfaces) • Aggregate throughput scales with n(n-1)/2 links
For 20 nodes with 10 Gbps links: 190 links × 10 Gbps = 1.9 Tbps aggregate capacity.
Hierarchical: Oversubscription Design
Hierarchical networks intentionally oversubscribe to balance cost and performance:
Oversubscription Ratio = Access Port Bandwidth / Uplink Bandwidth
Typical ratios: • Access → Distribution: 4:1 to 8:1 • Distribution → Core: 2:1 to 4:1 • Data center leaf → spine: 1:1 to 3:1
Example: 48 × 1GbE access ports with 2 × 10GbE uplinks
Oversubscription = 48 Gbps / 20 Gbps = 2.4:1
If all ports transmit simultaneously, each gets ~417 Mbps upstream (ignoring local switching).
```python
def calculate_throughput_characteristics(
    topology: str,
    num_nodes: int,
    link_speed_gbps: float,
    uplink_speed_gbps: float = None
) -> dict:
    """
    Calculate throughput characteristics for different topologies.
    """
    if topology == 'bus':
        # Bus: shared medium, collision overhead
        efficiency = 0.35  # Typical under load
        max_aggregate = link_speed_gbps * efficiency
        per_node_average = max_aggregate / num_nodes
        bottleneck = "Collision domain"
    elif topology == 'star':
        # Star: switched, limited by backplane and uplinks
        switch_backplane = link_speed_gbps * num_nodes * 2  # Typical 2x
        max_aggregate = min(switch_backplane, link_speed_gbps * num_nodes)
        per_node_average = link_speed_gbps * 0.95  # Near wire speed
        bottleneck = "Switch backplane / uplinks"
    elif topology == 'ring':
        # Ring: shared ring bandwidth
        efficiency = 0.75  # Better than bus, no collisions
        max_aggregate = link_speed_gbps * efficiency
        per_node_average = max_aggregate / num_nodes
        bottleneck = "Ring capacity"
    elif topology == 'full_mesh':
        # Full mesh: maximum theoretical throughput
        num_links = (num_nodes * (num_nodes - 1)) // 2
        max_aggregate = num_links * link_speed_gbps
        per_node_average = (num_nodes - 1) * link_speed_gbps * 0.9
        bottleneck = "Node interface capacity"
    elif topology == 'hierarchical':
        # Hierarchical: oversubscription model
        if uplink_speed_gbps is None:
            uplink_speed_gbps = link_speed_gbps  # Default 1:1
        oversubscription = (num_nodes * link_speed_gbps) / uplink_speed_gbps
        max_aggregate = min(num_nodes * link_speed_gbps, uplink_speed_gbps)
        per_node_average = uplink_speed_gbps / num_nodes
        bottleneck = f"Uplink capacity ({oversubscription:.1f}:1 oversubscription)"
    elif topology == 'spine_leaf':
        # Spine-leaf: low oversubscription
        oversubscription = 1.5  # Typical for spine-leaf
        max_aggregate = num_nodes * link_speed_gbps / oversubscription
        per_node_average = link_speed_gbps / oversubscription
        bottleneck = "Spine bandwidth"
    else:
        raise ValueError(f"Unknown topology: {topology}")

    return {
        'topology': topology,
        'num_nodes': num_nodes,
        'link_speed_gbps': link_speed_gbps,
        'max_aggregate_gbps': max_aggregate,
        'per_node_average_gbps': per_node_average,
        'bottleneck': bottleneck
    }


# Compare throughput for 100-node networks
print("=" * 75)
print(f"{'THROUGHPUT ANALYSIS: 100-NODE NETWORK':^75}")
print("=" * 75)

configs = [
    ('bus', 100, 0.01, None),          # 10 Mbps bus
    ('star', 100, 1.0, None),          # 1G star
    ('ring', 100, 0.1, None),          # 100 Mbps ring
    ('full_mesh', 20, 10.0, None),     # 10G mesh (20 nodes max practical)
    ('hierarchical', 100, 1.0, 20.0),  # 1G access, 20G uplink
    ('spine_leaf', 100, 10.0, None),   # 10G spine-leaf
]

for topo, nodes, link, uplink in configs:
    result = calculate_throughput_characteristics(topo, nodes, link, uplink)
    print(f"\n{result['topology'].upper()}:")
    print(f"  Nodes: {result['num_nodes']}, Link Speed: {result['link_speed_gbps']} Gbps")
    print(f"  Max Aggregate Throughput: {result['max_aggregate_gbps']:.2f} Gbps")
    print(f"  Per-Node Average: {result['per_node_average_gbps']*1000:.2f} Mbps")
    print(f"  Bottleneck: {result['bottleneck']}")
```

Oversubscription isn't a design flaw—it's an economic necessity. If 1,000 users each have 1 Gbps ports, provisioning 1 Tbps of core bandwidth would be prohibitively expensive. Traffic analysis shows typical utilization of 1-10% per user, making 10:1 oversubscription reasonable for many workloads. Match oversubscription to actual traffic patterns.
Jitter is the variation in packet arrival times. While latency measures average delay, jitter measures the consistency of that delay. Low jitter is critical for real-time applications (voice, video, control systems) that rely on predictable timing.
Jitter = |Delay(packet n+1) - Delay(packet n)|
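As a small illustration, the sketch below applies this definition to a list of one-way delay samples; the sample values are invented for the example.

```python
# Instantaneous jitter as consecutive delay differences (illustrative samples).
delays_ms = [20.1, 20.3, 19.8, 25.6, 20.2, 20.0]  # hypothetical one-way delays

jitter_ms = [abs(b - a) for a, b in zip(delays_ms, delays_ms[1:])]
print(f"Per-packet jitter (ms): {[round(j, 1) for j in jitter_ms]}")
print(f"Mean jitter: {sum(jitter_ms)/len(jitter_ms):.2f} ms, "
      f"peak jitter: {max(jitter_ms):.1f} ms")
```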
Why Jitter Matters
• Voice: Jitter > 30ms causes audible quality degradation • Video: Jitter causes stutter, buffering, frame drops • Industrial Control: Jitter can cause control loop instability • Financial Trading: Jitter creates timing uncertainty
Jitter Sources
| Topology | Typical Jitter | Worst Case Jitter | Deterministic | Real-Time Suitability |
|---|---|---|---|---|
| Bus (CSMA/CD) | 1-50 ms | 100+ ms | No | Poor |
| Star (Switched) | 10-100 μs | 1-5 ms | High | Excellent |
| Ring (Token) | 100 μs-1 ms | 5-10 ms | Moderate | Good |
| Ring (TDMA) | < 10 μs | < 100 μs | Yes | Excellent |
| Full Mesh | 10-100 μs | 500 μs | High | Excellent |
| Partial Mesh | 50-500 μs | 2-5 ms | Moderate | Good |
| Hierarchical | 100 μs-1 ms | 5-10 ms | Moderate | Good |
| Spine-Leaf | 50-200 μs | 1-2 ms | High | Very Good |
Topology-Specific Jitter Behavior
Bus Topology: Maximum Jitter
CSMA/CD introduces extreme jitter variability: • Collision backoff can range from 0 to 1,023 slot times • Exponential backoff adds multiplicative jitter on congested networks • A single packet may experience 0 μs or 50 ms delay depending on contention
Bus topology is fundamentally incompatible with jitter-sensitive applications.
Star Topology: Consistent, Low Jitter
Switched star networks provide excellent jitter characteristics: • No contention within switch fabric • Consistent processing time per packet • Jitter primarily from queue depth variation • Priority queuing can guarantee jitter bounds
Ring Topology: Deterministic Token-Based
Traditional token ring provides bounded jitter: • Maximum wait = token rotation time • Deterministic access order • Jitter proportional to ring size
Modern industrial rings (PROFINET, EtherNet/IP) use time-division for microsecond-level jitter.
Full Mesh: Minimal Jitter
Direct paths minimize jitter sources: • No intermediate queuing (single hop) • Consistent path for each flow • Jitter dominated by node processing variation • Ideal for lowest-jitter requirements
Hierarchical: Manageable with QoS
Hierarchy introduces more jitter sources (more hops) but is manageable: • Each tier adds queuing/processing jitter • Total jitter compounds across tiers • QoS mechanisms at each tier can bound jitter • Well-designed hierarchy achieves < 5ms jitter
Time-Sensitive Networking (TSN)
For deterministic networking needs, TSN standards (IEEE 802.1Qbv, etc.) provide: • Time-synchronized transmission • Guard bands for scheduled traffic • Guaranteed latency and jitter bounds
TSN is topology-independent but effectiveness varies: • Star/hierarchy: Full TSN benefits • Mesh: Complex scheduling across multiple paths
Jitter budgets are consumed end-to-end: a voice call tolerating 30 ms of total jitter must allocate that budget across LAN jitter (2-5 ms), WAN jitter (10-20 ms), and codec buffering (5-10 ms). Leave margin for peaks. Measure actual jitter with tools like iperf, TWAMP, or dedicated network monitoring. Topology selection directly impacts the LAN component.
Network performance ultimately depends on how topologies behave when demand exceeds capacity. Congestion occurs when offered load approaches or exceeds available bandwidth. Different topologies exhibit dramatically different congestion behaviors—some degrade gracefully while others collapse.
Congestion Indicators
• Queue buildup: Increasing latency as packets wait • Packet loss: Queue overflow drops packets • Throughput reduction: TCP backoff reduces aggregate throughput • Jitter increase: Variable queue depths cause timing variation • Complete collapse: Severe congestion can halt all traffic
Congestion Severity Levels
| Level | Utilization | Symptoms | Recovery |
|---|---|---|---|
| Normal | 0-50% | None | N/A |
| Moderate | 50-80% | Slight latency increase | Self-limiting |
| Heavy | 80-95% | Noticeable latency, some drops | Manual intervention may help |
| Overload | 95-100% | High latency, significant drops | Requires load reduction |
| Collapse | >100% | Congestion collapse, near-zero throughput | Emergency response |
| Topology | Degradation Mode | Collapse Risk | Recovery Behavior | Mitigation Tools |
|---|---|---|---|---|
| Bus | Exponential degradation | High (collision cascade) | Very slow (backoff) | None effective |
| Star | Graceful queue buildup | Low (isolated ports) | Fast (queue drain) | QoS, policing |
| Ring | Token delays increase | Moderate | Moderate (token purge) | Priority tokens |
| Full Mesh | Per-link degradation | Very Low (isolated) | Fast (no interaction) | Load balancing |
| Partial Mesh | Hotspot formation | Low-Moderate | Moderate | Traffic engineering |
| Hierarchical | Aggregation point queuing | Low (tiered) | Moderate | QoS, traffic shaping |
| Spine-Leaf | ECMP balancing, gradual | Very Low | Fast | Load balancing, QoS |
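To make the severity levels actionable in monitoring, a script can map measured utilization to a level; the sketch below simply mirrors the thresholds in the severity table above and should be tuned to your own topology and traffic mix.

```python
# Map measured link utilization to the congestion severity levels in the table above.
# Thresholds mirror the table and should be tuned per topology and traffic mix.
def congestion_level(utilization_pct: float) -> str:
    if utilization_pct > 100:
        return "Collapse"
    if utilization_pct >= 95:
        return "Overload"
    if utilization_pct >= 80:
        return "Heavy"
    if utilization_pct >= 50:
        return "Moderate"
    return "Normal"

for pct in (35, 65, 88, 97, 110):
    print(f"{pct:>3}% utilization -> {congestion_level(pct)}")
```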
Congestion Behavior Deep Dive
Bus Topology: Catastrophic Congestion Collapse
Bus networks experience positive feedback loops during congestion:
At 30-40% offered load, bus throughput begins declining. At 60%+ offered load, throughput can drop below 10% of capacity. This makes bus networks extremely fragile under load.
Star Topology: Isolated, Manageable Congestion
Switched star networks localize congestion: • Port-to-port congestion doesn't affect unrelated traffic • Head-of-line blocking solved by virtual output queuing (VOQ) • Switch buffers absorb burst traffic • Congestion affects only overloaded ports/VLANs
Congestion is visible, localized, and manageable with QoS.
Full Mesh: Maximum Congestion Isolation
Mesh topologies provide inherent congestion isolation: • Each link is independent • Congestion on A→B doesn't affect C→D • Routing protocols can avoid congested links • Per-link queuing contains problems
However, if node processing becomes congested, mesh doesn't help.
Hierarchical: Aggregation Creates Bottlenecks
Hierarchy concentrates traffic at aggregation points: • Distribution switches aggregate access traffic • Core switches aggregate distribution traffic • Congestion typically manifests at uplinks first
Mitigation requires: • Adequate oversubscription ratios • QoS policies at each tier • Traffic engineering to balance load • Monitoring to identify emerging hotspots
Spine-Leaf: ECMP Distributes Load
Spine-leaf uses multiple equal-cost paths: • Traffic hashed across multiple spine switches • No single aggregation bottleneck • Congestion spreads across fabric • Graceful degradation under load
Networks often appear fine until they fall off a "congestion cliff." At 70% utilization, everything works. At 80%, minor effects. At 90%, severe degradation. Monitor utilization trends and provision before reaching 80% sustained. The cliff location varies by topology—bus falls earliest, mesh latest.
Quality of Service (QoS) encompasses mechanisms that prioritize certain traffic over others, providing differentiated performance guarantees. QoS effectiveness varies dramatically by topology—some topologies enable sophisticated QoS while others fundamentally cannot prioritize traffic.
QoS Mechanisms
1. Classification and Marking • Identify traffic by application, port, address, or DSCP marking • Tag packets for priority treatment • Must be configured at ingress points
2. Queuing • Multiple queues with different priorities • Strict Priority (SP): Higher priority always first • Weighted Fair Queuing (WFQ): Proportional bandwidth allocation • Class-Based Weighted Fair Queuing (CBWFQ): Guaranteed minimums
3. Scheduling • Controls when packets are transmitted • Shapes traffic to avoid downstream congestion • Provides bandwidth guarantees
4. Policing and Shaping • Limit traffic to contracted rates • Drop or remark excess traffic • Smooth burst traffic
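Policing and shaping are commonly built on a token bucket. The sketch below is a minimal illustration of that idea, not any vendor's implementation; the rate and burst values are arbitrary.

```python
import time

class TokenBucket:
    """Minimal token-bucket policer: traffic within rate/burst passes, excess is dropped."""
    def __init__(self, rate_bps: float, burst_bytes: float):
        self.rate = rate_bps / 8.0      # refill rate in bytes per second
        self.capacity = burst_bytes     # maximum burst size in bytes
        self.tokens = burst_bytes
        self.last = time.monotonic()

    def allow(self, packet_bytes: int) -> bool:
        now = time.monotonic()
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= packet_bytes:
            self.tokens -= packet_bytes
            return True
        return False                    # a policer would drop or re-mark this packet

# Police a hypothetical flow to 1 Mbps with a 15 KB burst allowance.
bucket = TokenBucket(rate_bps=1_000_000, burst_bytes=15_000)
results = [bucket.allow(1500) for _ in range(20)]  # 20 back-to-back 1,500-byte packets
print(f"Conforming: {sum(results)}, dropped: {results.count(False)}")
```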
| Topology | QoS Capability | Mechanisms Available | End-to-End Guarantee | Complexity |
|---|---|---|---|---|
| Bus | None | CSMA/CD has no priority | Cannot guarantee | N/A |
| Star | Excellent | Full queuing, policing, shaping | Per-switch, end-to-end possible | Moderate |
| Ring (token) | Limited | Priority tokens | Ring-level priority | Low |
| Ring (TSN) | Excellent | Time-synchronized, guard bands | Deterministic bounds | High |
| Full Mesh | Excellent | Per-link full QoS | Easy (1 hop) | Low |
| Partial Mesh | Good | Per-link, needs coordination | Requires path awareness | Moderate-High |
| Hierarchical | Excellent | Full stack per tier | End-to-end with design | Moderate |
| Spine-Leaf | Excellent | Uniform, ECMP-aware | Consistent across fabric | Moderate |
QoS Implementation Guidelines by Topology
Star/Hierarchical Networks: Full QoS Stack
Implement QoS at each network tier:
1. Edge/Access Layer:
• Classify and mark traffic at ingress
• Trust markings from IP phones, video endpoints
• Apply ingress policing for untrusted sources
2. Distribution Layer:
• Queuing for aggregated traffic
• Re-mark non-conformant traffic
• Traffic shaping toward core
3. Core Layer:
• High-speed queuing (minimal depth)
• Honor DSCP markings from distribution
• Never modify markings (trust lower layers)
Common QoS Classes (Cisco example)
| Priority | Queue | DSCP | Traffic Type |
|---|---|---|---|
| 1 (Highest) | Priority Queue | EF (46) | Voice RTP |
| 2 | AF41 Class | AF41 (34) | Video conferencing |
| 3 | AF31 Class | AF31 (26) | Mission-critical data |
| 4 | AF21 Class | AF21 (18) | Transactional data |
| 5 | Default | BE (0) | Best effort, bulk |
| 6 | Scavenger | CS1 (8) | Non-business, deprioritized |
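In practice, edge classification often reduces to a lookup such as the sketch below, which mirrors the DSCP values in the table; the category names are illustrative, not a standard.

```python
# Illustrative mapping from traffic category to DSCP value, following the class table above.
DSCP_BY_CATEGORY = {
    "voice_rtp": 46,            # EF
    "video_conferencing": 34,   # AF41
    "mission_critical": 26,     # AF31
    "transactional": 18,        # AF21
    "best_effort": 0,           # BE
    "scavenger": 8,             # CS1
}

def mark(category: str) -> int:
    """Return the DSCP value to set at the ingress edge (default: best effort)."""
    return DSCP_BY_CATEGORY.get(category, 0)

print(mark("voice_rtp"), mark("scavenger"), mark("unknown_app"))
```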
Spine-Leaf QoS Considerations
• ECMP hashing is flow-based—QoS applies per-flow • Ensure all spine switches have identical QoS policies • Leaf switches perform classification at edge • Consistent marking across all paths • Consider flow polarization effects on queue utilization
Mesh Network QoS Challenges
Mesh networks complicate QoS: • Multiple paths with potentially different QoS configurations • Dynamic routing may shift traffic to different QoS-enabled paths • Requires coordinated QoS policy across all mesh nodes • Often simplified to trust-based model (trust peer markings)
QoS reshuffles existing bandwidth—it cannot create more. If a link is 100% utilized, QoS can ensure voice gets its 10% share, but the other 90% remains contested. QoS without capacity planning is ineffective. Combined, they provide guaranteed service levels.
Different applications generate different traffic patterns, and topologies vary in how well they accommodate each pattern. Matching topology to expected traffic patterns is essential for optimal performance.
Traffic Pattern Types
1. Client-Server (North-South) Traffic flows from clients to centralized servers and back: • Typical: 80-90% of enterprise traffic historically • Pattern: Many clients → few servers • Volume: Moderate, predictable • Optimized by: Hierarchical topologies (aggregation toward servers)
2. Peer-to-Peer (East-West) Traffic flows between nodes at the same tier: • Modern data centers: Often >70% east-west • Pattern: Any node → any other node • Volume: Can be very high • Optimized by: Mesh, spine-leaf, flat topologies
3. Broadcast/Multicast Traffic from one source to many destinations: • Examples: ARP, service discovery, streaming • Pattern: 1 → many • Volume: Modest per-stream, but replicates • Optimized by: Star (switch replication), tree (hierarchical multicast)
4. Anycast Traffic to nearest instance of a service: • Examples: DNS, CDN, load balancing • Pattern: Client → nearest server • Volume: Highly variable • Optimized by: Mesh (routing to nearest), hierarchical (localized services)
| Topology | North-South | East-West | Broadcast | Anycast | Best Fit |
|---|---|---|---|---|---|
| Star | Good | Poor (hairpin) | Excellent | N/A | Small client-server |
| Ring | Moderate | Moderate | Good | N/A | Sequential workflows |
| Full Mesh | Good | Excellent | Poor (n copies) | Excellent | High east-west |
| Partial Mesh | Good | Good | Moderate | Good | Balanced workloads |
| Three-Tier | Excellent | Poor | Good (with IGMP) | Poor | Traditional enterprise |
| Spine-Leaf | Good | Excellent | Moderate | Good | Data center, cloud |
| Hybrid | Good | Good | Good | Good | Mixed requirements |
East-West Traffic: The Modern Challenge
Modern data centers have shifted dramatically toward east-west traffic:
• Traditional (2000s): ~80% north-south, ~20% east-west • Modern (2020s): 20-30% north-south, 70-80% east-west
Drivers of east-west traffic: • Microservices architectures (service-to-service calls) • Distributed storage (replication, striping) • Container orchestration (pod-to-pod communication) • Machine learning (parameter servers, distributed training) • Database clusters (replication, distributed queries)
Three-tier hierarchy was designed for north-south and performs poorly with heavy east-west: • East-west traffic must traverse up to core, then back down • Core becomes severe bottleneck • Latency doubles (2 traversals vs. 1) • This drove the adoption of spine-leaf in data centers
Spine-Leaf Optimizes East-West
Spine A         Spine B         Spine C
   ║      ╳        ║      ╳        ║
Leaf 1          Leaf 2          Leaf 3
   │               │               │
Servers         Servers         Servers
(Vertical bars show each leaf's uplink to the spine directly above it; the ╳ marks the crossed uplinks to the other spines. Every leaf connects to every spine; spines do not interconnect with each other, and neither do leaves.)
• Every leaf connects to every spine • Maximum 3 hops between any two servers • No traffic tromboning through core • Equal bandwidth to any destination
Matching Topology to Workload
| Workload | Primary Pattern | Recommended Topology |
|---|---|---|
| Office workers | North-south | Three-tier hierarchical |
| VDI cluster | North-south | Star/two-tier |
| Hadoop/Spark | East-west | Spine-leaf |
| Kubernetes | East-west | Spine-leaf |
| HPC cluster | East-west, all-to-all | Full mesh / fat-tree |
| Video surveillance | North-south (ingest), east-west (storage) | Hybrid |
| IoT sensors | North-south | Hierarchical with edge |
Don't assume traffic patterns—measure them. NetFlow, sFlow, or SPAN port analysis reveals actual traffic matrices. Many organizations discover unexpected patterns: a "north-south" workload with massive east-west backup traffic, or "east-west" microservices making most calls to external APIs. Let data drive topology decisions.
Once a topology is deployed, various optimization strategies can extract maximum performance within that topology's constraints. These strategies range from link-level tuning to architectural enhancements.
Link-Level Optimizations
1. Maximum Transmission Unit (MTU) Tuning • Standard MTU: 1,500 bytes (Ethernet default) • Jumbo frames: 9,000 bytes (data center standard) • Benefit: Reduced packet header overhead, lower CPU utilization (see the overhead sketch after this list) • Requirement: All devices in path must support the same MTU
2. Link Aggregation (LAG/LACP) • Bundle multiple physical links as single logical link • Provides both redundancy and increased bandwidth • Hash-based load distribution per-flow • Effective up to 8 links typically
3. Full Duplex Optimization • Ensure all links operate at full duplex • Half-duplex restricts bandwidth to one direction at a time • Auto-negotiation issues can force half-duplex unintentionally
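As a back-of-the-envelope illustration of the MTU benefit mentioned above, the sketch below compares TCP goodput efficiency for standard and jumbo frames, assuming IPv4 and TCP headers without options.

```python
# Rough goodput efficiency for standard vs. jumbo frames (TCP/IPv4 over Ethernet).
# Assumes 20 B IP + 20 B TCP headers inside the frame, plus 18 B Ethernet header/FCS
# and 20 B preamble + inter-frame gap consumed on the wire.
def goodput_efficiency(mtu_bytes: int) -> float:
    payload = mtu_bytes - 40          # TCP payload per packet
    wire_bytes = mtu_bytes + 18 + 20  # bytes consumed on the wire per packet
    return payload / wire_bytes

for mtu in (1500, 9000):
    print(f"MTU {mtu}: ~{goodput_efficiency(mtu):.1%} of link bandwidth as TCP goodput")
```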
Routing Optimizations
1. Equal-Cost Multi-Path (ECMP) • Multiple paths with same metric share load • Hash traffic across available paths (see the hashing sketch after this list) • Effective in mesh and spine-leaf topologies • Can achieve near-linear bandwidth scaling
2. Traffic Engineering • Explicit path selection based on traffic characteristics • Balance load across non-equal paths • Technologies: MPLS-TE, SD-WAN, segment routing
3. Fast Convergence Tuning • Reduce routing protocol timers for faster failover • BFD (Bidirectional Forwarding Detection) for sub-second detection • Trade-off: Faster convergence increases CPU load
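ECMP devices typically hash the flow 5-tuple onto one of the equal-cost paths so that packets of a single flow stay in order on one path. The sketch below illustrates the idea; real switches use hardware hash functions with vendor-specific fields and seeds.

```python
# Illustrative ECMP path selection: hash the flow 5-tuple onto one of N equal-cost paths.
import hashlib

def ecmp_path(src_ip, dst_ip, proto, src_port, dst_port, num_paths):
    key = f"{src_ip}|{dst_ip}|{proto}|{src_port}|{dst_port}".encode()
    digest = hashlib.sha256(key).digest()
    return int.from_bytes(digest[:4], "big") % num_paths

flows = [("10.0.1.5", "10.0.2.9", "tcp", 40000 + i, 443) for i in range(6)]
for flow in flows:
    print(f"Flow {flow} -> spine path {ecmp_path(*flow, num_paths=4)}")
```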
Buffering and Queue Optimization
Buffer Sizing
Switch buffers absorb burst traffic—sizing affects performance:
• Shallow buffers (< 1 MB): Low latency, early drops under burst • Deep buffers (> 16 MB): Higher latency, absorbs bursts • Ideal: Match buffer depth to bandwidth-delay product of expected flows
Buffer_depth = Bandwidth × RTT × Number_of_flows
(This is a conservative upper bound; with many desynchronized flows, substantially less buffering is typically sufficient.)
For 10 Gbps with 1 ms RTT and 100 flows:
Buffer = 10 Gbps × 0.001 s × 100 = 1 Gbit ≈ 125 MB (upper bound)
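A small helper for this calculation, using the upper-bound formula above with the same example values, alongside the commonly cited BDP/√N reduction for many desynchronized flows:

```python
# Buffer sizing from the upper-bound formula above (illustrative values).
def buffer_bytes(bandwidth_bps: float, rtt_s: float, num_flows: int) -> float:
    return bandwidth_bps * rtt_s * num_flows / 8  # convert bits to bytes

upper_bound = buffer_bytes(10e9, 0.001, 100)
print(f"Upper-bound buffer: {upper_bound / 1e6:.0f} MB")     # ~125 MB
# Commonly cited estimate for many desynchronized flows: BDP / sqrt(N)
reduced = 10e9 * 0.001 / (100 ** 0.5) / 8
print(f"BDP/sqrt(N) estimate: {reduced / 1e6:.3f} MB")
```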
Queue Structures
• Single Queue: Simple but no differentiation • Multi-Queue: Separate queues by traffic class • Virtual Output Queuing (VOQ): Per-destination queuing, eliminates head-of-line blocking • Priority + Weighted Fair: Real-time queue + proportional sharing
Monitoring and Tuning Workflow
Tools: perfSONAR, iPerf, TWAMP, vendor-specific analytics (Cisco DNA, Arista CloudVision)
Optimization can only extract potential that exists in the topology. A bus network cannot be optimized to perform like a mesh. A three-tier hierarchy cannot match spine-leaf east-west performance. When optimization hits topology limits, the only solution is topology change.
Network performance is the lived experience of your topology decisions. Latency, throughput, jitter, and congestion response all flow directly from the topology you choose. Understanding these relationships enables performance-aware design that meets application requirements.
You now understand how network topology determines performance characteristics. You can analyze latency components, calculate throughput capacity, evaluate jitter suitability, predict congestion behavior, implement topology-appropriate QoS, and optimize within topology constraints. The final page explores selection criteria—how to synthesize cost, reliability, scalability, and performance into informed topology decisions.