Arp - Learning Module | OneNoughtOne

Loading content...

0/228

ARP Cache

The Performance Imperative

Without the ARP cache, every single IP packet destined for a local host would require a broadcast and reply cycle—introducing latency, consuming bandwidth, and overwhelming the network with redundant traffic. The ARP cache is what transforms ARP from a constant overhead into an occasional lookup.

The cache is deceptively simple in concept: it's just a table mapping IP addresses to MAC addresses. But its implementation involves sophisticated data structures, careful timing policies, multiple entry states, and security considerations. A well-managed ARP cache is invisible; a corrupted or stale cache causes mysterious connectivity failures that frustrate even experienced engineers.

What You Will Learn

By the end of this page, you will understand how operating systems structure and manage their ARP caches, the different entry states and their transitions, aging and expiration policies, how to inspect and manipulate the cache, and the tradeoffs involved in cache timeout tuning.

ARP Cache Purpose and Structure

The ARP cache (also called the ARP table, neighbor cache, or ARP resolution cache) is a local database maintained by each host's operating system. It stores the results of successful ARP resolutions, allowing future packets to the same destination to be transmitted immediately without waiting for new ARP exchanges.

Fundamental Cache Properties:

ARP Cache Characteristics

•Per-Interface: Each network interface has its own ARP cache. A multi-homed host maintains separate caches for each NIC.
•Temporary Entries: Entries expire after a timeout period. There's no permanent storage—the cache is rebuilt after each reboot.
•Dynamic Learning: Entries are created automatically from ARP reply processing, not manually configured (though static entries are possible).
•Limited Size: Operating systems cap cache size. Rarely-used entries may be evicted under memory pressure.
•State Machine: Entries transition through multiple states (incomplete, reachable, stale, etc.) based on confirmation and aging.

Basic Cache Entry Structure:

Each cache entry typically contains:

Field	Description
IP Address	The Layer 3 address that was resolved
MAC Address	The corresponding Layer 2 hardware address
Interface	Which NIC the entry applies to
State	Current entry state (complete, incomplete, stale, etc.)
Timestamp	When the entry was created or last confirmed
Expiration	When the entry becomes invalid
Flags	Static/dynamic, publish, etc.

Cache vs. Table Terminology

The terms 'ARP cache' and 'ARP table' are often used interchangeably. Technically, 'cache' emphasizes the temporary, performance-oriented nature, while 'table' focuses on the data structure. Windows documentation tends to use 'table'; Linux/Unix documentation prefers 'cache' or 'neighbor cache.'

Cache Entry States

Modern operating systems implement ARP cache entries as state machines. Rather than simple present/absent, entries transition through multiple states that affect how they're used and refreshed.

Linux Neighbor Cache States (also applicable to IPv6 NDP):

ARP Cache Entry States (Linux)
State	Description	Entry Usable?	Next Transition
NONE	No entry exists for this IP	No	→ INCOMPLETE when resolution starts
INCOMPLETE	ARP request sent, awaiting reply	No (packets queued)	→ REACHABLE (on reply) or FAILED (on timeout)
REACHABLE	Entry confirmed valid, actively used	Yes	→ STALE (after reachable timeout)
STALE	Entry aged, validity unconfirmed	Yes (with caveats)	→ DELAY when used, or removed
DELAY	Stale entry used, confirmation pending	Yes	→ REACHABLE (confirmed) or PROBE (timeout)
PROBE	Sending ARP to verify entry still valid	Yes	→ REACHABLE (reply) or FAILED (no reply)
FAILED	Resolution failed, entry invalid	No	Entry removed after brief period

Converting Mermaid diagram...

Key Concepts:

REACHABLE vs. Stale: An entry is REACHABLE when the operating system has recent evidence that the entry is correct. This evidence can come from:

The initial ARP reply
Upper-layer confirmations (e.g., TCP ACKs indicate the host is reachable)
Subsequent ARP replies

Without confirmation, entries transition to STALE. Stale entries are still usable, but the system knows it should verify them soon.

The DELAY State: When a STALE entry is used, it doesn't immediately trigger ARP. Instead, the system enters DELAY, expecting upper-layer confirmation (like a TCP ACK). This optimization avoids ARP traffic when TCP already confirms reachability. If no confirmation arrives within the delay timeout, the system PROBES with a direct unicast ARP (not broadcast).

The PROBE State: Probing sends ARP directly to the cached MAC address (unicast, not broadcast). If the host is still there with the same MAC, it replies. This is more efficient than broadcast re-resolution.

Windows Simplification

Windows uses a simpler model: entries are either 'dynamic' (learned) or 'static' (manually configured), with a single reachable timeout. It doesn't expose the STALE/DELAY/PROBE states to administrators, though similar logic exists internally.

Cache Timeout Policies

ARP cache timeouts balance two competing concerns:

Performance: Longer timeouts mean fewer ARP broadcasts, lower overhead
Correctness: Shorter timeouts mean faster detection of changed mappings

Different operating systems and environments choose different balances.

Default ARP Cache Timeouts by Operating System
Operating System	Default Reachable Timeout	Default Stale Behavior	Maximum Entries
Linux (default)	30 seconds (base_reachable_time)	Transitions to STALE, may linger	1024 per interface (tunable)
Windows 10/11	15-45 seconds (random within range)	Re-resolved on use after timeout	Dynamic, typically thousands
macOS	20 minutes	Hard expiration	Dynamic
FreeBSD	20 minutes	Similar to macOS	Dynamic
Cisco IOS	4 hours	Long-lived by default	Configurable

Linux Timeout Parameters (sysctl tunables):

# View current settings
sysctl net.ipv4.neigh.default.base_reachable_time_ms
sysctl net.ipv4.neigh.default.gc_stale_time
sysctl net.ipv4.neigh.default.delay_first_probe_time

# Typical defaults:
# base_reachable_time_ms = 30000 (30 seconds)
# gc_stale_time = 60 (seconds, when stale entries can be garbage collected)
# delay_first_probe_time = 5 (seconds before probing a stale entry)

The Randomization Factor:

To prevent synchronized cache expirations (which would cause ARP broadcast storms as many hosts simultaneously re-resolve), operating systems randomize actual timeout values around the base timeout:

Linux: actual timeout = base_reachable_time × random(0.5 to 1.5)
Windows: random between ArpCacheMinReferencedLife and ArpCacheMaxReferencedLife

This spreads cache expirations over time, smoothing network load.

Timeout Tradeoffs in Practice

Shorter timeouts detect MAC changes faster but increase ARP traffic. In hypervisor environments where VMs migrate frequently, short timeouts (seconds) are critical. In stable server networks, longer timeouts (minutes) reduce overhead. The 'right' value depends entirely on your environment's dynamism.

Viewing the ARP Cache

Every network professional must know how to inspect the ARP cache on various platforms. This is fundamental troubleshooting.

Linux:

Linux ARP Commands
1
2
3
4
5
6
7
8
9
10
11
12
13
14
# Traditional command (deprecated but ubiquitous)
$ arp -a
router (192.168.1.1) at a0:b1:c2:d3:e4:f5 [ether] on eth0
webserver (192.168.1.50) at 00:11:22:33:44:55 [ether] on eth0
 
# Modern command (iproute2 - recommended)
$ ip neigh show
192.168.1.1 dev eth0 lladdr a0:b1:c2:d3:e4:f5 REACHABLE
192.168.1.50 dev eth0 lladdr 00:11:22:33:44:55 STALE
192.168.1.200 dev eth0 FAILED
 
# Detailed view with timestamps
$ ip -s neigh show
192.168.1.1 dev eth0 lladdr a0:b1:c2:d3:e4:f5 ref 4 used 0/0/0 probes 0 REACHABLE

Windows:

Windows ARP Commands
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
# Command Prompt
C:\> arp -a
 
Interface: 192.168.1.100 --- 0x4
  Internet Address      Physical Address      Type
  192.168.1.1           a0-b1-c2-d3-e4-f5     dynamic
  192.168.1.50          00-11-22-33-44-55     dynamic
  192.168.1.255         ff-ff-ff-ff-ff-ff     static
  224.0.0.22            01-00-5e-00-00-16     static
 
# PowerShell (more detailed)
PS C:\> Get-NetNeighbor | Format-Table -AutoSize
 
ifIndex IPAddress       LinkLayerAddress      State
------- ---------       ----------------      -----
4       192.168.1.1     A0-B1-C2-D3-E4-F5     Reachable
4       192.168.1.50    00-11-22-33-44-55     Stale
4       192.168.1.255   FF-FF-FF-FF-FF-FF     Permanent

macOS:

macOS ARP Commands
1
2
3
4
5
6
7
# Standard command
$ arp -a
router (192.168.1.1) at a0:b1:c2:d3:e4:f5 on en0 ifscope [ethernet]
webserver (192.168.1.50) at 00:11:22:33:44:55 on en0 ifscope [ethernet]
 
# More detailed with interface specification
$ arp -a -i en0

Interpreting Cache Output

When troubleshooting, look for: (1) Missing entries for devices that should be reachable, (2) FAILED/INCOMPLETE entries indicating resolution problems, (3) Wrong MAC addresses that might indicate ARP spoofing, (4) Stale entries for hosts that have moved or changed NICs.

Manipulating the ARP Cache

Sometimes administrators need to manually modify the ARP cache—adding static entries for security, deleting stale entries for troubleshooting, or flushing the entire cache to force re-resolution.

Adding Static Entries:

Static entries never expire and take precedence over dynamic ones. They're useful for:

Protecting critical hosts (gateways, DNS servers) from ARP spoofing
Ensuring consistent mappings in high-availability configurations
Working around hosts that don't respond to ARP properly

Adding Static ARP Entries
1
2
3
4
5
6
7
8
9
10
11
12
# Linux (requires root)
$ sudo arp -s 192.168.1.1 a0:b1:c2:d3:e4:f5
# Or using ip command
$ sudo ip neigh add 192.168.1.1 lladdr a0:b1:c2:d3:e4:f5 nud permanent dev eth0
 
# Windows (requires Administrator)
C:\> arp -s 192.168.1.1 a0-b1-c2-d3-e4-f5
# Or using netsh
C:\> netsh interface ipv4 add neighbors "Ethernet" 192.168.1.1 a0-b1-c2-d3-e4-f5
 
# macOS (requires root)
$ sudo arp -s 192.168.1.1 a0:b1:c2:d3:e4:f5 permanent

Deleting Entries:

Deleting ARP Entries
1
2
3
4
5
6
7
8
9
10
11
12
13
14
# Linux
$ sudo arp -d 192.168.1.50
# Or
$ sudo ip neigh del 192.168.1.50 dev eth0
 
# Windows
C:\> arp -d 192.168.1.50
# Or delete all entries
C:\> arp -d *
# Or using netsh
C:\> netsh interface ipv4 delete neighbors "Ethernet" 192.168.1.50
 
# macOS
$ sudo arp -d 192.168.1.50

Flushing the Entire Cache:

Complete cache flush forces all entries to be re-resolved—useful when troubleshooting widespread connectivity issues or after significant network changes.

Flushing ARP Cache

# Linux - Flush all neighbor entries
$ sudo ip neigh flush all
 
# Windows - Delete all entries from specific interface
C:\> netsh interface ipv4 delete arpcache
 
# macOS - Delete all entries
$ sudo arp -a -d

Cache Flush Impact

Flushing the ARP cache causes temporary connectivity impact. Every subsequent packet will trigger ARP resolution, introducing latency until caches repopulate. On busy servers, this can cause noticeable delays. Plan cache flushes during maintenance windows when possible.

Static vs. Dynamic Entries

The distinction between static and dynamic entries is fundamental to ARP cache management. Each has specific use cases and tradeoffs.

Dynamic Entries

•Created automatically from ARP replies
•Expire after timeout (OS-dependent)
•Adapt to changes — new MAC is learned on re-resolution
•Zero configuration — works out of the box
•Majority of entries in typical networks

Static Entries

•Manually configured by administrator
•Never expire (persist until removed or reboot)
•Resist spoofing — can't be overwritten by ARP replies
•Require maintenance when MAC addresses change
•Special use cases — security-critical hosts

When to Use Static Entries:

1. Gateway/Router Protection

The default gateway is the most critical ARP entry. If an attacker spoofs the gateway's MAC, they can intercept all external traffic. Static entries for gateways prevent this:

# Linux: Protect gateway from spoofing
sudo ip neigh add 192.168.1.1 lladdr a0:b1:c2:d3:e4:f5 nud permanent dev eth0

2. Critical Server Protection

DNS servers, domain controllers, and other infrastructure should have static entries on client machines to prevent redirection attacks.

3. High-Availability Pairs

When two servers share a virtual IP and fail over between them, static entries ensure the MAC address handoff is controlled, not subject to racing ARP replies.

4. Embedded Systems

Devices with fixed network topology benefit from static entries that survive ARP cache timeout cycles.

Persistence Across Reboots

By default, static ARP entries are lost on reboot. To make them persistent, add the commands to startup scripts (rc.local on Linux, Task Scheduler on Windows) or use configuration management tools (Ansible, Puppet, GPO). Some systems support /etc/ethers or similar files for persistent entries.

Cache Data Structures

Operating systems implement ARP caches using efficient data structures optimized for fast lookups. Understanding these internals helps explain behavior and performance characteristics.

Common Implementation Approaches:

ARP Cache Data Structure Implementations
Approach	Lookup Time	Memory Usage	Used By
Hash Table	O(1) average	Moderate (hash buckets)	Linux (neighbor cache), Windows
Radix Tree / Trie	O(k) where k = key length	Efficient for sparse data	Some BSD variants
Simple Array	O(n) linear scan	Minimal overhead	Simple embedded systems
LRU Cache + Hash	O(1) with bounded size	Fixed, predictable	Memory-constrained systems

Linux Neighbor Cache Implementation:

Linux uses a hash table with configurable bucket count. Each bucket contains a linked list of entries that hash to that bucket. Key parameters:

# View neighbor table size parameters
sysctl net.ipv4.neigh.default.gc_thresh1  # Minimum entries before GC
sysctl net.ipv4.neigh.default.gc_thresh2  # Soft limit (GC more aggressive)
sysctl net.ipv4.neigh.default.gc_thresh3  # Hard limit (refuse new entries)

# Typical defaults:
# gc_thresh1 = 128   (don't GC below this)
# gc_thresh2 = 512   (start considering GC)
# gc_thresh3 = 1024  (hard maximum)

Garbage Collection (GC):

When the cache grows beyond thresholds, garbage collection removes entries based on:

Age: Oldest stale entries removed first
Usage: Least recently used entries are candidates
State: FAILED and old STALE entries removed before REACHABLE

GC runs periodically (gc_interval, typically 30 seconds) or when hard limits are reached.

Cache Exhaustion

In networks with many hosts (thousands), hitting gc_thresh3 causes new ARP entries to fail. Symptoms include intermittent connectivity and 'neighbour table overflow' kernel messages. Increase thresholds for large networks or implement network segmentation to reduce broadcast domain size.

Cache Coherency Challenges

The ARP cache is a local data structure with no distributed coordination. This creates coherency challenges when network conditions change.

Problem 1: MAC Address Changes

When a host's MAC address changes (NIC replacement, VM migration, failover), cached entries across the network become stale. Until caches expire or are updated, traffic may be misdirected.

Solutions:

Gratuitous ARP announces the new mapping
Shorter cache timeouts accelerate discovery
Some systems detect link-layer failures and invalidate entries

Problem 2: Host Movement (VM Migration)

Virtual machines can migrate between physical hosts, changing their Layer 2 location while keeping the same MAC/IP. Switches must update their MAC tables, and peers may need cache updates.

Solutions:

RARP/GARP upon migration completion
VMware vMotion sends GARP after migration
SDN controllers can push updates to all switches

Common Cache Coherency Problems

•Stale Cache After Failover: Active-passive cluster failover completes, but clients still send to the old active's MAC for up to 20 minutes (cache timeout). Fix: Proper GARP from new active.
•VM Cloning Issues: Cloned VM retains the same MAC as original. Both have same IP in different caches. Chaos ensues. Fix: Ensure unique MAC assignment on clone.
•ARP Race After Power Outage: Multiple hosts boot simultaneously, all sending GARP. Switches receive conflicting updates. Fix: Stagger device boot or use spanning tree to delay forwarding.
•DHCP IP Reuse: Host A releases IP, Host B gets same IP. Caches for old Host A persist with wrong MAC. Fix: Ensure GARP on DHCP lease acquisition or use short ARP timeouts.

Designing for Cache Coherency

In environments with frequent changes (cloud, containers, VMs), shorter ARP timeouts trade broadcast overhead for faster coherency. Data center networks often use 30-60 second timeouts rather than the 20-minute defaults of traditional workstations.

Summary: The ARP Cache

The ARP cache transforms ARP from a constant overhead into occasional lookups. Understanding its structure, states, and management is essential for network troubleshooting and optimization. Let's consolidate the key takeaways:

Key Takeaways

•Caching enables efficiency — Without cached mappings, every packet would require ARP broadcast overhead.
•Entries have states — Modern OSes track INCOMPLETE, REACHABLE, STALE, DELAY, PROBE, and FAILED states with defined transitions.
•Timeouts balance performance and correctness — Longer timeouts reduce traffic; shorter timeouts detect changes faster.
•Static entries provide security — Manually configured entries resist spoofing but require maintenance.
•Cache manipulation is essential troubleshooting — Viewing, adding, deleting, and flushing entries diagnoses connectivity issues.
•Data structures matter at scale — Hash tables provide O(1) lookups; garbage collection prevents memory exhaustion.
•Coherency is challenging — Distributed caches with no coordination create stale entry problems during network changes.

What's Next:

With the cache understood, we'll dive deeper into the ARP Request/Reply cycle—examining the specific packet formats, Ethernet frame construction, and the nuances of how requests and replies are matched and processed.

Page Complete

You now have comprehensive knowledge of the ARP cache. From state machines to garbage collection, you understand how operating systems store and manage IP-to-MAC mappings. Next, we'll examine the ARP request/reply packets themselves in precise detail.

ARP Cache

The Performance Imperative

What You Will Learn

ARP Cache Purpose and Structure

Fundamental Cache Properties:

ARP Cache Characteristics

•Per-Interface: Each network interface has its own ARP cache. A multi-homed host maintains separate caches for each NIC.
•Temporary Entries: Entries expire after a timeout period. There's no permanent storage—the cache is rebuilt after each reboot.
•Dynamic Learning: Entries are created automatically from ARP reply processing, not manually configured (though static entries are possible).
•Limited Size: Operating systems cap cache size. Rarely-used entries may be evicted under memory pressure.
•State Machine: Entries transition through multiple states (incomplete, reachable, stale, etc.) based on confirmation and aging.

Basic Cache Entry Structure:

Each cache entry typically contains:

Field	Description
IP Address	The Layer 3 address that was resolved
MAC Address	The corresponding Layer 2 hardware address
Interface	Which NIC the entry applies to
State	Current entry state (complete, incomplete, stale, etc.)
Timestamp	When the entry was created or last confirmed
Expiration	When the entry becomes invalid
Flags	Static/dynamic, publish, etc.

Cache vs. Table Terminology

Cache Entry States

Modern operating systems implement ARP cache entries as state machines. Rather than simple present/absent, entries transition through multiple states that affect how they're used and refreshed.

Linux Neighbor Cache States (also applicable to IPv6 NDP):

ARP Cache Entry States (Linux)
State	Description	Entry Usable?	Next Transition
NONE	No entry exists for this IP	No	→ INCOMPLETE when resolution starts
INCOMPLETE	ARP request sent, awaiting reply	No (packets queued)	→ REACHABLE (on reply) or FAILED (on timeout)
REACHABLE	Entry confirmed valid, actively used	Yes	→ STALE (after reachable timeout)
STALE	Entry aged, validity unconfirmed	Yes (with caveats)	→ DELAY when used, or removed
DELAY	Stale entry used, confirmation pending	Yes	→ REACHABLE (confirmed) or PROBE (timeout)
PROBE	Sending ARP to verify entry still valid	Yes	→ REACHABLE (reply) or FAILED (no reply)
FAILED	Resolution failed, entry invalid	No	Entry removed after brief period

Converting Mermaid diagram...

Key Concepts:

REACHABLE vs. Stale: An entry is REACHABLE when the operating system has recent evidence that the entry is correct. This evidence can come from:

The initial ARP reply
Upper-layer confirmations (e.g., TCP ACKs indicate the host is reachable)
Subsequent ARP replies

Without confirmation, entries transition to STALE. Stale entries are still usable, but the system knows it should verify them soon.

Windows Simplification

Cache Timeout Policies

ARP cache timeouts balance two competing concerns:

Performance: Longer timeouts mean fewer ARP broadcasts, lower overhead
Correctness: Shorter timeouts mean faster detection of changed mappings

Different operating systems and environments choose different balances.

Default ARP Cache Timeouts by Operating System
Operating System	Default Reachable Timeout	Default Stale Behavior	Maximum Entries
Linux (default)	30 seconds (base_reachable_time)	Transitions to STALE, may linger	1024 per interface (tunable)
Windows 10/11	15-45 seconds (random within range)	Re-resolved on use after timeout	Dynamic, typically thousands
macOS	20 minutes	Hard expiration	Dynamic
FreeBSD	20 minutes	Similar to macOS	Dynamic
Cisco IOS	4 hours	Long-lived by default	Configurable

Linux Timeout Parameters (sysctl tunables):

# View current settings
sysctl net.ipv4.neigh.default.base_reachable_time_ms
sysctl net.ipv4.neigh.default.gc_stale_time
sysctl net.ipv4.neigh.default.delay_first_probe_time

# Typical defaults:
# base_reachable_time_ms = 30000 (30 seconds)
# gc_stale_time = 60 (seconds, when stale entries can be garbage collected)
# delay_first_probe_time = 5 (seconds before probing a stale entry)

The Randomization Factor:

To prevent synchronized cache expirations (which would cause ARP broadcast storms as many hosts simultaneously re-resolve), operating systems randomize actual timeout values around the base timeout:

Linux: actual timeout = base_reachable_time × random(0.5 to 1.5)
Windows: random between ArpCacheMinReferencedLife and ArpCacheMaxReferencedLife

This spreads cache expirations over time, smoothing network load.

Timeout Tradeoffs in Practice

Viewing the ARP Cache

Every network professional must know how to inspect the ARP cache on various platforms. This is fundamental troubleshooting.

Linux:

Linux ARP Commands
1
2
3
4
5
6
7
8
9
10
11
12
13
14
# Traditional command (deprecated but ubiquitous)
$ arp -a
router (192.168.1.1) at a0:b1:c2:d3:e4:f5 [ether] on eth0
webserver (192.168.1.50) at 00:11:22:33:44:55 [ether] on eth0
 
# Modern command (iproute2 - recommended)
$ ip neigh show
192.168.1.1 dev eth0 lladdr a0:b1:c2:d3:e4:f5 REACHABLE
192.168.1.50 dev eth0 lladdr 00:11:22:33:44:55 STALE
192.168.1.200 dev eth0 FAILED
 
# Detailed view with timestamps
$ ip -s neigh show
192.168.1.1 dev eth0 lladdr a0:b1:c2:d3:e4:f5 ref 4 used 0/0/0 probes 0 REACHABLE

Windows:

Windows ARP Commands
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
# Command Prompt
C:\> arp -a
 
Interface: 192.168.1.100 --- 0x4
  Internet Address      Physical Address      Type
  192.168.1.1           a0-b1-c2-d3-e4-f5     dynamic
  192.168.1.50          00-11-22-33-44-55     dynamic
  192.168.1.255         ff-ff-ff-ff-ff-ff     static
  224.0.0.22            01-00-5e-00-00-16     static
 
# PowerShell (more detailed)
PS C:\> Get-NetNeighbor | Format-Table -AutoSize
 
ifIndex IPAddress       LinkLayerAddress      State
------- ---------       ----------------      -----
4       192.168.1.1     A0-B1-C2-D3-E4-F5     Reachable
4       192.168.1.50    00-11-22-33-44-55     Stale
4       192.168.1.255   FF-FF-FF-FF-FF-FF     Permanent

macOS:

macOS ARP Commands
1
2
3
4
5
6
7
# Standard command
$ arp -a
router (192.168.1.1) at a0:b1:c2:d3:e4:f5 on en0 ifscope [ethernet]
webserver (192.168.1.50) at 00:11:22:33:44:55 on en0 ifscope [ethernet]
 
# More detailed with interface specification
$ arp -a -i en0

Interpreting Cache Output

Manipulating the ARP Cache

Sometimes administrators need to manually modify the ARP cache—adding static entries for security, deleting stale entries for troubleshooting, or flushing the entire cache to force re-resolution.

Adding Static Entries:

Static entries never expire and take precedence over dynamic ones. They're useful for:

Protecting critical hosts (gateways, DNS servers) from ARP spoofing
Ensuring consistent mappings in high-availability configurations
Working around hosts that don't respond to ARP properly

Adding Static ARP Entries
1
2
3
4
5
6
7
8
9
10
11
12
# Linux (requires root)
$ sudo arp -s 192.168.1.1 a0:b1:c2:d3:e4:f5
# Or using ip command
$ sudo ip neigh add 192.168.1.1 lladdr a0:b1:c2:d3:e4:f5 nud permanent dev eth0
 
# Windows (requires Administrator)
C:\> arp -s 192.168.1.1 a0-b1-c2-d3-e4-f5
# Or using netsh
C:\> netsh interface ipv4 add neighbors "Ethernet" 192.168.1.1 a0-b1-c2-d3-e4-f5
 
# macOS (requires root)
$ sudo arp -s 192.168.1.1 a0:b1:c2:d3:e4:f5 permanent

Deleting Entries:

Deleting ARP Entries
1
2
3
4
5
6
7
8
9
10
11
12
13
14
# Linux
$ sudo arp -d 192.168.1.50
# Or
$ sudo ip neigh del 192.168.1.50 dev eth0
 
# Windows
C:\> arp -d 192.168.1.50
# Or delete all entries
C:\> arp -d *
# Or using netsh
C:\> netsh interface ipv4 delete neighbors "Ethernet" 192.168.1.50
 
# macOS
$ sudo arp -d 192.168.1.50

Flushing the Entire Cache:

Complete cache flush forces all entries to be re-resolved—useful when troubleshooting widespread connectivity issues or after significant network changes.

Flushing ARP Cache

# Linux - Flush all neighbor entries
$ sudo ip neigh flush all
 
# Windows - Delete all entries from specific interface
C:\> netsh interface ipv4 delete arpcache
 
# macOS - Delete all entries
$ sudo arp -a -d

Cache Flush Impact

Static vs. Dynamic Entries

The distinction between static and dynamic entries is fundamental to ARP cache management. Each has specific use cases and tradeoffs.

Dynamic Entries

•Created automatically from ARP replies
•Expire after timeout (OS-dependent)
•Adapt to changes — new MAC is learned on re-resolution
•Zero configuration — works out of the box
•Majority of entries in typical networks

Static Entries

•Manually configured by administrator
•Never expire (persist until removed or reboot)
•Resist spoofing — can't be overwritten by ARP replies
•Require maintenance when MAC addresses change
•Special use cases — security-critical hosts

When to Use Static Entries:

1. Gateway/Router Protection

The default gateway is the most critical ARP entry. If an attacker spoofs the gateway's MAC, they can intercept all external traffic. Static entries for gateways prevent this:

# Linux: Protect gateway from spoofing
sudo ip neigh add 192.168.1.1 lladdr a0:b1:c2:d3:e4:f5 nud permanent dev eth0

2. Critical Server Protection

DNS servers, domain controllers, and other infrastructure should have static entries on client machines to prevent redirection attacks.

3. High-Availability Pairs

When two servers share a virtual IP and fail over between them, static entries ensure the MAC address handoff is controlled, not subject to racing ARP replies.

4. Embedded Systems

Devices with fixed network topology benefit from static entries that survive ARP cache timeout cycles.

Persistence Across Reboots

Cache Data Structures

Operating systems implement ARP caches using efficient data structures optimized for fast lookups. Understanding these internals helps explain behavior and performance characteristics.

Common Implementation Approaches:

ARP Cache Data Structure Implementations
Approach	Lookup Time	Memory Usage	Used By
Hash Table	O(1) average	Moderate (hash buckets)	Linux (neighbor cache), Windows
Radix Tree / Trie	O(k) where k = key length	Efficient for sparse data	Some BSD variants
Simple Array	O(n) linear scan	Minimal overhead	Simple embedded systems
LRU Cache + Hash	O(1) with bounded size	Fixed, predictable	Memory-constrained systems

Linux Neighbor Cache Implementation:

Linux uses a hash table with configurable bucket count. Each bucket contains a linked list of entries that hash to that bucket. Key parameters:

# View neighbor table size parameters
sysctl net.ipv4.neigh.default.gc_thresh1  # Minimum entries before GC
sysctl net.ipv4.neigh.default.gc_thresh2  # Soft limit (GC more aggressive)
sysctl net.ipv4.neigh.default.gc_thresh3  # Hard limit (refuse new entries)

# Typical defaults:
# gc_thresh1 = 128   (don't GC below this)
# gc_thresh2 = 512   (start considering GC)
# gc_thresh3 = 1024  (hard maximum)

Garbage Collection (GC):

When the cache grows beyond thresholds, garbage collection removes entries based on:

Age: Oldest stale entries removed first
Usage: Least recently used entries are candidates
State: FAILED and old STALE entries removed before REACHABLE

GC runs periodically (gc_interval, typically 30 seconds) or when hard limits are reached.

Cache Exhaustion

Cache Coherency Challenges

The ARP cache is a local data structure with no distributed coordination. This creates coherency challenges when network conditions change.

Problem 1: MAC Address Changes

When a host's MAC address changes (NIC replacement, VM migration, failover), cached entries across the network become stale. Until caches expire or are updated, traffic may be misdirected.

Solutions:

Gratuitous ARP announces the new mapping
Shorter cache timeouts accelerate discovery
Some systems detect link-layer failures and invalidate entries

Problem 2: Host Movement (VM Migration)

Virtual machines can migrate between physical hosts, changing their Layer 2 location while keeping the same MAC/IP. Switches must update their MAC tables, and peers may need cache updates.

Solutions:

RARP/GARP upon migration completion
VMware vMotion sends GARP after migration
SDN controllers can push updates to all switches

Common Cache Coherency Problems

•Stale Cache After Failover: Active-passive cluster failover completes, but clients still send to the old active's MAC for up to 20 minutes (cache timeout). Fix: Proper GARP from new active.
•VM Cloning Issues: Cloned VM retains the same MAC as original. Both have same IP in different caches. Chaos ensues. Fix: Ensure unique MAC assignment on clone.
•ARP Race After Power Outage: Multiple hosts boot simultaneously, all sending GARP. Switches receive conflicting updates. Fix: Stagger device boot or use spanning tree to delay forwarding.
•DHCP IP Reuse: Host A releases IP, Host B gets same IP. Caches for old Host A persist with wrong MAC. Fix: Ensure GARP on DHCP lease acquisition or use short ARP timeouts.

Designing for Cache Coherency

Summary: The ARP Cache

Key Takeaways

•Caching enables efficiency — Without cached mappings, every packet would require ARP broadcast overhead.
•Entries have states — Modern OSes track INCOMPLETE, REACHABLE, STALE, DELAY, PROBE, and FAILED states with defined transitions.
•Timeouts balance performance and correctness — Longer timeouts reduce traffic; shorter timeouts detect changes faster.
•Static entries provide security — Manually configured entries resist spoofing but require maintenance.
•Cache manipulation is essential troubleshooting — Viewing, adding, deleting, and flushing entries diagnoses connectivity issues.
•Data structures matter at scale — Hash tables provide O(1) lookups; garbage collection prevents memory exhaustion.
•Coherency is challenging — Distributed caches with no coordination create stale entry problems during network changes.

What's Next:

Page Complete