Loading learning content...
Before a single byte of user data traverses an Ethernet network, a fundamental challenge must be addressed: How does the receiving device know when a frame begins?
In the electrical realm of networks, the physical medium carries nothing more than voltage fluctuations—a continuous stream of energy states that the receiver must interpret as meaningful binary data. Without explicit synchronization, the receiver cannot distinguish the start of a new frame from idle line noise, mid-transmission data, or the end of a previous frame.
This is the problem that the Preamble and Start Frame Delimiter (SFD) were designed to solve—and their elegant solution has remained essentially unchanged since Ethernet's invention in the 1970s.
By the end of this page, you will understand the precise bit-level structure of the Preamble and SFD, the physics and engineering rationale behind their design, how clock synchronization is achieved through Manchester encoding, and the evolution of these fields across Ethernet generations from 10 Mbps to 100 Gbps.
To understand the Preamble and SFD, we must first appreciate the physical layer environment in which they operate. Ethernet frames do not simply appear fully-formed at the receiver—they must be reconstructed from raw electrical or optical signals.
The Signal Recovery Problem
When an Ethernet transmitter sends data, it converts binary bits into physical signals—voltage levels on copper wire, or light pulses on optical fiber. The receiver must perform the inverse operation: observe physical signals and recover the original bit stream.
This process is complicated by several factors:
The Role of the Preamble
The Preamble addresses these challenges by providing a known, predictable bit pattern that the receiver can use to:
Without these preparatory bits, the receiver would attempt to interpret actual data before its clock recovery circuits had stabilized, resulting in random bit errors at the start of every frame.
Original Ethernet (10BASE5, 1980) used thick coaxial cable with vampire taps, where multiple transceivers shared a single cable segment. In this broadcast medium, any transceiver could begin transmitting at any time. The Preamble was essential because receivers had no advance warning that a frame was incoming—they had to detect and synchronize to the signal in real-time.
The Ethernet Preamble consists of exactly 7 bytes (56 bits) in a specific, carefully chosen pattern:
10101010 10101010 10101010 10101010 10101010 10101010 10101010
Each byte has the hexadecimal value 0xAA. But why this particular pattern?
The Alternating Bit Pattern
The alternating 1-0-1-0 pattern is not arbitrary—it is the optimal waveform for clock recovery. When transmitted using Manchester encoding (used in 10 Mbps Ethernet), this pattern produces a continuous sequence of transitions at the maximum possible rate:
No other bit pattern provides as many regular transitions per unit time. A sequence of all 1s or all 0s would produce no transitions at all (after the initial edge), making clock recovery impossible.
| Byte Position | Binary Value | Hex Value | Purpose |
|---|---|---|---|
| Byte 1 | 10101010 | 0xAA | Initial clock acquisition |
| Byte 2 | 10101010 | 0xAA | PLL frequency locking |
| Byte 3 | 10101010 | 0xAA | Equalizer settling |
| Byte 4 | 10101010 | 0xAA | AGC stabilization |
| Byte 5 | 10101010 | 0xAA | Baseline wander recovery |
| Byte 6 | 10101010 | 0xAA | Fine clock adjustment |
| Byte 7 | 10101010 | 0xAA | Final synchronization |
Ethernet transmits the least significant bit (LSB) first. So the pattern 10101010 in transmission order means the first bit on the wire is '1', then '0', then '1', and so on. When viewing this as a hexadecimal byte value, the LSB-first transmission of 10101010 equals 0xAA (where bit 0, transmitted first, is '0', and bit 7, transmitted last, is '1' in the storage convention).
Why Exactly 7 Bytes?
The 56-bit Preamble length was determined empirically by Ethernet's designers at Xerox PARC in the 1970s. The length provides sufficient time for:
Modern Ethernet receivers can lock much faster than 1970s hardware, but the Preamble length remains at 7 bytes for backward compatibility and to account for worst-case scenarios.
The Preamble's effectiveness is intimately tied to the line coding scheme used. In original 10 Mbps Ethernet, Manchester encoding was employed, which embeds the clock signal within the data stream.
How Manchester Encoding Works
In Manchester encoding, each bit period is divided into two halves, and the bit value is encoded as a transition at the midpoint:
This guarantees at least one transition per bit, regardless of the data pattern, ensuring that clock information is always present in the signal.
| Data Bit | First Half | Second Half | Mid-Bit Transition |
|---|---|---|---|
| 0 | High (+) | Low (-) | Falling edge (↓) |
| 1 | Low (-) | High (+) | Rising edge (↑) |
Preamble in Manchester Encoding
When the alternating 10101010 Preamble pattern is transmitted using Manchester encoding, the result is a sequence of transitions at every bit boundary AND every mid-bit point. The waveform has the maximum transition density possible:
Bit: 1 0 1 0 1 0 1 0
┌──┐ ┌──┐ ┌──┐ ┌──┐ ┌──┐ ┌──┐ ┌──┐ ┌──┐
Manchester: ┘ └─┘ └─┘ └─┘ └─┘ └─┘ └─┘ └─┘ └─
↑ ↓ ↑ ↓ ↑ ↓ ↑ ↓ ↑ ↓ ↑ ↓ ↑ ↓ ↑ ↓
This creates a steady 10 MHz square wave for 10 Mbps Ethernet (two transitions per 100ns bit period). The receiver's phase-locked loop sees this as a pure frequency reference and can rapidly lock onto the precise bit timing.
The Clock Recovery Circuit
A simplified clock recovery circuit for Manchester-encoded Ethernet consists of:
During the Preamble, the PLL has 56 bit periods of perfect reference signal to achieve lock. By the time actual frame data arrives (which may have fewer transitions), the clock is solidly synchronized.
After the Preamble ends, actual frame data may contain long runs of the same bit (e.g., addresses with many consecutive zeros). Manchester encoding guarantees at least one mid-bit transition per bit, but boundary transitions depend on consecutive bit values. This is why the PLL must acquire solid lock during the Preamble—it must maintain synchronization through subsequent low-transition-density data.
Following the 7-byte Preamble, a single byte marks the transition from synchronization to actual frame content. This is the Start Frame Delimiter (SFD), defined as:
10101011 (binary) = 0xAB (hexadecimal)
Distinguishing SFD from Preamble
The SFD is almost identical to the Preamble bytes—six of its eight bits are the same alternating pattern. Only the final two bits differ:
This subtle change—two consecutive '1' bits at the end—creates a distinctive pattern that the receiver can detect to identify the exact byte boundary where frame content begins.
| Field | Bit Pattern | Hex | Final 2 Bits | Purpose |
|---|---|---|---|---|
| Preamble (×7) | 10101010 | 0xAA | '10' | Clock synchronization |
| SFD (×1) | 10101011 | 0xAB | '11' | Frame boundary marker |
Why Two Consecutive 1s?
The choice of ending the SFD with '11' rather than continuing the '10' pattern is significant:
Unique Pattern — In the alternating Preamble sequence, two consecutive identical bits never occur. The '11' ending is therefore unambiguous.
Easy Detection — Hardware can detect the SFD by watching for the first occurrence of two same-polarity transitions in a row after the Preamble begins.
Single-Bit Uniqueness — Even if one Preamble bit is corrupted, the '11' pattern still stands out from the predominant '10' alternation.
Minimal Change — Changing only the last bit maintains maximum clock recovery performance throughout most of the Preamble+SFD sequence.
Frame Content Alignment
The SFD serves as the zero-reference point for frame content. Once the receiver detects the SFD pattern, it knows that:
The SFD essentially tells the receiver: "Synchronization is complete. Real data starts now."
In practice, the receiver's state machine monitors the incoming bit stream for the pattern '10101011'. Upon detection, it asserts an internal 'frame start' signal and begins capturing subsequent bits into the receive buffer. Any bits before this pattern are discarded as synchronization overhead.
In the IEEE 802.3 standard, the Preamble and SFD are sometimes described as a combined 8-byte (64-bit) unit at the beginning of each frame:
Bytes 1-7: 10101010 10101010 10101010 10101010 10101010 10101010 10101010
Byte 8: 10101011 (SFD)
However, it's important to understand the layering:
Both specifications describe the same bit sequence—the difference is purely definitional.
| Specification | Preamble Definition | SFD Definition | Total Bytes |
|---|---|---|---|
| DIX Ethernet (1980) | 8 bytes (includes SFD) | Part of Preamble | 8 |
| IEEE 802.3 (1985) | 7 bytes (0xAA × 7) | 1 byte (0xAB) | 8 |
| Modern Usage | 7 bytes | 1 byte | 8 |
Relationship to Frame Size Calculations
When calculating Ethernet frame sizes, it's crucial to understand what is and isn't counted:
| Component | Bytes | Counted in Frame Size? |
|---|---|---|
| Preamble | 7 | No |
| SFD | 1 | No |
| Destination MAC | 6 | Yes |
| Source MAC | 6 | Yes |
| Type/Length | 2 | Yes |
| Payload | 46-1500 | Yes |
| FCS | 4 | Sometimes |
The Preamble and SFD are considered Physical Layer overhead—they are transmitted on the wire but are not part of the MAC frame proper. Network analyzers typically do not display the Preamble/SFD because it is stripped by the receiving NIC before the frame reaches software.
Similarly, the 64-byte minimum frame size and 1518-byte maximum frame size (excluding VLANs) do not include the Preamble and SFD.
On the physical wire, an Ethernet frame transmission includes 8 bytes of Preamble+SFD, the MAC frame, and a 12-byte Inter-Packet Gap. A minimum-size frame (64 MAC bytes) actually occupies 84 bytes of line time (8 + 64 + 12). This distinction matters for bandwidth calculations and protocol timing.
As Ethernet evolved from 10 Mbps to 100 Gbps and beyond, the physical layer encoding changed dramatically. Manchester encoding gave way to more efficient schemes that pack more bits per symbol. Yet the Preamble and SFD concepts persist—adapted to each new physical layer.
10 Mbps Ethernet (10BASE-T, 10BASE5, 10BASE2)
100 Mbps Fast Ethernet (100BASE-TX)
1 Gbps Ethernet (1000BASE-T, 1000BASE-X)
| Speed | Encoding | Synchronization Mechanism | Frame Delimiter |
|---|---|---|---|
| 10 Mbps | Manchester | 56-bit alternating Preamble | SFD (0xAB) |
| 100 Mbps | 4B/5B + MLT-3 | Idle symbols (/I/) | J-K symbol pair |
| 1 Gbps | 8B/10B / PAM-5 | Continuous idle pattern | /S/ Start-of-Packet |
| 10 Gbps | 64B/66B | 66-bit sync headers | Start control block |
| 25/40/100 Gbps | 64B/66B / PAM-4 | Lane alignment markers | Start control block |
The Consistent Principle
Despite the changing terminology and encoding schemes, the underlying principle remains constant:
Higher-speed Ethernets simply implement these functions with more sophisticated signaling schemes optimized for greater bandwidth efficiency. The 10 Mbps Preamble used 8 bytes out of a minimum 72-byte transmission (11% overhead). Modern encoding schemes reduce this overhead significantly while maintaining reliability.
Multi-speed Ethernet NICs and switches must handle frames from devices operating at different speeds. Auto-negotiation determines the link speed, and the appropriate physical layer encoding is used. The MAC-layer frame format remains consistent—only the physical layer wrapping changes.
Understanding the Preamble and SFD has several practical implications for network design, troubleshooting, and performance analysis.
Bandwidth Overhead Calculation
The Preamble and SFD add 8 bytes to every frame transmitted. Combined with the 12-byte Inter-Packet Gap (IPG), every frame carries 20 bytes of physical layer overhead regardless of size:
For small packets (VoIP, gaming, acknowledgments), this overhead is significant. A 64-byte frame has 24% physical layer overhead (20/84).
If you observe frames being dropped but no CRC errors in your statistics, consider the possibility of Preamble/SFD corruption. Physical layer issues like cable damage, connector problems, or EMI can corrupt synchronization bits, causing the receiver to lose frame lock entirely—the frame is then never processed far enough to check its FCS.
The Preamble and Start Frame Delimiter represent the critical bridge between the physical layer's analog signal processing and the data link layer's digital frame processing. Let's consolidate what we've learned:
What's Next
With synchronization established and the frame boundary identified, the receiver is ready to process actual frame content. The next page examines the Destination and Source MAC Address fields—the addressing mechanism that enables directed communication within local area networks.
You now understand the engineering rationale, bit-level structure, and practical implications of the Ethernet Preamble and Start Frame Delimiter. These 8 bytes of synchronization overhead enable the reliable, high-speed frame detection that underpins all Ethernet communication.