When you write a program, you don't think of memory as a flat, undifferentiated array of bytes. You think in terms of structure: your code goes in one place, your global variables in another, your stack grows downward, your heap grows upward, and your dynamically loaded libraries occupy their own regions. This is the programmer's natural view of memory—organized, structured, and meaningful.
But paging, despite its elegance in solving fragmentation, imposes a fundamentally different view. To a paging system, memory is a uniform collection of fixed-size pages—there's no distinction between a page holding code and one holding data. The logical structure of your program is invisible to the hardware.
Segmentation bridges this gap. It organizes memory into logical segments—variable-sized blocks that correspond to the meaningful units of a program. Each segment represents a complete, logically distinct unit: the main program's code, a library module, a symbol table, a stack, or a heap. This alignment between memory organization and program structure is what makes segmentation compelling.
By the end of this page, you will understand: what logical segments are and why they exist, the fundamental distinction between segments and pages, how segments reflect program structure, the historical motivation for segmentation, addressing within segmented memory, and the relationship between segmentation and other memory management techniques.
A logical segment is a contiguous block of memory that represents a complete, meaningful unit of a program. Unlike pages, which are arbitrary fixed-size chunks created for hardware convenience, segments correspond to the logical divisions that programmers naturally create when writing software.
Consider a typical C program. When compiled and loaded, it consists of several distinct logical units: a text segment holding machine instructions, a data segment holding initialized globals, a BSS segment holding uninitialized globals, a heap for dynamic allocations, and a stack for call frames and local variables.
Each of these is a logical segment—a coherent unit with its own purpose, access patterns, and lifetime. The key insight of segmentation is that memory management should respect these logical boundaries.
The Formal Definition:
Formally, a segment is defined by a tuple (segment_number, base_address, limit). The segment number uniquely identifies the segment within the process's address space. The base address indicates where the segment begins in physical memory. The limit specifies the segment's size, ensuring access remains within bounds.
An address in a segmented system is a two-component address: (segment_number, offset). To access memory, the hardware uses the segment number to index the segment table, retrieves that segment's base and limit, checks that the offset is less than the limit (trapping if it is not), and adds the offset to the base to form the physical address.
This two-dimensional addressing scheme is fundamental to how segmentation represents the programmer's view of memory.
The term 'segment' comes from the idea of dividing (segmenting) a program into its natural parts. Just as a worm's body has segments that are complete functional units, a program has segments that are complete logical units. This biological metaphor captures the essential idea: each segment is complete and meaningful on its own.
Segmentation emerged in the 1960s as computer scientists grappled with fundamental questions about memory organization. The systems of that era faced challenges that made segmentation a natural solution.
The Problem with Flat Address Spaces
Early computers used flat, one-dimensional address spaces. A program was loaded starting at some base address, and all addresses were relative to that base. This simplicity came with significant problems: code, data, and stack could not be given different protections; relocating a loaded program meant rewriting every embedded address; sharing a routine between programs was impractical; and a stray pointer could silently corrupt any part of the program.
| System | Year | Segmentation Innovation | Impact |
|---|---|---|---|
| Burroughs B5000 | 1961 | Tagged architecture with code/data segments | Pioneered structured memory |
| Multics | 1965 | Segments + pages, rich segment attributes | Defined modern segmentation concepts |
| Intel 8086 | 1978 | Four 64KB segments (CS, DS, SS, ES) | Brought segmentation to microprocessors |
| Intel 80286 | 1982 | Protected mode with segment descriptors | Added hardware protection to segments |
| Intel 80386 | 1985 | Segments + paging combined | Full modern implementation |
The Multics Vision
The most influential early segmented system was Multics (Multiplexed Information and Computing Service), developed at MIT starting in 1964. Multics introduced a revolutionary concept: treat all of memory as a collection of named segments that persist independently of processes.
In Multics, segments were named and persisted independently of any process; files and memory were unified, since a file was simply a segment that could be mapped and addressed directly; and each segment carried its own access attributes and protection domain.
This vision was so ambitious that Multics was considered overengineered by some, leading Ken Thompson and Dennis Ritchie to create Unix as a simpler alternative. Yet the concepts pioneered by Multics—segments with attributes, protection domains, and the unification of files and memory—remain influential today.
When Intel designed the 8086 processor in 1978, they needed to address more than 64KB of memory with 16-bit registers. Their solution: use segment registers that provide a base address, shifted left by 4 bits, added to an offset. This allowed 1MB addressing (20 bits) with 16-bit components. While pragmatic, this design choice forced generations of programmers to wrestle with 'near' and 'far' pointers—a consequence of hardware segmentation meeting real-world constraints.
Understanding the distinction between segments and pages is crucial for grasping why both exist and how they can complement each other. These two approaches to memory organization represent fundamentally different philosophies.
The Philosophical Divide
The segment-page dichotomy reflects a deeper tension in systems design: logical organization vs. physical efficiency.
Segmentation says: "Memory should be organized the way programmers think. Give each logical unit its own space, let it grow as needed, protect it according to its purpose."
Paging says: "Memory should be organized for efficient use. Divide everything into uniform chunks that can be shuffled, swapped, and managed without fragmentation."
Neither view is wrong—they're solving different problems. The insight of modern systems is that they can be combined: use segmentation at the logical level (for protection, sharing, and programmer convenience) and paging at the physical level (for memory management efficiency).
Where Each Excels:
| Aspect | Segmentation Better | Paging Better |
|---|---|---|
| Matching program structure | ✓ | |
| Eliminating external fragmentation | | ✓ |
| Enabling sharing by semantic unit | ✓ | |
| Simplifying memory management | | ✓ |
| Fine-grained protection | ✓ | |
| Supporting virtual memory | | ✓ |
| Handling variable-size data | ✓ | |
Because segments have variable sizes and live in physical memory, allocation and deallocation create external fragmentation—scattered free regions that can't satisfy large requests even when total free memory is sufficient. This problem is why pure segmentation is rarely used alone; combining with paging eliminates external fragmentation while preserving segmentation's logical benefits.
In a segmented memory system, every address consists of two components: a segment selector (or segment number) and an offset within that segment. This two-dimensional addressing is fundamental to how segmentation works.
```c
#include <stdint.h>

// Conceptual representation of a segmented address
// In a segmented system, addresses have two components:

// Logical Address Structure
struct logical_address {
    uint16_t segment;  // Segment selector (which segment)
    uint32_t offset;   // Offset within the segment
};

// Example: Address "segment 3, offset 0x1A4"
// This means: byte 0x1A4 within the 3rd segment

// The hardware performs translation:
// 1. Look up segment 3 in the segment table
// 2. Get the base address of segment 3 (e.g., 0x4000)
// 3. Check that offset (0x1A4) < segment limit
// 4. Physical address = base + offset = 0x4000 + 0x1A4 = 0x41A4

// In Intel x86 notation, this might be written as:
// segment:offset -> CS:0x1A4 (for code segment)
// segment:offset -> DS:0x400 (for data segment)
```

Advantages of Two-Dimensional Addressing
Natural Separation: Different segments occupy different address spaces. A reference to "segment 2, offset 100" and "segment 7, offset 100" access completely different memory locations, even though the offsets are identical.
Independent Relocation: Each segment can be moved in physical memory independently. Only the segment table entry needs updating; all offsets within the segment remain valid.
Natural Bounds Checking: Each segment has an associated limit. Any access beyond this limit generates a hardware trap, catching buffer overflows and pointer errors at the source.
Meaningful Addresses: Addresses carry semantic information. "Code segment, offset X" means instruction at position X. "Stack segment, offset Y" means stack location Y. This aids debugging and security.
The Translation Process
When a program issues a memory reference, the following occurs:

1. The address is split into a segment number and an offset.
2. The segment number indexes the segment table (or selects a cached segment descriptor), yielding the segment's base, limit, and permissions.
3. The offset is compared against the limit; an out-of-bounds access raises a trap.
4. The access type (read, write, or execute) is checked against the segment's permissions.
5. The base and offset are added to produce the physical address.
In many segmented architectures (like Intel x86), common instructions use implicit segment registers. Code fetches automatically use the Code Segment (CS), stack operations use the Stack Segment (SS), and most data references use the Data Segment (DS). Programmers can override these defaults for specific accesses, but the implicit mapping reduces the burden of managing segments in everyday code.
The fundamental insight of segmentation is that programs are not amorphous blobs of data—they have structure. Segmentation makes this structure visible to the hardware, enabling memory management that respects the program's logical organization.
A Typical Program's Segment Structure:
When a program is compiled and linked, it naturally divides into segments that reflect different purposes and access patterns:
| Segment | Contents | Access | Lifespan | Growth |
|---|---|---|---|---|
| Text/Code | Machine instructions | Execute + Read | Static (process lifetime) | Never changes |
| Data (initialized) | Global/static vars with initial values | Read + Write | Static (process lifetime) | Never changes |
| BSS | Uninitialized global/static vars | Read + Write | Static (process lifetime) | Never changes |
| Heap | Dynamic allocations (malloc) | Read + Write | Dynamic | Grows upward on demand |
| Stack | Local vars, call frames, return addrs | Read + Write | Dynamic | Grows downward on call, shrinks on return |
| Shared libs | Dynamically linked library code/data | Varies by section | Process lifetime (ref-counted) | Never changes after load |
Why This Structure Matters for Memory Management:
Different Protection Requirements
Different Sharing Potential
Different Growth Patterns
Different Lifetime Requirements
```c
#include <stdlib.h>  // for malloc/free

// Example: How a C program maps to segments
// Consider this simple program:

int initialized_global = 42;    // DATA segment (has initial value)
int uninitialized_global;       // BSS segment (zero-initialized)
const char* message = "Hello";  // DATA segment (pointer + string in RODATA)

void helper_function() {        // TEXT segment
    int local_var;              // STACK segment (created at runtime)
    // ...
}

int main() {                    // TEXT segment
    int* heap_array;            // STACK segment (the pointer itself)
    heap_array = malloc(100 * sizeof(int)); // HEAP segment allocation
    helper_function();
    free(heap_array);           // Returns memory to HEAP
    return 0;
}

// Memory layout (approximate):
//
// High addresses ┌─────────────────┐
//                │      STACK      │ ← grows downward
//                │   (local_var,   │
//                │   heap_array)   │
//                ├─────────────────┤
//                │        ↓        │
//                │  (free space)   │
//                │        ↑        │
//                ├─────────────────┤
//                │      HEAP       │ ← grows upward
//                │ (malloc'd data) │
//                ├─────────────────┤
//                │       BSS       │ ← uninitialized_global
//                ├─────────────────┤
//                │      DATA       │ ← initialized_global
//                ├─────────────────┤
//                │     RODATA      │ ← "Hello" string
//                ├─────────────────┤
//                │      TEXT       │ ← main, helper_function
// Low addresses  └─────────────────┘
```

On Unix-like systems, executable files use ELF (Executable and Linkable Format), which explicitly defines program segments. When you run 'readelf -l program', you see the program headers describing each segment: its type, virtual address, physical address, file size, memory size, and flags. These ELF segments directly correspond to the logical segments loaded into memory.
One of segmentation's most elegant features is how naturally it enables memory sharing. Because segments correspond to logical program units, sharing segments means sharing meaningful components—not arbitrary pages that happen to overlap.
The Sharing Scenario:
Consider 50 users all running the same text editor. Without sharing, each process loads its own copy of the editor's code, so 50 identical, read-only copies of the same instructions occupy memory.
With segment sharing, one physical copy of the code segment serves all 50 users: every process's segment table points to the same physical memory, and only the private data and stack segments are duplicated.
How Sharing Works:
Segment Table Entries Point to Same Physical Memory
Reference Counting
Copy-on-Write for Data Segments
What Can Be Shared:
| Segment Type | Sharable? | Notes |
|---|---|---|
| Code (Text) | Always | Multiple readers, no writers |
| Read-only Data | Always | Constant strings, lookup tables |
| Initialized Data | With COW | Each process gets private copy on write |
| BSS | With COW | Copy only the portion written |
| Heap | No | Process-private by nature |
| Stack | Never | Fundamental to process identity |
| Shared Libraries | Code: Yes, Data: COW | This is their purpose |
The C library (libc) is used by nearly every program on a Unix system. With segmentation, one copy of libc's code resides in memory, shared by hundreds of processes. The memory savings are enormous. This is why shared libraries are called 'shared'—they share segments across process boundaries, not just share access to the same file on disk.
Protection in a segmented system is remarkably natural because access control aligns with program structure. Each segment can have its own access permissions, and those permissions make semantic sense.
Segment-Level Protection Attributes: a segment descriptor typically carries Read, Write, and Execute permissions, a privilege level (the DPL on x86), and a present/valid bit, so each segment's protections can match its purpose.
Protection Scenarios:
Scenario 1: Preventing Code Injection
// Attacker tries to execute data as code
// Data segment has permissions: Read, Write, NO Execute
// Hardware blocks execution attempt → Protection Fault
Scenario 2: Preventing Code Modification
// Bug or attack tries to overwrite code
// Code segment has permissions: Read, Execute, NO Write
// Hardware blocks write attempt → Protection Fault
Scenario 3: Protecting Kernel Segments
// User process tries to access kernel data
// Kernel segment DPL = 0, User process CPL = 3
// DPL < CPL → General Protection Fault
Scenario 4: Stack Smashing Protection
// Buffer overflow on stack injects code
// Stack segment: Read, Write, NO Execute
// Even if injected, code cannot execute
These protections happen in hardware at every memory access, with no software overhead. The segment descriptor is cached in the segment register, so protection checks add zero cycles to most memory operations.
For years, Intel x86 processors lacked a no-execute bit for pages (x86 segmentation had E/X permission, but widely-used flat memory models bypassed it). The AMD64 architecture finally added the NX (No-eXecute) bit, and Intel followed with XD (eXecute Disable). This seemingly small addition dramatically improved security by allowing operating systems to mark data regions as non-executable at the page level, complementing segment-level protections.
You might wonder: if paging solved fragmentation so elegantly, why do modern systems still use segments? The answer is nuanced. Pure segmentation has largely given way to paging, but segmentation concepts persist in important ways.
Current State of Segmentation:
| System/Architecture | Segmentation Role | Details |
|---|---|---|
| x86-64 (Long Mode) | Vestigial | Base fixed at 0, limit disabled; FS/GS used for TLS |
| Linux (x86-64) | Minimal | Uses FS for thread-local storage, GS for per-CPU data |
| Windows (x86-64) | Minimal | Similar TLS use; segments effectively unused for MM |
| ARM (64-bit) | None | Pure paging with no segmentation hardware |
| RISC-V | None | Clean paging design, no segmentation |
| WebAssembly | Conceptual | Linear memory with bounds checking echoes segmentation |
Why Segmentation Retreated: variable-sized segments cause external fragmentation; compilers and languages came to assume a flat address space, leaving segment registers unused; two-dimensional addresses complicate pointer arithmetic; and paging alone proved sufficient for protection and virtual memory, so newer architectures such as ARM and RISC-V never implemented segmentation hardware at all.
Where Segmentation Concepts Survive:
Even though hardware segmentation has faded, operating systems maintain segment-like abstractions internally. Linux's vm_area_struct describes contiguous regions with consistent permissions—essentially software segments. The logical concepts of segmentation remain valuable; only the hardware implementation has shifted to paging.
This page has provided a comprehensive exploration of logical segments as the foundational concept of memory segmentation. Let's consolidate the key insights: segments are variable-sized blocks that mirror a program's logical units; addresses are two-dimensional (segment, offset) pairs translated through a per-process segment table; segments enable natural sharing and semantically meaningful protection; their variable size causes external fragmentation, which paging avoids; and modern systems keep segmentation's logical concepts in software while using paging hardware underneath.
What's Next:
With the theoretical foundation established, the next page examines the specific segment types found in typical programs: code segments, data segments, and stack segments. We'll explore how these segments differ in their contents, access patterns, and management requirements—understanding that forms the basis for practical segmentation implementation.
You now understand what logical segments are, why they exist, and how they provide a programmer's view of memory organization. This foundational knowledge prepares you both for understanding specific segment types and for appreciating how segmentation combines with paging in modern systems.