We've explored what segment table entries contain—base addresses, limits, protection bits. But there's a fundamental question we haven't fully addressed: Where are these segment tables stored, and how does the CPU find them?
This is a classic bootstrapping problem. The CPU needs segment descriptors to access memory, but those descriptors are themselves stored in memory. How does the hardware break this circular dependency?
The answer lies in special CPU registers that hold the physical addresses and sizes of descriptor tables. These registers—GDTR (Global Descriptor Table Register) and LDTR (Local Descriptor Table Register)—are the root of trust for segmented memory access. Everything flows from them.
Understanding descriptor table location is essential for operating system development: the OS must build the tables, tell the CPU where they are, switch them across processes, and protect them from tampering.
This page demystifies the architecture of descriptor table management.
By the end of this page, you will understand the Global and Local Descriptor Tables (GDT/LDT), the GDTR and LDTR registers, descriptor table structure and layout, selector-to-descriptor lookup, per-process LDT management, context switch handling, and descriptor table protection.
x86 protected mode uses a two-tier descriptor table architecture:
1. Global Descriptor Table (GDT): one per system (or per CPU on SMP), holding descriptors every process shares: kernel code and data segments, the TSS, and the descriptors that locate each LDT.
2. Local Descriptor Table (LDT): optionally one per process, holding that process's private segment descriptors.
Why Two Tables?
The two-table design separates concerns: system-wide segments live in the one GDT, while process-private segments live in each process's LDT.
This design allows the OS to switch a process's entire set of private segments by reloading a single register (LDTR) on a context switch, leaving the shared GDT untouched.
Many modern operating systems (Linux, Windows in 64-bit mode) minimize LDT usage. With flat memory models and paging providing isolation, per-process LDTs add complexity without significant benefit. The GDT is still essential, but often only a handful of descriptors are actually used.
The Global Descriptor Table Register (GDTR) is a special CPU register that holds the location and size of the GDT. It cannot be accessed like general-purpose registers—only special instructions can read or write it.
GDTR Structure:
┌────────────────────────────────────────────────────────┐
│ GDTR: 48 bits (6 bytes) on 32-bit,                     │
│       80 bits (10 bytes) on 64-bit                     │
├────────────────────┬───────────────────────────────────┤
│ Limit (16 bits)    │ Base Address (32/64 bits)         │
│ Bytes 0-1          │ Bytes 2-5 (or 2-9 on 64-bit)      │
└────────────────────┴───────────────────────────────────┘
Limit: Size of GDT in bytes, minus 1 (maximum value 0xFFFF allows 65536 bytes = 8192 entries × 8 bytes)
Base: Linear (virtual) address where GDT starts
Critical Points:
Base is Linear Address: With paging enabled, the GDT base is a virtual address that must be mapped in page tables.
Limit is Size-1: A limit of 0x17 means 24 bytes (3 entries × 8 bytes), not 0x17 entries.
Entry 0 is Null: GDT entry 0 is reserved as the "null descriptor" and cannot be used.
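To make the limit arithmetic concrete, here is a minimal user-space sketch (GCC-style C; the struct and field names are illustrative, not an OS API) of the 6-byte pseudo-descriptor that LGDT and SGDT operate on:

#include <stdint.h>
#include <stdio.h>

typedef struct {
    uint16_t limit;  // size of the GDT in bytes, minus 1
    uint32_t base;   // linear address of the GDT (64-bit mode: uint64_t)
} __attribute__((packed)) gdtr_t;

int main(void) {
    gdtr_t gdtr = { .limit = 0x17, .base = 0xC0001000 };
    // Limit 0x17 = 23, so the table is 24 bytes: exactly 3 descriptors.
    unsigned entries = (gdtr.limit + 1u) / 8u;
    printf("GDT at 0x%08X holds %u entries\n", (unsigned)gdtr.base, entries);
    return 0;
}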
GDTR Instructions:
; LGDT - Load GDT Register
; Operand is a 6-byte (or 10-byte) memory location
lgdt [gdt_descriptor]

; SGDT - Store GDT Register
; Saves current GDTR to memory (can be executed in user mode!)
sgdt [gdt_save_location]

; GDT descriptor structure (for LGDT)
gdt_descriptor:
    dw gdt_end - gdt_start - 1   ; Limit (size - 1)
    dd gdt_start                 ; Base address (32-bit)

; 64-bit version
gdt_descriptor_64:
    dw gdt_end - gdt_start - 1   ; Limit (size - 1)
    dq gdt_start                 ; Base address (64-bit)

; Example GDT
align 16
gdt_start:
gdt_null:            ; Entry 0: Null descriptor (required)
    dq 0
gdt_code:            ; Entry 1: Kernel code segment
    dw 0xFFFF        ; Limit[15:0]
    dw 0x0000        ; Base[15:0]
    db 0x00          ; Base[23:16]
    db 10011010b     ; Access byte: P=1, DPL=0, S=1, Type=Code Execute/Read
    db 11001111b     ; Flags + Limit[19:16]: G=1, D=1, Limit=0xF
    db 0x00          ; Base[31:24]
gdt_data:            ; Entry 2: Kernel data segment
    dw 0xFFFF
    dw 0x0000
    db 0x00
    db 10010010b     ; Access byte: P=1, DPL=0, S=1, Type=Data Read/Write
    db 11001111b     ; G=1, D=1
    db 0x00
gdt_end:

Security Consideration: SGDT Vulnerability
The SGDT instruction can be executed from user mode, revealing the GDT's location. This has been exploited for kernel ASLR bypass (the GDT's base address points into kernel memory, revealing where kernel structures live) and as reconnaissance for privilege-escalation exploits.
Modern systems mitigate this with UMIP (User-Mode Instruction Prevention), which makes SGDT fault in Ring 3, or by having a hypervisor intercept SGDT and return fake values.
While SGDT can run in user mode (unfortunately), LGDT is strictly privileged (Ring 0 only). Executing LGDT from Ring 3 causes #GP. This is critical: if users could load arbitrary GDTs, all protection would be meaningless.
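The leak is easy to demonstrate: absent UMIP, an ordinary user-mode program can execute SGDT itself. A sketch for GCC on x86 Linux (a demo under stated assumptions, not an exploit; on UMIP-enabled systems it faults or, under some hypervisors, sees fake values):

#include <stdint.h>
#include <stdio.h>

struct __attribute__((packed)) desc_ptr {
    uint16_t  limit;
    uintptr_t base;   // 4 bytes on 32-bit, 8 bytes on 64-bit
};

int main(void) {
    struct desc_ptr gdtr;
    __asm__ volatile ("sgdt %0" : "=m"(gdtr));  // legal in Ring 3 without UMIP
    printf("GDT base = %p, limit = 0x%04x (%u entries)\n",
           (void *)gdtr.base, gdtr.limit,
           (unsigned)((gdtr.limit + 1) / 8));
    return 0;
}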
The Local Descriptor Table Register (LDTR) points to the current process's LDT. Unlike GDTR which holds a direct memory address, LDTR holds a selector that references a descriptor in the GDT.
LDTR Structure:
┌──────────────────────────────────────────────────────────────────┐
│ LDTR: 16-bit visible selector + hidden descriptor cache │
├──────────────────────────────────────────────────────────────────┤
│ Visible: 16-bit selector (index into GDT for LDT descriptor) │
│ Hidden: Cached LDT descriptor (base, limit, attributes) │
└──────────────────────────────────────────────────────────────────┘
The Indirection:
LLDT is loaded with a selector, not an address: the CPU looks that selector up in the GDT, finds an LDT descriptor there, and caches its base, limit, and attributes in LDTR's hidden part.
Why This Extra Level?
Having LDTR reference the GDT rather than directly containing a base/limit means the LDT is described and validated like any other segment: its GDT descriptor carries a base, limit, and DPL, so the hardware can bounds-check every LDT access, and switching LDTs becomes a single selector load (LLDT) rather than a multi-field register update.
// Setting up an LDT for a process

// Step 1: Allocate memory for the LDT
#define LDT_ENTRIES 32
segment_descriptor_t* ldt = kmalloc(LDT_ENTRIES * sizeof(segment_descriptor_t));
memset(ldt, 0, LDT_ENTRIES * sizeof(segment_descriptor_t));

// Step 2: Populate LDT with process-specific segments
create_gdt_entry(&ldt[1], process->code_base, process->code_limit,
                 ACCESS_CODE_EXEC_READ | DPL_USER,
                 FLAGS_32BIT | FLAGS_PAGE_GRAN);

create_gdt_entry(&ldt[2], process->data_base, process->data_limit,
                 ACCESS_DATA_READ_WRITE | DPL_USER,
                 FLAGS_32BIT | FLAGS_PAGE_GRAN);

// Step 3: Create LDT descriptor in GDT
uint16_t ldt_gdt_index = allocate_gdt_slot();
create_system_descriptor(&gdt[ldt_gdt_index],
                         (uint32_t)ldt,        // Base: LDT's address
                         LDT_ENTRIES * 8 - 1,  // Limit: LDT size - 1
                         TYPE_LDT,             // Type: LDT descriptor
                         DPL_KERNEL);          // Only kernel can load LDTR

// Step 4: Construct LDTR selector
process->ldt_selector = (ldt_gdt_index << 3) | TI_GDT | RPL_KERNEL;

// Step 5: Load LDTR (during context switch to this process)
void switch_to_process(process_t* proc) {
    // Load the new LDT
    __asm__ volatile ("lldt %0" :: "r"(proc->ldt_selector));
    // Now selectors with TI=1 reference this process's LDT
}

LDTR Instructions:

LLDT loads LDTR from a selector (Ring 0 only; executing it from Ring 3 raises #GP); SLDT stores the visible selector back to memory or a register.
Null LDT:
If LDTR is loaded with a null selector (0), no LDT is active. Any segment selector with TI=1 (LDT indicator) will cause #GP because there's no LDT to look up. Many modern OSes run with LDTR=0.
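Running with LDTR=0 takes one instruction. A minimal Ring 0 sketch (GCC inline assembly; the function name is illustrative):

static inline void disable_ldt(void) {
    uint16_t null_sel = 0;
    __asm__ volatile ("lldt %0" :: "r"(null_sel));
    // From here on, any selector with TI=1 raises #GP.
}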
On SMP systems, each CPU typically has its own GDT copy. This allows per-CPU data (via FS/GS bases) and separate TSS entries per CPU. The GDT content is mostly identical, but having separate copies avoids cache contention and allows CPU-specific entries.
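A sketch of that arrangement, assuming a hypothetical this_cpu_id() helper and treating each descriptor as the 8-byte value described earlier (none of these names are a real kernel API):

#include <stdint.h>

#define NR_CPUS     8
#define GDT_ENTRIES 16

typedef uint64_t gdt_entry_t;            // one 8-byte descriptor

typedef struct {
    uint16_t  limit;
    uintptr_t base;
} __attribute__((packed)) gdtr_t;

static gdt_entry_t percpu_gdt[NR_CPUS][GDT_ENTRIES];

extern unsigned this_cpu_id(void);       // assumed per-CPU helper

// Run once on each CPU during bring-up (Ring 0).
void load_percpu_gdt(void) {
    gdtr_t gdtr = {
        .limit = sizeof(percpu_gdt[0]) - 1,
        .base  = (uintptr_t)percpu_gdt[this_cpu_id()],
    };
    __asm__ volatile ("lgdt %0" :: "m"(gdtr));
}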
Descriptor tables are arrays of 8-byte (or 16-byte in 64-bit mode for system descriptors) entries. Let's examine the structure and typical layouts.
GDT Structure:
Offset Entry Purpose
0x00 [0] Null Descriptor (required, never used)
0x08 [1] Kernel Code Segment (CS when in kernel)
0x10 [2] Kernel Data Segment (DS/ES/SS when in kernel)
0x18 [3] User Code Segment (CS when in user mode)
0x20 [4] User Data Segment (DS/ES/SS when in user mode)
0x28 [5] TSS Descriptor (for current CPU)
0x30 [6] Per-CPU Data Segment (for FS/GS base)
... Additional entries as needed
A simplified, Linux-style 32-bit GDT (entry names follow Linux conventions; the real Linux GDT places these entries at different indices):
| Index | Selector | Name | Purpose |
|---|---|---|---|
| 0 | 0x00 | GDT_ENTRY_NULL | Null descriptor |
| 1 | 0x08 | GDT_ENTRY_KERNEL_CS | Kernel code (Ring 0) |
| 2 | 0x10 | GDT_ENTRY_KERNEL_DS | Kernel data (Ring 0) |
| 3 | 0x18 | GDT_ENTRY_USER_CS | User code (Ring 3) |
| 4 | 0x20 | GDT_ENTRY_USER_DS | User data (Ring 3) |
| 5 | 0x28 | GDT_ENTRY_TSS | Task State Segment |
| 6 | 0x30 | GDT_ENTRY_LDT | LDT (if used) |
| 7+ | 0x38+ | Various | Per-CPU, TLS, etc. |
Selector to Entry Mapping:
The relationship between selectors and GDT entries:
Selector = (Index << 3) | (TI << 2) | RPL
Index: Entry number in the table (0, 1, 2, ...)
TI: Table Indicator (0 = GDT, 1 = LDT)
RPL: Requested Privilege Level (0-3)
Examples:
Selector 0x08 = (1 << 3) | (0 << 2) | 0 = Entry 1 in GDT, RPL=0
Selector 0x1B = (3 << 3) | (0 << 2) | 3 = Entry 3 in GDT, RPL=3
Selector 0x0F = (1 << 3) | (1 << 2) | 3 = Entry 1 in LDT, RPL=3
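These encodings are easy to get wrong by one bit position, so here is a small set of illustrative helpers (not a standard API) that implement the formula and reproduce the examples above as assertions:

#include <assert.h>
#include <stdint.h>

#define TI_GDT 0u
#define TI_LDT 1u

static inline uint16_t make_selector(uint16_t index, uint16_t ti, uint16_t rpl) {
    return (uint16_t)((index << 3) | (ti << 2) | (rpl & 3u));
}

static inline uint16_t selector_index(uint16_t sel) { return sel >> 3; }
static inline uint16_t selector_ti(uint16_t sel)    { return (sel >> 2) & 1u; }
static inline uint16_t selector_rpl(uint16_t sel)   { return sel & 3u; }

int main(void) {
    assert(make_selector(1, TI_GDT, 0) == 0x08);   // kernel code
    assert(make_selector(3, TI_GDT, 3) == 0x1B);   // user code
    assert(make_selector(1, TI_LDT, 3) == 0x0F);   // entry 1 in the LDT
    assert(selector_index(0x1B) == 3 && selector_rpl(0x1B) == 3);
    return 0;
}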
Finding a Descriptor:
segment_descriptor_t* find_descriptor(uint16_t selector) {
    uint16_t index = selector >> 3;
    bool use_ldt = (selector >> 2) & 1;

    if (use_ldt) {
        // TI = 1: Look in LDT
        if (ldtr_is_null())
            raise_gp(selector);          // No LDT loaded
        if (index * 8u + 7u > ldt_limit)
            raise_gp(selector);          // Descriptor would exceed the limit
        return &ldt[index];
    } else {
        // TI = 0: Look in GDT
        if (index * 8u + 7u > gdt_limit)
            raise_gp(selector);          // Descriptor would exceed the limit
        return &gdt[index];
    }
}
GDT entry 0 (the null descriptor) is reserved and cannot be used for actual segments. Loading selector 0 into a data segment register (DS, ES, FS, GS) sets it to null—subsequent data access through that register causes #GP. Loading 0 into CS or SS always causes #GP immediately.
When a segment register is loaded with a selector, the CPU must find and validate the corresponding descriptor. This is a critical path that happens on every segment load.
Lookup Algorithm:
function load_segment_register(register, selector):
    // Step 1: Parse selector
    index = selector >> 3          // Bits 15-3: Index
    TI    = (selector >> 2) & 1    // Bit 2: Table indicator
    RPL   = selector & 3           // Bits 1-0: Requested privilege

    // Step 2: Handle null selector
    if selector == 0:
        if register in [CS, SS]:
            raise #GP(0)           // Cannot load null into CS/SS
        else:
            register.selector = 0
            register.cached.valid = false
            return                 // Null DS/ES/FS/GS is allowed

    // Step 3: Determine which table
    if TI == 0:
        table_base  = GDTR.base
        table_limit = GDTR.limit
    else:  // TI == 1
        if LDTR.selector == 0:
            raise #GP(selector)    // No LDT loaded
        table_base  = LDTR.cached.base
        table_limit = LDTR.cached.limit

    // Step 4: Bounds check
    descriptor_offset = index * 8
    if descriptor_offset + 7 > table_limit:
        raise #GP(selector)        // Index out of bounds

    // Step 5: Fetch descriptor from memory
    descriptor = read_memory(table_base + descriptor_offset, 8 bytes)

    // Step 6: Validate descriptor type for target register
    if register == CS:
        if not descriptor.is_code_segment():
            raise #GP(selector)
    if register == SS:
        if not descriptor.is_writable_data():
            raise #GP(selector)
    if register in [DS, ES, FS, GS]:
        if descriptor.is_system():
            raise #GP(selector)

    // Step 7: Presence check
    if not descriptor.present:
        raise #NP(selector)        // Segment not present

    // Step 8: Privilege check (varies by segment type)
    perform_privilege_check(register, selector, descriptor)

    // Step 9: Cache descriptor
    register.selector = selector
    register.cached   = descriptor
    mark_descriptor_accessed(table_base + descriptor_offset)

Descriptor Address Calculation:
GDT Entry Address = GDTR.Base + (Selector.Index × 8)
Example:
GDTR.Base = 0xC0001000
Selector = 0x18 (Index = 3)
Entry Address = 0xC0001000 + (3 × 8) = 0xC0001018
Memory Access Pattern:
When loading a segment register, the CPU reads the full 8-byte descriptor from the table in memory, validates it, and, if the descriptor's accessed bit is clear, writes the access byte back to set it.
The accessed bit write can cause TLB activity and cache coherence traffic on SMP systems.
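As a C model of that pattern (assuming the gdt_entry_t layout below matches the 8-byte descriptor format shown earlier), the hardware's read-modify-write of the accessed bit looks roughly like this:

#include <stdint.h>

typedef struct {
    uint16_t limit_low;
    uint16_t base_low;
    uint8_t  base_mid;
    uint8_t  access;       // P, DPL, S, type; bit 0 is the accessed bit
    uint8_t  flags_limit;  // G, D, limit[19:16]
    uint8_t  base_high;
} __attribute__((packed)) gdt_entry_t;

static void model_segment_load(gdt_entry_t *table, uint16_t selector) {
    gdt_entry_t *desc = &table[selector >> 3];   // base + index * 8
    // The CPU stores back only when the bit is clear; that store is
    // what generates the cache coherence traffic mentioned above.
    if (!(desc->access & 1u))
        __atomic_fetch_or(&desc->access, 1u, __ATOMIC_SEQ_CST);
}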
Because loading segment registers involves memory access and validation, performance-sensitive code minimizes segment register loads. With flat memory models (all segments base=0, limit=4GB), segment registers can be set once and never changed, eliminating this overhead during normal execution.
When a computer boots, it starts in real mode (8086-compatible mode) with no GDT. Transitioning to protected mode requires setting up a GDT first. This is one of the first tasks of any OS bootloader or kernel.
Boot Sequence: disable interrupts, load the GDTR with LGDT, set CR0.PE, far-jump to reload CS, reload the data segment registers, and set up a stack, exactly the steps in the listing below.
Minimal Boot GDT:
; Minimal GDT for entering protected mode
; Called before switching from real mode to protected mode

setup_protected_mode:
    cli                     ; Disable interrupts

    ; Load GDT
    lgdt [gdt_descriptor]

    ; Enable protected mode (set PE bit in CR0)
    mov eax, cr0
    or eax, 1               ; Set PE bit
    mov cr0, eax

    ; Far jump to load CS with protected mode selector
    jmp 0x08:protected_mode_entry

[bits 32]
protected_mode_entry:
    ; Now in 32-bit protected mode
    ; Load data segment registers
    mov ax, 0x10            ; Kernel data selector
    mov ds, ax
    mov es, ax
    mov ss, ax
    mov fs, ax
    mov gs, ax

    ; Set up stack
    mov esp, 0x90000

    ; Continue to kernel...
    jmp kernel_main

; GDT data
align 8
gdt_start:
gdt_null:                   ; Entry 0: Required null
    dq 0

gdt_code:                   ; Entry 1: 32-bit code, base=0, limit=4GB
    dw 0xFFFF               ; Limit [0:15]
    dw 0x0000               ; Base [0:15]
    db 0x00                 ; Base [16:23]
    db 0b10011010           ; Access: P=1, DPL=0, S=1, Type=Execute/Read
    db 0b11001111           ; Flags: G=1, D=1, Limit [16:19]
    db 0x00                 ; Base [24:31]

gdt_data:                   ; Entry 2: 32-bit data, base=0, limit=4GB
    dw 0xFFFF
    dw 0x0000
    db 0x00
    db 0b10010010           ; Access: P=1, DPL=0, S=1, Type=Read/Write
    db 0b11001111
    db 0x00

gdt_end:

gdt_descriptor:
    dw gdt_end - gdt_start - 1  ; Size - 1
    dd gdt_start                ; Address

Why Far Jump After Setting CR0.PE?
When CR0.PE is set, the CPU is in protected mode, but CS still holds a real-mode value, leaving execution in an inconsistent hybrid state until CS is reloaded. The far jump reloads CS with a protected-mode selector, fills CS's hidden descriptor cache from the new GDT, and flushes any real-mode instructions already in the prefetch queue.
Kernel GDT Setup:
After basic boot, the kernel typically builds its full GDT in kernel memory (user segments, TSS descriptor, per-CPU entries) and reloads GDTR to point at it, as sketched below.
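A hedged sketch of that rebuild, reusing the helper and flag names from this page's other listings (they are conventions of this text, not a real kernel API), with a flat 0 to 4GB base/limit for every segment:

static segment_descriptor_t kernel_gdt[8];   // entry 0 stays null

void gdt_init(void) {
    set_segment_descriptor(&kernel_gdt[1], 0, 0xFFFFF,   // kernel code
                           DESC_CODE_EXEC_READ | DESC_DPL0,
                           DESC_GRAN_4K | DESC_32BIT);
    set_segment_descriptor(&kernel_gdt[2], 0, 0xFFFFF,   // kernel data
                           DESC_DATA_READ_WRITE | DESC_DPL0,
                           DESC_GRAN_4K | DESC_32BIT);
    set_segment_descriptor(&kernel_gdt[3], 0, 0xFFFFF,   // user code
                           DESC_CODE_EXEC_READ | DESC_DPL3,
                           DESC_GRAN_4K | DESC_32BIT);
    set_segment_descriptor(&kernel_gdt[4], 0, 0xFFFFF,   // user data
                           DESC_DATA_READ_WRITE | DESC_DPL3,
                           DESC_GRAN_4K | DESC_32BIT);

    struct __attribute__((packed)) { uint16_t limit; uintptr_t base; }
        gdtr = { sizeof(kernel_gdt) - 1, (uintptr_t)kernel_gdt };
    __asm__ volatile ("lgdt %0" :: "m"(gdtr));
}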
During the transition to protected mode (and later to paging), the code being executed must be identity-mapped: virtual address = physical address. Otherwise, the instruction pointer becomes invalid the moment translation changes. This is why boot code is carefully placed in low memory with identity mappings.
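A minimal sketch of such an identity mapping for the low 1 MB on 32-bit x86 with 4 KB pages (the extent and all names are illustrative):

#include <stdint.h>

#define PAGE_PRESENT 0x1u
#define PAGE_RW      0x2u

static uint32_t page_directory[1024]   __attribute__((aligned(4096)));
static uint32_t first_page_table[1024] __attribute__((aligned(4096)));

void identity_map_low_memory(void) {
    for (uint32_t i = 0; i < 256; i++)   // 256 pages * 4 KB = 1 MB
        first_page_table[i] = (i * 0x1000u) | PAGE_PRESENT | PAGE_RW;
    page_directory[0] = (uint32_t)(uintptr_t)first_page_table
                      | PAGE_PRESENT | PAGE_RW;
    // With this in place, enabling CR0.PG does not strand EIP:
    // the executing code's addresses translate to themselves.
}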
When using per-process LDTs, the operating system must manage LDT creation, population, and switching. Let's examine the complete lifecycle.
LDT Lifecycle: allocate and populate the table at process creation, install an LDT descriptor in a GDT slot, load LDTR (LLDT) whenever the process is switched in, and free both the GDT slot and the LDT memory at process exit.
LDT Per-Process Architecture:
┌─────────────────────┐ ┌────────────────────────────────────┐
│ Global GDT │ │ Process A's LDT │
├─────────────────────┤ ├────────────────────────────────────┤
│ [0] Null │ │ [0] Null (reserved) │
│ [1] Kernel Code │ │ [1] User Code: Base=A's code area │
│ [2] Kernel Data │ │ [2] User Data: Base=A's data area │
│ [3] User Code │ │ [3] User Stack: Base=A's stack │
│ [4] User Data │ │ [4] User TLS: Thread-local storage │
│ [5] TSS │ └────────────────────────────────────┘
│ [6] LDT for A ─────────────────────┘
│ [7] LDT for B ──────────────────────┐
│ ... │ ┌─────────────────────────────────────┐
└─────────────────────┘ │ Process B's LDT │
├─────────────────────────────────────┤
│ [0] Null │
│ [1] User Code: Base=B's code area │
│ [2] User Data: Base=B's data area │
│ [3] User Stack: Base=B's stack │
└─────────────────────────────────────┘
// Complete LDT management for per-process isolation

typedef struct {
    segment_descriptor_t entries[LDT_SIZE];
    uint16_t gdt_selector;   // Selector for this LDT's GDT entry
    spinlock_t lock;         // Protect concurrent modifications
} process_ldt_t;

// Create a new LDT for a process
process_ldt_t* create_process_ldt(process_t* proc) {
    // Allocate LDT structure (must be accessible in kernel space)
    process_ldt_t* ldt = kmalloc_aligned(sizeof(process_ldt_t), 8);
    if (!ldt) return NULL;

    memset(ldt->entries, 0, sizeof(ldt->entries));
    spinlock_init(&ldt->lock);

    // Entry 0 is null (required)

    // Entry 1: User code segment
    set_segment_descriptor(&ldt->entries[1],
                           proc->mm->code_start,      // Base
                           proc->mm->code_size - 1,   // Limit
                           DESC_CODE_EXEC_READ | DESC_DPL3,
                           DESC_GRAN_4K | DESC_32BIT);

    // Entry 2: User data segment
    set_segment_descriptor(&ldt->entries[2],
                           proc->mm->data_start,
                           proc->mm->data_size - 1,
                           DESC_DATA_READ_WRITE | DESC_DPL3,
                           DESC_GRAN_4K | DESC_32BIT);

    // Entry 3: User stack (expand-down)
    set_segment_descriptor(&ldt->entries[3],
                           proc->mm->stack_top - STACK_SIZE,
                           STACK_SIZE - 1,
                           DESC_DATA_READ_WRITE | DESC_DPL3,
                           DESC_GRAN_4K | DESC_32BIT);

    // Allocate GDT slot and create LDT descriptor
    int gdt_slot = allocate_gdt_entry();
    if (gdt_slot < 0) {
        kfree(ldt);
        return NULL;
    }

    set_ldt_descriptor(&gdt[gdt_slot],
                       (uint32_t)ldt->entries,
                       sizeof(ldt->entries) - 1);

    ldt->gdt_selector = (gdt_slot << 3) | RPL_KERNEL;
    return ldt;
}

// Switch LDT during context switch
void switch_ldt(process_ldt_t* new_ldt) {
    uint16_t selector = new_ldt ? new_ldt->gdt_selector : 0;
    __asm__ volatile ("lldt %0" :: "r"(selector));
}

// Destroy LDT on process exit
void destroy_process_ldt(process_ldt_t* ldt) {
    if (!ldt) return;

    // Free GDT slot
    int gdt_index = ldt->gdt_selector >> 3;
    free_gdt_entry(gdt_index);

    // Free LDT memory
    kfree(ldt);
}

Modern systems typically use flat segment models (base=0, limit=4GB for all user segments) and rely on paging for per-process isolation. This eliminates the need for per-process LDTs. The GDT can have a single set of user segment descriptors used by all processes, simplifying management.
During a context switch, the operating system may need to switch LDTs and potentially update other segment-related state. This is a critical path that must be fast and correct.
Context Switch Segment Operations: save the outgoing task's segment registers, switch LDTR and CR3 when crossing a process boundary, reload the incoming task's segment registers, update the FS/GS bases used for thread-local storage, and point the TSS's esp0 at the new kernel stack.
Context Switch Sequence:
// Context switch with segment handling

void context_switch(task_t* prev, task_t* next) {
    // Save prev's segment registers
    __asm__ volatile (
        "mov %%ds, %0\n\t"
        "mov %%es, %1\n\t"
        "mov %%fs, %2\n\t"
        "mov %%gs, %3"
        : "=m"(prev->context.ds), "=m"(prev->context.es),
          "=m"(prev->context.fs), "=m"(prev->context.gs)
    );

    // Switch LDT if processes have different LDTs
    if (prev->process != next->process) {
        process_t* next_proc = next->process;
        process_t* prev_proc = prev->process;

        // Different process = different LDT
        if (next_proc->ldt != prev_proc->ldt) {
            uint16_t ldt_sel = next_proc->ldt
                             ? next_proc->ldt->gdt_selector : 0;
            __asm__ volatile ("lldt %0" :: "r"(ldt_sel));
        }

        // Also switch page tables (CR3)
        load_cr3(next_proc->page_directory);
    }

    // Load next's segment registers
    __asm__ volatile (
        "mov %0, %%ds\n\t"
        "mov %1, %%es\n\t"
        "mov %2, %%fs\n\t"
        "mov %3, %%gs"
        :: "r"(next->context.ds), "r"(next->context.es),
           "r"(next->context.fs), "r"(next->context.gs)
    );

    // Update FS/GS bases for thread-local storage (64-bit)
#ifdef __x86_64__
    wrmsrl(MSR_FS_BASE, next->fs_base);
    wrmsrl(MSR_GS_BASE, next->gs_base);
#else
    // 32-bit: Update descriptors in GDT for per-thread FS/GS
    update_gdt_entry(GDT_ENTRY_TLS, next->tls_base, TLS_SIZE);
#endif

    // Switch kernel stack (in TSS)
    tss.esp0 = (uint32_t)next->kernel_stack_top;

    // Perform actual register/stack switch
    switch_context(&prev->context, &next->context);
}

Performance Considerations:

Every segment register load and every LLDT triggers the descriptor fetch and validation sequence described earlier, so the switch path loads them only when the values actually change.
Optimization: Lazy LDT Switching:
If LDTs aren't used (null LDT), skip LLDT:
if (next_proc->ldt != NULL || prev_proc->ldt != NULL) {
// Only switch if either process uses LDT
lldt(next_proc->ldt ? next_proc->ldt->gdt_selector : 0);
}
When switching between threads of the same process, LDT and page table switches can be skipped (they're shared). Only thread-specific state (registers, FS/GS bases for TLS, kernel stack) needs updating. This makes thread switches faster than process switches.
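Expressed as a guard in front of the full context_switch from the listing above (switch_thread_state is a hypothetical helper covering only registers, TLS bases, and the kernel stack):

void switch_tasks(task_t* prev, task_t* next) {
    if (prev->process == next->process) {
        // Same address space: LDTR and CR3 are already correct.
        switch_thread_state(prev, next);
        return;
    }
    context_switch(prev, next);   // full path from the listing above
}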
Descriptor tables are security-critical. If an attacker could modify them, they could bypass all segment-based protection. The OS must ensure tables are protected.
Protection Mechanisms:
1. Ring 0 Modification Only: LGDT and LLDT are privileged instructions, so only kernel code can change which tables the CPU uses.
2. Page-Level Protection: the pages holding the tables are mapped kernel-only, and often read-only even to the kernel (see below).
3. SMEP/SMAP (Modern x86): the kernel cannot be tricked into executing or implicitly accessing attacker-controlled user pages masquerading as descriptor tables.
4. Descriptor Validation: the hardware rejects malformed descriptors with faults, so even a corrupted entry cannot be silently used.
| Protection | How It Helps | Bypass Difficulty |
|---|---|---|
| Privileged LGDT/LLDT | Cannot change table location from Ring 3 | Requires kernel code execution |
| Kernel-only memory | Table pages inaccessible to user mode | Requires kernel memory access |
| Read-only GDT pages | Even kernel cannot accidentally modify | Requires privilege to change page tables |
| SMEP/SMAP | Kernel cannot be tricked into accessing user pages | Requires disabling these features |
| Hardware validation | Invalid descriptors cause faults | Cannot use malformed descriptors |
Attack Scenario: GDT Corruption
Hypothetical attack without protections: an attacker who can write to the GDT's memory installs a descriptor with DPL=3 whose base and limit span kernel memory (or a call gate leading into Ring 0), loads a selector for it from user mode, and then reads, writes, or executes kernel data directly.
Mitigation: keep the GDT in kernel-only, read-only pages so the initial write is impossible, and rely on the privileged-instruction and validation checks above for defense in depth.
Read-Only GDT in Linux:
Linux marks GDT pages read-only and only remaps them writable briefly during modifications:
void update_gdt_entry(int index, ...) {
set_page_rw(gdt_page); // Temporarily make writable
gdt[index] = new_entry; // Modify
set_page_ro(gdt_page); // Restore read-only
}
SGDT leaking GDT location is a known issue. Attackers can use this for ASLR bypass (locating kernel). Mitigations: UMIP (User-Mode Instruction Prevention) blocks SGDT from Ring 3, or hypervisors virtualize SGDT to return fake values.
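Whether the UMIP mitigation is available can be checked from CPUID: leaf 7, subleaf 0 reports UMIP in ECX bit 2 (the kernel then enables it via CR4 bit 11). A small GCC sketch:

#include <stdint.h>
#include <stdio.h>

int main(void) {
    uint32_t eax = 7, ebx = 0, ecx = 0, edx = 0;
    __asm__ volatile ("cpuid"
                      : "+a"(eax), "+b"(ebx), "+c"(ecx), "+d"(edx));
    printf("UMIP %s supported by this CPU\n",
           (ecx & (1u << 2)) ? "is" : "is not");
    return 0;
}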
We've comprehensively explored where segment tables live and how they're managed: the GDT/LDT split, the GDTR and LDTR registers that anchor them, selector-to-descriptor lookup, boot-time GDT creation, per-process LDT management, context switch handling, and descriptor table protection.
Module Complete:
With this page, we've completed our deep dive into segment tables, from the fields of individual entries to how whole tables are located, switched, and defended.
This comprehensive knowledge enables you to understand how operating systems implement memory protection at the segment level, which historically was (and partially still is) the foundation of protected-mode computing.
Congratulations! You've mastered segment tables—from individual entry fields to system-wide table management. This knowledge is foundational for operating system development, security research, and understanding modern memory protection evolution from segmentation to paging.