Modern operating systems face a remarkable challenge: users and applications expect to read files, write files, list directories, and navigate file hierarchies—all without caring whether their data resides on an ext4 partition, an NTFS volume, an NFS network share, a FAT-formatted USB drive, or even a pseudo-filesystem like /proc. The system call open("/home/user/document.txt", O_RDONLY) should work identically regardless of the underlying storage technology.
This creates an engineering problem of significant complexity. Each file system has radically different on-disk structures, metadata formats, allocation strategies, and performance characteristics. ext4 uses inodes and block groups; NTFS uses the Master File Table; FAT uses the File Allocation Table; ZFS uses a transactional copy-on-write model. How can an operating system provide a uniform interface to applications while simultaneously supporting this diverse ecosystem of storage technologies?
The answer is the Virtual File System (VFS)—one of the most elegant and important abstraction layers in operating system design.
By the end of this page, you will understand what the VFS is, why it was created, how it provides uniform file system access, its position in the kernel architecture, and why mastering VFS concepts is essential for systems programming. You'll see how this single abstraction layer enables everything from mounting USB drives to accessing remote servers to introspecting kernel state.
The Virtual File System (VFS) is a software abstraction layer within the kernel that provides a uniform interface to the file system namespace, regardless of the underlying file system implementation. It is the kernel subsystem that interprets system calls like open(), read(), write(), close(), stat(), readdir(), and dispatches them to the appropriate file system driver.
Key Insight: The VFS is not a file system itself. It stores no data on disk. Instead, it defines a contract—a set of data structures and function pointers—that all file systems must implement. When an application calls read(), the VFS translates that call into the corresponding operation for ext4, XFS, NFS, or whatever file system actually holds the data.
The VFS uses an object-oriented design pattern implemented in C. Each VFS object (superblock, inode, dentry, file) contains a pointer to an operations structure—essentially a vtable of function pointers. Different file systems provide their own implementations of these operations. This is classic polymorphism: the VFS calls inode->i_op->lookup() and gets ext4's lookup function, NTFS's lookup function, or NFS's lookup function depending on the inode's origin.
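To make this concrete, here is a minimal sketch of the pattern in plain C. The names (my_inode, my_inode_operations, vfs_lookup) are simplified stand-ins for illustration, not the kernel's actual definitions:

// Simplified sketch of VFS-style polymorphism in C (illustrative only).
struct my_inode;   // forward declaration

struct my_inode_operations {
    // Each file system supplies its own implementations of these.
    struct my_inode *(*lookup)(struct my_inode *dir, const char *name);
    int              (*permission)(struct my_inode *inode, int mask);
};

struct my_inode {
    unsigned long ino;
    const struct my_inode_operations *i_op;   // the "vtable"
};

// Generic VFS code: it has no idea which file system it is talking to.
struct my_inode *vfs_lookup(struct my_inode *dir, const char *name)
{
    return dir->i_op->lookup(dir, name);   // dispatches to ext4, NFS, ...
}

An ext4 inode would carry a pointer to ext4's operations table and an NFS inode to NFS's; the call site in vfs_lookup() never changes.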
Formal Definition:
The Virtual File System is the kernel subsystem that:
- Defines the common objects and operations (superblocks, inodes, dentries, files) that every file system must implement
- Translates file-related system calls into calls on the appropriate file system driver
- Maintains the unified namespace, mount points, and the dentry and inode caches
This design achieves separation of concerns: application programmers write portable code against a stable API, while file system developers implement specific storage formats without modifying the kernel's core I/O path.
| Term | Definition | Purpose |
|---|---|---|
| VFS | Virtual File System | Kernel abstraction layer for uniform file access |
| File System | On-disk format for data and metadata | Defines how data is physically organized on storage |
| File System Driver | Kernel module implementing VFS operations | Translates VFS calls to on-disk operations |
| Mount Point | Directory where a file system is attached | Integrates file systems into the namespace |
| Namespace | The unified directory tree visible to processes | Single hierarchy starting from root (/) |
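As a concrete illustration of the Mount Point and Namespace rows above, the snippet below attaches and detaches a file system with the Linux mount(2) and umount(2) system calls. The device /dev/sdb1 and target /mnt/usb are assumptions for the example, and the calls require root privileges:

// Attach a FAT-formatted device at /mnt/usb, then detach it. Run as root.
#include <stdio.h>
#include <sys/mount.h>

int main(void)
{
    // source device, target directory, file system type, flags, fs-specific options
    if (mount("/dev/sdb1", "/mnt/usb", "vfat", MS_NOATIME, "") != 0) {
        perror("mount");
        return 1;
    }
    // From here on, paths under /mnt/usb resolve into the FAT file system.
    if (umount("/mnt/usb") != 0) {
        perror("umount");
        return 1;
    }
    return 0;
}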
The VFS concept emerged from practical necessity in the 1980s as operating systems needed to support multiple file system types simultaneously.
The Problem: Early Unix systems had a single, hardcoded file system (often the original Unix File System or variations). When Sun Microsystems developed NFS (Network File System) in 1984, they faced a challenge: how could the kernel support both local disk access and remote network file access without duplicating the entire file I/O subsystem?
Sun's Solution: Sun engineers created what they called the vnode interface (virtual node interface)—the first VFS implementation. The vnode abstraction represented any file-like entity, whether local or remote, with a common set of operations. This allowed NFS to plug into the kernel alongside the local file system.
The Vnode Model:
- Every file-like entity, local or remote, is represented by a vnode.
- Each vnode exposes a common set of operations (read, write, lookup, and so on) that the kernel invokes without knowing which file system services them.

While this content focuses primarily on Unix/Linux VFS, the concept appears in all major operating systems. Windows has the Installable File System (IFS) architecture with filter drivers. macOS has its VFS layer inherited from BSD. Each uses the same fundamental idea: abstract the interface, allow pluggable implementations.
The VFS embodies a fundamental principle in systems design: abstraction hides complexity behind stable interfaces. Let's examine what this means concretely.
Without VFS:
// Hypothetical world without VFS abstraction
if (file_system_type == FS_EXT4) {
ext4_open(path, flags);
} else if (file_system_type == FS_NTFS) {
ntfs_open(path, flags);
} else if (file_system_type == FS_NFS) {
nfs_open(path, flags);
} else if (file_system_type == FS_ZFS) {
zfs_open(path, flags);
}
// ... repeated for EVERY operation, in EVERY application
This approach is unmaintainable. Every application would need knowledge of every file system. Adding a new file system would require modifying every program.
With VFS:
// Application code — identical regardless of file system
int fd = open("/path/to/file", O_RDONLY);
read(fd, buffer, size);
close(fd);
The application is completely isolated from file system details. The kernel's VFS layer handles dispatch to the appropriate driver.
Because shell scripts and utilities like ls, cp, cat, and find use the same VFS system calls, they work transparently across all mounted file systems. You can cp a file from an NFS share to a local ext4 partition to a FUSE-mounted cloud storage, all with the same command. This uniformity is enabled entirely by VFS.
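The same uniformity shows up in a plain copy loop. The sketch below (a minimal example, not a full cp replacement) contains no file-system-specific logic, so it copies between any two mounted file systems, whether local, network, or FUSE:

#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

int main(int argc, char *argv[])
{
    if (argc != 3) {
        fprintf(stderr, "usage: %s <src> <dst>\n", argv[0]);
        return 1;
    }
    int in  = open(argv[1], O_RDONLY);
    int out = open(argv[2], O_WRONLY | O_CREAT | O_TRUNC, 0644);
    if (in < 0 || out < 0) { perror("open"); return 1; }

    char buf[65536];
    ssize_t n;
    while ((n = read(in, buf, sizeof(buf))) > 0) {
        if (write(out, buf, (size_t)n) != n) { perror("write"); return 1; }
    }
    if (n < 0) perror("read");

    close(in);
    close(out);
    return n < 0 ? 1 : 0;
}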
The VFS sits between user-space applications and concrete file system implementations, serving as the kernel's file system dispatcher and cache manager. Understanding its position in the software stack is crucial.
Layer Stack (Top to Bottom):
1. Applications (library calls such as fopen, fread, fprintf)
2. C library and system call interface (buffered library calls become system calls, e.g. the write() syscall from fprintf)
3. VFS layer (dispatch, caching, pathname resolution)
4. File system drivers (ext4, XFS, NFS, FUSE, procfs, ...)
5. Backends: block layer, network stack, user-space daemons, or in-kernel data structures

Key Observations:
Single Entry Point: All file operations, regardless of destination, enter through the same system call interface and pass through VFS.
Uniform Dispatch: VFS uses function pointers in the inode, dentry, and file structures to route operations to the correct driver.
Diverse Backends: Notice how some file systems go to the block layer (ext4, XFS), others to the network stack (NFS), yet others to user-space (FUSE), and pseudo file systems access kernel data directly (procfs).
VFS as Unifier: Despite these vastly different backends, the VFS makes them all look the same to applications.
The VFS layer performs several critical functions that go beyond simple dispatch. Understanding these responsibilities reveals why VFS is so central to kernel operation.
- System call implementation: The VFS implements open(), read(), write(), close(), stat(), readdir(), mkdir(), unlink(), rename(), chmod(), chown(), and dozens of other file-related system calls. It validates arguments, manages file descriptors, and routes to file system drivers (a simplified dispatch sketch follows this list).
- Pathname lookup: Given a path like /home/user/docs/file.txt, VFS performs pathname lookup, traversing the directory tree component by component, crossing mount points, following symbolic links, and checking permissions at each step. This is implemented in the namei (name-to-inode) subsystem.
- Mount management: mount and umount are implemented as VFS operations.
- Dentry and inode caching: Resolving /usr/bin/ls requires examining inodes for /, usr, bin, and ls. The dcache avoids disk I/O for frequently accessed paths.
- Open file management: The VFS maintains struct file objects that represent open files. This includes tracking the current file position (offset), access mode (read/write), and reference counting for shared file descriptors across fork().

The dentry and inode caches are performance-critical. On a busy server, these caches can absorb the majority of file system operations, serving lookups and metadata queries directly from memory without any disk I/O. The VFS layer's caching might handle 99% of file operations on a warm system.
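As a rough sketch of the dispatch behind these responsibilities, the illustrative C below (simplified stand-in structures such as my_file, not the real kernel source) shows a read path that validates the open-file state and then forwards to whichever driver supplied the file's operations table:

#include <stddef.h>
#include <sys/types.h>

struct my_file;

struct my_file_operations {
    ssize_t (*read)(struct my_file *f, char *buf, size_t count, off_t *pos);
};

struct my_file {
    const struct my_file_operations *f_op;  // set by the owning file system
    off_t                            f_pos; // current file position
    int                              readable;
};

// Generic "VFS" entry point: validate, then dispatch through the vtable.
ssize_t my_vfs_read(struct my_file *f, char *buf, size_t count)
{
    if (!f->readable || !f->f_op || !f->f_op->read)
        return -1;                                  // e.g. bad descriptor state
    return f->f_op->read(f, buf, count, &f->f_pos); // driver updates f_pos
}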
One of VFS's most complex and performance-critical tasks is pathname resolution—converting a path string like /home/alice/projects/kernel/vfs.c into an in-memory inode structure. This process involves multiple steps, caches, and potential mount point crossings.
The namei Algorithm (name-to-inode):
Start Point: For absolute paths, start at the root dentry (/). For relative paths, start at the process's current working directory.
Component Iteration: Split the path by / and process each component (home, alice, projects, etc.) left to right.
For Each Component:
- Check the dentry cache first; on a miss, call the file system's lookup operation (e.g., ext4_lookup) to read the directory and find the entry.
- Verify execute permission on the directory being traversed.
- Handle special entries: . (current), .. (parent), and symbolic links (follow or not based on flags).
- Cross mount points when the component is a mount point.

Final Component: The last component may be the target file/directory, or for operations like open() with O_CREAT, it may not exist yet.
Return: Return the final dentry and inode, or an error if resolution fails.
function resolve_path(path, flags):
    // Determine starting point
    if path.starts_with("/"):
        current = root_dentry
    else:
        current = process.cwd

    // Split and iterate through path components
    components = path.split("/").filter(non_empty)

    for each component in components:
        // Check dentry cache first
        cached = dcache_lookup(current, component)
        if cached:
            next = cached
        else:
            // Cache miss: ask file system to look up
            next = current.inode.ops.lookup(current, component)
            if not next:
                return -ENOENT   // No such file or directory
            dcache_add(current, component, next)

        // Permission check: need execute to traverse directory
        if not has_exec_permission(current.inode):
            return -EACCES   // Permission denied

        // Handle symbolic links
        if next.inode.is_symlink and should_follow_symlink(flags):
            link_target = next.inode.ops.readlink(next)
            next = resolve_path(link_target, flags)   // Recursive!

        // Handle mount points: if mounted here, switch to mount root
        if is_mount_point(next):
            next = get_mounted_root(next)

        current = next

    return current   // Final dentry/inode

Symbolic links can create loops: a -> b, b -> a. To prevent infinite recursion, the kernel limits symlink traversal (typically 40 consecutive symlinks in Linux). Exceeding this limit returns ELOOP. Similarly, there are limits on total path length (PATH_MAX, typically 4096 bytes) and individual component length (NAME_MAX, typically 255 bytes).
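The ELOOP behavior is easy to observe from user space. This small sketch (it creates and removes two scratch symlinks, loop_a and loop_b, in the current directory) builds a two-link cycle and tries to open it:

#include <errno.h>
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

int main(void)
{
    // Build a symlink cycle: loop_a -> loop_b -> loop_a
    symlink("loop_b", "loop_a");
    symlink("loop_a", "loop_b");

    int fd = open("loop_a", O_RDONLY);   // resolution keeps following links
    if (fd < 0 && errno == ELOOP)
        printf("open failed with ELOOP, as expected\n");
    else if (fd >= 0)
        close(fd);

    unlink("loop_a");
    unlink("loop_b");
    return 0;
}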
Performance Insight:
For a path like /usr/local/bin/python, the kernel must resolve 5 components. On a cold cache, this might require 5 disk reads. With a warm dcache, all 5 lookups come from memory in nanoseconds. This is why the dcache is sized generously and uses efficient hash-table lookup—path resolution happens millions of times per second on busy systems.
Mount Point Traversal:
Mount points are transparent to path resolution. If /mnt/usb has a FAT file system mounted, resolving /mnt/usb/document.txt automatically crosses from whatever file system /mnt is on (probably ext4) into the FAT file system. The VFS maintains a mount hash table that maps dentries to mount information, enabling O(1) mount point detection.
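As a hedged sketch of the idea (purely illustrative names and structures, not the kernel's), a mount hash table can be as simple as hashing the dentry pointer, walking a short chain, and returning the mounted file system's root when the dentry is a mount point:

#include <stddef.h>
#include <stdint.h>

struct my_dentry;   // opaque here

struct my_mount {
    struct my_dentry *mountpoint;   // dentry the file system is mounted on
    struct my_dentry *mounted_root; // root dentry of the mounted file system
    struct my_mount  *next;         // hash-chain link
};

#define MOUNT_HASH_SIZE 256
static struct my_mount *mount_hash[MOUNT_HASH_SIZE];

static unsigned int hash_dentry(const struct my_dentry *d)
{
    return (unsigned int)(((uintptr_t)d >> 4) % MOUNT_HASH_SIZE);
}

// O(1) on average: used by path resolution when crossing a mount point.
struct my_dentry *lookup_mounted_root(struct my_dentry *d)
{
    for (struct my_mount *m = mount_hash[hash_dentry(d)]; m; m = m->next)
        if (m->mountpoint == d)
            return m->mounted_root;
    return NULL;   // not a mount point
}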
The VFS abstraction is so powerful that many things that aren't traditional "file systems" are implemented as file systems. These pseudo-filesystems or virtual filesystems use the VFS interface to expose kernel data, device interfaces, and computed information as files and directories.
Philosophy: In Unix, "everything is a file." The VFS makes this philosophy implementable in practice.
| File System | Mount Point | Purpose | Key Characteristics |
|---|---|---|---|
| procfs | /proc | Process and kernel information | Dynamic content generated on read; no persistent storage |
| sysfs | /sys | Device model and kernel objects | Exposes kobject hierarchy; used by udev for device management |
| tmpfs | /tmp, /run | In-memory temporary storage | RAM-backed; fast; cleared on reboot; can swap to disk |
| devtmpfs | /dev | Device nodes | Automatically creates device nodes when drivers load |
| cgroup | /sys/fs/cgroup | Resource control groups | Hierarchical resource limits for containers |
| debugfs | /sys/kernel/debug | Debugging interface | Developers expose debugging info; not for production |
| securityfs | /sys/kernel/security | Security modules | LSM (SELinux, AppArmor) interfaces |
| hugetlbfs | various | Huge page allocation | Allocates huge pages for applications |
Why Implement as File Systems?
Universal Interface: Any tool that reads files can inspect kernel state. cat /proc/cpuinfo works without special programs; a short C example follows this list.
Permissions: Standard Unix permissions apply. chmod 600 /proc/self/status restricts who can read process info.
Composability: Shell pipelines, scripting, existing tools all work: grep MemFree /proc/meminfo | awk '{print $2}'.
No New APIs: No need for special system calls or ioctls. Read and write are sufficient.
Discoverability: Users can ls directories to see what's available.
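To make the universal-interface point concrete, the short program below reads /proc/meminfo using nothing but ordinary open/read/close calls; procfs generates the content at read time, yet the application cannot tell it apart from a regular file (Linux-specific, since it assumes /proc is mounted):

#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

int main(void)
{
    // /proc/meminfo has no on-disk backing; procfs generates it on each read.
    int fd = open("/proc/meminfo", O_RDONLY);
    if (fd < 0) {
        perror("open /proc/meminfo");
        return 1;
    }

    char buf[4096];
    ssize_t n;
    while ((n = read(fd, buf, sizeof(buf))) > 0)
        fwrite(buf, 1, (size_t)n, stdout);

    close(fd);
    return 0;
}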
The ultimate expression of VFS flexibility is FUSE (Filesystem in Userspace). FUSE is a VFS driver that forwards operations to a user-space daemon. This enables implementing file systems in Python, Go, or any language with a FUSE library. sshfs (mount remote SSH directories), s3fs (mount Amazon S3 buckets), and ntfs-3g (full NTFS support) all use FUSE. VFS makes heterogeneous storage appear uniform even when the driver runs outside the kernel.
We've explored the Virtual File System abstraction layer in depth. Let's consolidate what we've learned:
What's Next:
Now that we understand what VFS is and why it exists, we'll examine the common interface it presents—the specific system calls, data structures, and operations that form the VFS contract. This will show you exactly what a file system must implement to plug into the VFS layer.
You now understand the Virtual File System abstraction: what it is, why it was created, how it fits into the kernel architecture, and how it enables the incredible diversity of file systems in modern operating systems. This foundation prepares you to dive into VFS internals.