Computer NetworksSocket Programming

Socket Programming

LevelIntermediate

Duration90 mins

TopicSocket Programming

1 / 5

Socket Basics

The Universal Network Interface

Every time you load a webpage, send an email, stream a video, or play an online game, your application is communicating through sockets. Sockets are the fundamental building blocks of network communication—the interface where applications meet the network stack.

Without sockets, applications would need to understand the intricate details of packet construction, routing, error correction, and flow control. With sockets, applications simply read and write data as if communicating through a file, while the operating system handles the complexity of network protocols underneath.

Understanding sockets is essential for any developer who builds networked applications, debugs network issues, or designs distributed systems. Whether you're implementing a simple client-server application or architecting a high-performance microservices platform, socket knowledge forms the foundation of your work.

What You Will Learn

By the end of this page, you will understand what sockets are, why they exist, how they abstract network complexity, the different types of sockets available, socket addressing mechanisms, and the complete lifecycle of socket operations. This knowledge prepares you for practical socket programming in any language.

What is a Socket?

A socket is an endpoint for communication between two machines. More precisely, it's a software abstraction that represents one end of a two-way communication link between two programs running on a network. Think of a socket as a virtual communication port through which data can be sent and received.

The term 'socket' comes from an analogy to electrical sockets—just as you plug an electrical device into a wall socket to receive power, an application connects to a network socket to receive and transmit data. This powerful abstraction has been central to networked computing since its introduction in BSD Unix in the early 1980s.

The socket as an abstraction:

Without sockets, an application wanting to send data over a network would need to:

Construct properly formatted packets with headers and checksums
Manage connection state and sequence numbers
Handle retransmissions and acknowledgments
Deal with flow control and congestion
Manage fragmentation and reassembly
Interface directly with network hardware

Sockets hide all this complexity. To an application, a socket appears as a simple communication channel—similar to a file descriptor—through which data flows.

The Key Insight

A socket is identified by the combination of an IP address and a port number. This (IP address, port) pair uniquely identifies a communication endpoint on the Internet. When combined with the remote endpoint's socket address, you have a complete communication channel.

Socket vs Connection:

It's important to distinguish between a socket and a connection:

Socket: A single endpoint—one IP address and one port on one machine
Connection: A complete communication path between two sockets

A 5-tuple uniquely identifies a TCP connection:

Protocol (TCP or UDP)
Local IP address
Local port number
Remote IP address
Remote port number

This distinction matters because a single socket (e.g., a server listening on port 80) can be part of thousands of simultaneous connections—each with a different remote IP/port combination.

Socket Identification Components
Component	Description	Example
Protocol	Transport protocol being used	TCP, UDP, SCTP
Local IP Address	IP address of the local machine	192.168.1.100 or 2001:db8::1
Local Port	Port number on the local machine	8080, 443, 54321
Remote IP Address	IP address of the remote machine	93.184.216.34
Remote Port	Port number on the remote machine	80, 443, 22

The Socket Abstraction Layer

Sockets provide a clean separation between applications and the underlying network protocols. This separation is fundamental to the design of modern operating systems and networks.

The socket abstraction sits between the application layer and the transport layer in the network stack. Applications interact with sockets through a well-defined Socket API (Application Programming Interface), while the operating system kernel handles the complexities of the transport, network, and data link layers.

The layered architecture:

┌─────────────────────────────────────────┐
│           Application Layer             │
│    (Your code: HTTP client, chat app)   │
├─────────────────────────────────────────┤
│           Socket API                    │
│  (socket, bind, listen, connect, etc.)  │
├─────────────────────────────────────────┤
│           Transport Layer               │
│           (TCP, UDP - kernel)           │
├─────────────────────────────────────────┤
│           Network Layer                 │
│           (IP - kernel)                 │
├─────────────────────────────────────────┤
│           Data Link Layer               │
│       (Ethernet driver - kernel)        │
└─────────────────────────────────────────┘

This architecture provides several critical benefits:

Benefits of the Socket Abstraction

•Protocol Independence — Applications can switch between TCP and UDP (or other protocols) with minimal code changes. The socket interface remains largely the same.
•Hardware Abstraction — Applications don't need to know whether data travels over Ethernet, WiFi, or 4G. The socket interface is the same regardless of physical medium.
•OS Protection — The kernel maintains exclusive control over network hardware and protocol state. Applications cannot corrupt network infrastructure or interfere with other applications' connections.
•Resource Management — The OS efficiently manages socket resources, buffer allocation, and connection tables across all applications on the system.
•Portability — The BSD socket API has been implemented on virtually every operating system. Code written using sockets can run on Linux, Windows, macOS, and many embedded systems with minimal modification.

Historical Significance

The socket API was developed as part of BSD Unix in 1983. Its elegant abstraction has proven so successful that it remains the standard network programming interface over 40 years later, largely unchanged. The design choices made then continue to influence how billions of devices communicate today.

Socket Types and Domains

When creating a socket, you must specify two fundamental parameters: the socket domain (also called address family) and the socket type. These determine how the socket addresses endpoints and what communication semantics it provides.

Socket Domains (Address Families):

The domain specifies the protocol family used for communication. The most important domains are:

Common Socket Domains
Domain	Constant	Description	Use Case
IPv4	AF_INET	Internet Protocol version 4	Standard Internet communication
IPv6	AF_INET6	Internet Protocol version 6	Modern Internet with larger address space
Unix	AF_UNIX / AF_LOCAL	Local inter-process communication	Fast IPC on same machine
Bluetooth	AF_BLUETOOTH	Bluetooth communication	Wireless device communication
Packet	AF_PACKET	Low-level packet interface	Packet capture, custom protocols

Socket Types:

The type specifies the communication semantics—how data flows through the socket:

Socket Types and Their Characteristics
Type	Constant	Protocol	Characteristics
Stream	SOCK_STREAM	TCP	Reliable, ordered, connection-oriented byte streams
Datagram	SOCK_DGRAM	UDP	Unreliable, unordered, connectionless messages
Raw	SOCK_RAW	IP/Custom	Direct access to IP layer for custom protocols
Seq. Packet	SOCK_SEQPACKET	SCTP	Reliable, ordered, connection-oriented messages

Stream Sockets (SOCK_STREAM)

•Provides reliable, bidirectional byte stream
•Data arrives in order, no duplicates
•Connection must be established first
•Uses TCP for AF_INET/AF_INET6
•Flow control and congestion control built-in
•Best for: Web, email, file transfer

Datagram Sockets (SOCK_DGRAM)

•Sends/receives discrete messages (datagrams)
•No guaranteed delivery or ordering
•No connection establishment required
•Uses UDP for AF_INET/AF_INET6
•Lower overhead, lower latency
•Best for: Gaming, streaming, DNS

Raw Sockets Require Privileges

Raw sockets (SOCK_RAW) provide direct access to the IP layer, bypassing TCP/UDP. This allows custom protocol implementation and packet capture, but requires root/administrator privileges due to security implications. Misuse can enable packet spoofing and network attacks.

Socket Addressing

Every socket needs an address that identifies where it lives in the network. Socket addresses combine protocol-specific addressing information into structures that the socket API can use.

The sockaddr Structure:

The socket API uses a generic sockaddr structure that can represent addresses from any protocol family. Protocol-specific structures (like sockaddr_in for IPv4) extend this base structure:

// Generic socket address structure
struct sockaddr {
    sa_family_t sa_family;    // Address family (AF_INET, AF_INET6, etc.)
    char        sa_data[14];  // Protocol-specific address data
};

// IPv4 socket address structure
struct sockaddr_in {
    sa_family_t    sin_family;  // AF_INET
    in_port_t      sin_port;    // Port number (network byte order)
    struct in_addr sin_addr;    // IPv4 address
    char           sin_zero[8]; // Padding
};

// IPv6 socket address structure  
struct sockaddr_in6 {
    sa_family_t     sin6_family;   // AF_INET6
    in_port_t       sin6_port;     // Port number
    uint32_t        sin6_flowinfo; // Flow information
    struct in6_addr sin6_addr;     // IPv6 address
    uint32_t        sin6_scope_id; // Scope ID
};

Network Byte Order

Port numbers and IP addresses in socket structures must be in network byte order (big-endian). Use htons() (host-to-network short) for ports and inet_addr() or inet_pton() for IP addresses. Forgetting byte order conversion is one of the most common socket programming bugs!

Port Numbers:

Ports are 16-bit numbers (0-65535) that identify specific applications or services on a host:

Port Range	Category	Description
0-1023	Well-Known Ports	Reserved for system services (HTTP=80, HTTPS=443, SSH=22)
1024-49151	Registered Ports	Assigned by IANA for specific applications
49152-65535	Dynamic/Ephemeral	OS assigns automatically for client connections

Special Addresses:

INADDR_ANY (0.0.0.0) — Binds to all available network interfaces
INADDR_LOOPBACK (127.0.0.1) — Binds to loopback interface only (local connections)
INADDR_BROADCAST (255.255.255.255) — For broadcast transmissions

For IPv6:

in6addr_any (::) — Equivalent to INADDR_ANY
in6addr_loopback (::1) — IPv6 loopback address

Address Resolution Considerations

•DNS Resolution — Convert hostnames to IP addresses using getaddrinfo(), which handles both IPv4 and IPv6
•Protocol Agnostic Code — Use getaddrinfo() instead of gethostbyname() for IPv6 compatibility
•Dual-Stack Sockets — IPv6 sockets can handle both IPv6 and IPv4 (mapped) addresses when configured
•Binding Considerations — Binding to INADDR_ANY exposes the service on all interfaces; consider security implications

The Socket Lifecycle

Sockets pass through a well-defined lifecycle from creation to destruction. Understanding this lifecycle is essential for correct socket programming. The lifecycle differs slightly between connection-oriented (TCP) and connectionless (UDP) sockets.

Core Socket Operations:

The socket API provides a set of fundamental operations that control the socket lifecycle:

Socket API Functions
Function	Purpose	Used By
socket()	Create a new socket	Client and Server
bind()	Assign local address to socket	Server (required), Client (optional)
listen()	Mark socket as passive (accepting connections)	Server only
accept()	Accept incoming connection, create new socket	Server only
connect()	Establish connection to remote socket	Client (TCP), Optional (UDP)
send() / write()	Transmit data through socket	Client and Server
recv() / read()	Receive data from socket	Client and Server
close()	Close socket and release resources	Client and Server
shutdown()	Gracefully close one or both directions	Client and Server

TCP Server Lifecycle:

  socket()     Create a socket
     ↓
  bind()       Assign port (e.g., 8080)
     ↓
  listen()     Begin accepting connections
     ↓
  accept()  ←─ Block until client connects
     ↓
[new socket]   Dedicated socket for this client
     ↓
  recv/send()  Communicate with client
     ↓
  close()      Close client connection
     ↓
  [loop back to accept() for next client]

TCP Client Lifecycle:

  socket()     Create a socket
     ↓
  connect()    Connect to server (initiates 3-way handshake)
     ↓
  send/recv()  Communicate with server
     ↓
  close()      Close connection

The Accept Pattern

Notice that accept() creates a NEW socket for each client. The original listening socket remains open to accept more connections. This is how a single server process can handle multiple clients—each gets their own dedicated socket while the listening socket continues its job.

UDP Lifecycle (Simpler):

UDP sockets don't require connection establishment:

  socket()        Create UDP socket
     ↓
  bind()          (Server) Assign local port
     ↓
  sendto/recvfrom()  Send/receive datagrams with explicit addresses
     ↓
  close()         Close socket

Note: UDP can use connect() to set a default destination, enabling use of send()/recv() instead of sendto()/recvfrom(). This doesn't create a true connection—it's just a convenience.

Socket Options and Configuration

Sockets are highly configurable through socket options. These options control everything from buffer sizes to keep-alive behavior to timeout values. Mastering socket options is essential for building production-quality networked applications.

Socket options are set and retrieved using:

int setsockopt(int sockfd, int level, int optname, 
               const void *optval, socklen_t optlen);
int getsockopt(int sockfd, int level, int optname,
               void *optval, socklen_t *optlen);

The level parameter specifies which protocol layer the option applies to:

SOL_SOCKET — Socket layer options
IPPROTO_TCP — TCP-specific options
IPPROTO_IP — IP layer options

Essential Socket Options
Option	Level	Type	Purpose
SO_REUSEADDR	SOL_SOCKET	int	Allow binding to address already in use (TIME_WAIT)
SO_REUSEPORT	SOL_SOCKET	int	Allow multiple sockets to bind to same port
SO_KEEPALIVE	SOL_SOCKET	int	Enable TCP keep-alive probes
SO_RCVBUF	SOL_SOCKET	int	Receive buffer size
SO_SNDBUF	SOL_SOCKET	int	Send buffer size
SO_RCVTIMEO	SOL_SOCKET	timeval	Receive timeout
SO_SNDTIMEO	SOL_SOCKET	timeval	Send timeout
SO_LINGER	SOL_SOCKET	linger	Behavior on close with pending data
TCP_NODELAY	IPPROTO_TCP	int	Disable Nagle's algorithm
TCP_QUICKACK	IPPROTO_TCP	int	Disable delayed acknowledgments

SO_REUSEADDR is Almost Always Needed

When a TCP server closes, the socket enters TIME_WAIT state for 2×MSL (typically 60 seconds). During this time, you cannot bind a new socket to the same port without SO_REUSEADDR. Always set this option for server sockets to enable quick restarts—it's the first thing experienced developers do.

Common Configuration Patterns

•Low-Latency Applications: Set TCP_NODELAY=1 and TCP_QUICKACK=1 to disable Nagle's algorithm and delayed ACKs
•High-Throughput: Increase SO_RCVBUF and SO_SNDBUF (kernel may auto-tune, check first)
•Connection Pooling: Enable SO_KEEPALIVE with appropriate TCP_KEEPIDLE/TCP_KEEPINTVL values
•Graceful Shutdown: Configure SO_LINGER to control behavior when closing with pending data
•Load Balancing: Use SO_REUSEPORT to allow multiple processes to accept on the same port

Socket Buffers and Data Flow

Every socket has two kernel buffers: a send buffer and a receive buffer. Understanding how data flows through these buffers is crucial for writing efficient and correct network code.

Data Flow Model:

Application                    Kernel                     Network
┌──────────┐              ┌──────────────┐
│  send()  │ ──→ copy ──→ │ Send Buffer  │ ──→ TCP/IP ──→ [Network]
└──────────┘              └──────────────┘

┌──────────┐              ┌──────────────┐
│  recv()  │ ←── copy ←── │ Recv Buffer  │ ←── TCP/IP ←── [Network]
└──────────┘              └──────────────┘

Key Points:

send() is a copy operation: When you call send(), data is copied from your application buffer to the kernel's send buffer. send() returns when the copy is complete—not when data reaches the destination.
recv() is also a copy operation: recv() copies data from the kernel's receive buffer to your application buffer. If the receive buffer is empty, recv() blocks (in blocking mode).
Buffers are finite: When buffers fill, operations block (blocking mode) or return errors (non-blocking mode).

send() Success Doesn't Mean Delivery

A successful send() only means data was copied to the kernel buffer. It says nothing about whether the data reached the network, was received by the remote host, or was processed by the remote application. For guaranteed delivery confirmation, you need application-level acknowledgments.

Buffer Sizing Considerations:

Buffer Issue	Symptoms	Solution
Receive buffer too small	Data loss (UDP), slow throughput (TCP)	Increase SO_RCVBUF
Send buffer too small	Slow writes, high CPU (many small sends)	Increase SO_SNDBUF
Buffers too large	Wasted memory, increased latency (bufferbloat)	Use appropriate sizes

Modern kernels often auto-tune buffer sizes based on connection characteristics, but manual tuning may be needed for specialized applications.

The Bandwidth-Delay Product:

For optimal TCP throughput, the buffer should be at least as large as the bandwidth-delay product (BDP):

BDP = Bandwidth × Round-Trip Time

Example:
- 1 Gbps connection with 50ms RTT
- BDP = 1,000,000,000 bits/sec × 0.050 sec = 50,000,000 bits = 6.25 MB

This tells us the buffer needs to hold at least 6.25 MB of data to fully utilize a 1 Gbps link with 50ms RTT.

Summary: Socket Fundamentals

We've established a comprehensive foundation for understanding network sockets. Let's consolidate the essential concepts:

Key Takeaways

•Sockets are communication endpoints — They provide the interface between applications and the network stack, identified by (IP address, port) pairs.
•Socket abstraction hides complexity — Applications read and write data without managing packets, routing, or protocol state machines.
•Domain and type determine behavior — AF_INET/AF_INET6 for Internet, SOCK_STREAM for TCP, SOCK_DGRAM for UDP.
•Addressing requires proper structure and byte order — Use sockaddr_in/sockaddr_in6, always convert to network byte order.
•Lifecycle differs by protocol — TCP requires connection establishment; UDP is connectionless.
•Socket options enable fine-tuning — SO_REUSEADDR, TCP_NODELAY, buffer sizes, and timeouts are commonly configured.
•Buffers bridge applications and kernel — Understanding buffer behavior is crucial for performance and correctness.

What's Next:

With socket fundamentals established, we'll dive deep into TCP sockets—the workhorse of reliable network communication. We'll explore connection establishment, the TCP state machine from a programming perspective, and the patterns used in production TCP applications.

Page Complete

You now understand what sockets are, how they abstract network complexity, the socket domains and types available, addressing mechanisms, the socket lifecycle, configuration options, and buffer behavior. This foundation prepares you for practical socket programming with TCP and UDP.

1 / 5

Loading learning content...

Computer NetworksSocket Programming

Socket Programming

LevelIntermediate

Duration90 mins

TopicSocket Programming

1 / 5

Socket Basics

The Universal Network Interface

What You Will Learn

What is a Socket?

The socket as an abstraction:

Without sockets, an application wanting to send data over a network would need to:

Construct properly formatted packets with headers and checksums
Manage connection state and sequence numbers
Handle retransmissions and acknowledgments
Deal with flow control and congestion
Manage fragmentation and reassembly
Interface directly with network hardware

Sockets hide all this complexity. To an application, a socket appears as a simple communication channel—similar to a file descriptor—through which data flows.

The Key Insight

Socket vs Connection:

It's important to distinguish between a socket and a connection:

Socket: A single endpoint—one IP address and one port on one machine
Connection: A complete communication path between two sockets

A 5-tuple uniquely identifies a TCP connection:

Protocol (TCP or UDP)
Local IP address
Local port number
Remote IP address
Remote port number

This distinction matters because a single socket (e.g., a server listening on port 80) can be part of thousands of simultaneous connections—each with a different remote IP/port combination.

Socket Identification Components
Component	Description	Example
Protocol	Transport protocol being used	TCP, UDP, SCTP
Local IP Address	IP address of the local machine	192.168.1.100 or 2001:db8::1
Local Port	Port number on the local machine	8080, 443, 54321
Remote IP Address	IP address of the remote machine	93.184.216.34
Remote Port	Port number on the remote machine	80, 443, 22

The Socket Abstraction Layer

Sockets provide a clean separation between applications and the underlying network protocols. This separation is fundamental to the design of modern operating systems and networks.

The layered architecture:

┌─────────────────────────────────────────┐
│           Application Layer             │
│    (Your code: HTTP client, chat app)   │
├─────────────────────────────────────────┤
│           Socket API                    │
│  (socket, bind, listen, connect, etc.)  │
├─────────────────────────────────────────┤
│           Transport Layer               │
│           (TCP, UDP - kernel)           │
├─────────────────────────────────────────┤
│           Network Layer                 │
│           (IP - kernel)                 │
├─────────────────────────────────────────┤
│           Data Link Layer               │
│       (Ethernet driver - kernel)        │
└─────────────────────────────────────────┘

This architecture provides several critical benefits:

Benefits of the Socket Abstraction

•Protocol Independence — Applications can switch between TCP and UDP (or other protocols) with minimal code changes. The socket interface remains largely the same.
•Hardware Abstraction — Applications don't need to know whether data travels over Ethernet, WiFi, or 4G. The socket interface is the same regardless of physical medium.
•OS Protection — The kernel maintains exclusive control over network hardware and protocol state. Applications cannot corrupt network infrastructure or interfere with other applications' connections.
•Resource Management — The OS efficiently manages socket resources, buffer allocation, and connection tables across all applications on the system.
•Portability — The BSD socket API has been implemented on virtually every operating system. Code written using sockets can run on Linux, Windows, macOS, and many embedded systems with minimal modification.

Historical Significance

Socket Types and Domains

Socket Domains (Address Families):

The domain specifies the protocol family used for communication. The most important domains are:

Common Socket Domains
Domain	Constant	Description	Use Case
IPv4	AF_INET	Internet Protocol version 4	Standard Internet communication
IPv6	AF_INET6	Internet Protocol version 6	Modern Internet with larger address space
Unix	AF_UNIX / AF_LOCAL	Local inter-process communication	Fast IPC on same machine
Bluetooth	AF_BLUETOOTH	Bluetooth communication	Wireless device communication
Packet	AF_PACKET	Low-level packet interface	Packet capture, custom protocols

Socket Types:

The type specifies the communication semantics—how data flows through the socket:

Socket Types and Their Characteristics
Type	Constant	Protocol	Characteristics
Stream	SOCK_STREAM	TCP	Reliable, ordered, connection-oriented byte streams
Datagram	SOCK_DGRAM	UDP	Unreliable, unordered, connectionless messages
Raw	SOCK_RAW	IP/Custom	Direct access to IP layer for custom protocols
Seq. Packet	SOCK_SEQPACKET	SCTP	Reliable, ordered, connection-oriented messages

Stream Sockets (SOCK_STREAM)

•Provides reliable, bidirectional byte stream
•Data arrives in order, no duplicates
•Connection must be established first
•Uses TCP for AF_INET/AF_INET6
•Flow control and congestion control built-in
•Best for: Web, email, file transfer

Datagram Sockets (SOCK_DGRAM)

•Sends/receives discrete messages (datagrams)
•No guaranteed delivery or ordering
•No connection establishment required
•Uses UDP for AF_INET/AF_INET6
•Lower overhead, lower latency
•Best for: Gaming, streaming, DNS

Raw Sockets Require Privileges

Socket Addressing

Every socket needs an address that identifies where it lives in the network. Socket addresses combine protocol-specific addressing information into structures that the socket API can use.

The sockaddr Structure:

The socket API uses a generic sockaddr structure that can represent addresses from any protocol family. Protocol-specific structures (like sockaddr_in for IPv4) extend this base structure:

// Generic socket address structure
struct sockaddr {
    sa_family_t sa_family;    // Address family (AF_INET, AF_INET6, etc.)
    char        sa_data[14];  // Protocol-specific address data
};

// IPv4 socket address structure
struct sockaddr_in {
    sa_family_t    sin_family;  // AF_INET
    in_port_t      sin_port;    // Port number (network byte order)
    struct in_addr sin_addr;    // IPv4 address
    char           sin_zero[8]; // Padding
};

// IPv6 socket address structure  
struct sockaddr_in6 {
    sa_family_t     sin6_family;   // AF_INET6
    in_port_t       sin6_port;     // Port number
    uint32_t        sin6_flowinfo; // Flow information
    struct in6_addr sin6_addr;     // IPv6 address
    uint32_t        sin6_scope_id; // Scope ID
};

Network Byte Order

Port Numbers:

Ports are 16-bit numbers (0-65535) that identify specific applications or services on a host:

Port Range	Category	Description
0-1023	Well-Known Ports	Reserved for system services (HTTP=80, HTTPS=443, SSH=22)
1024-49151	Registered Ports	Assigned by IANA for specific applications
49152-65535	Dynamic/Ephemeral	OS assigns automatically for client connections

Special Addresses:

INADDR_ANY (0.0.0.0) — Binds to all available network interfaces
INADDR_LOOPBACK (127.0.0.1) — Binds to loopback interface only (local connections)
INADDR_BROADCAST (255.255.255.255) — For broadcast transmissions

For IPv6:

in6addr_any (::) — Equivalent to INADDR_ANY
in6addr_loopback (::1) — IPv6 loopback address

Address Resolution Considerations

•DNS Resolution — Convert hostnames to IP addresses using getaddrinfo(), which handles both IPv4 and IPv6
•Protocol Agnostic Code — Use getaddrinfo() instead of gethostbyname() for IPv6 compatibility
•Dual-Stack Sockets — IPv6 sockets can handle both IPv6 and IPv4 (mapped) addresses when configured
•Binding Considerations — Binding to INADDR_ANY exposes the service on all interfaces; consider security implications

The Socket Lifecycle

Core Socket Operations:

The socket API provides a set of fundamental operations that control the socket lifecycle:

Socket API Functions
Function	Purpose	Used By
socket()	Create a new socket	Client and Server
bind()	Assign local address to socket	Server (required), Client (optional)
listen()	Mark socket as passive (accepting connections)	Server only
accept()	Accept incoming connection, create new socket	Server only
connect()	Establish connection to remote socket	Client (TCP), Optional (UDP)
send() / write()	Transmit data through socket	Client and Server
recv() / read()	Receive data from socket	Client and Server
close()	Close socket and release resources	Client and Server
shutdown()	Gracefully close one or both directions	Client and Server

TCP Server Lifecycle:

  socket()     Create a socket
     ↓
  bind()       Assign port (e.g., 8080)
     ↓
  listen()     Begin accepting connections
     ↓
  accept()  ←─ Block until client connects
     ↓
[new socket]   Dedicated socket for this client
     ↓
  recv/send()  Communicate with client
     ↓
  close()      Close client connection
     ↓
  [loop back to accept() for next client]

TCP Client Lifecycle:

  socket()     Create a socket
     ↓
  connect()    Connect to server (initiates 3-way handshake)
     ↓
  send/recv()  Communicate with server
     ↓
  close()      Close connection

The Accept Pattern

UDP Lifecycle (Simpler):

UDP sockets don't require connection establishment:

  socket()        Create UDP socket
     ↓
  bind()          (Server) Assign local port
     ↓
  sendto/recvfrom()  Send/receive datagrams with explicit addresses
     ↓
  close()         Close socket

Note: UDP can use connect() to set a default destination, enabling use of send()/recv() instead of sendto()/recvfrom(). This doesn't create a true connection—it's just a convenience.

Socket Options and Configuration

Socket options are set and retrieved using:

int setsockopt(int sockfd, int level, int optname, 
               const void *optval, socklen_t optlen);
int getsockopt(int sockfd, int level, int optname,
               void *optval, socklen_t *optlen);

The level parameter specifies which protocol layer the option applies to:

SOL_SOCKET — Socket layer options
IPPROTO_TCP — TCP-specific options
IPPROTO_IP — IP layer options

Essential Socket Options
Option	Level	Type	Purpose
SO_REUSEADDR	SOL_SOCKET	int	Allow binding to address already in use (TIME_WAIT)
SO_REUSEPORT	SOL_SOCKET	int	Allow multiple sockets to bind to same port
SO_KEEPALIVE	SOL_SOCKET	int	Enable TCP keep-alive probes
SO_RCVBUF	SOL_SOCKET	int	Receive buffer size
SO_SNDBUF	SOL_SOCKET	int	Send buffer size
SO_RCVTIMEO	SOL_SOCKET	timeval	Receive timeout
SO_SNDTIMEO	SOL_SOCKET	timeval	Send timeout
SO_LINGER	SOL_SOCKET	linger	Behavior on close with pending data
TCP_NODELAY	IPPROTO_TCP	int	Disable Nagle's algorithm
TCP_QUICKACK	IPPROTO_TCP	int	Disable delayed acknowledgments

SO_REUSEADDR is Almost Always Needed

Common Configuration Patterns

•Low-Latency Applications: Set TCP_NODELAY=1 and TCP_QUICKACK=1 to disable Nagle's algorithm and delayed ACKs
•High-Throughput: Increase SO_RCVBUF and SO_SNDBUF (kernel may auto-tune, check first)
•Connection Pooling: Enable SO_KEEPALIVE with appropriate TCP_KEEPIDLE/TCP_KEEPINTVL values
•Graceful Shutdown: Configure SO_LINGER to control behavior when closing with pending data
•Load Balancing: Use SO_REUSEPORT to allow multiple processes to accept on the same port

Socket Buffers and Data Flow

Every socket has two kernel buffers: a send buffer and a receive buffer. Understanding how data flows through these buffers is crucial for writing efficient and correct network code.

Data Flow Model:

Application                    Kernel                     Network
┌──────────┐              ┌──────────────┐
│  send()  │ ──→ copy ──→ │ Send Buffer  │ ──→ TCP/IP ──→ [Network]
└──────────┘              └──────────────┘

┌──────────┐              ┌──────────────┐
│  recv()  │ ←── copy ←── │ Recv Buffer  │ ←── TCP/IP ←── [Network]
└──────────┘              └──────────────┘

Key Points:

send() is a copy operation: When you call send(), data is copied from your application buffer to the kernel's send buffer. send() returns when the copy is complete—not when data reaches the destination.
recv() is also a copy operation: recv() copies data from the kernel's receive buffer to your application buffer. If the receive buffer is empty, recv() blocks (in blocking mode).
Buffers are finite: When buffers fill, operations block (blocking mode) or return errors (non-blocking mode).

send() Success Doesn't Mean Delivery

Buffer Sizing Considerations:

Buffer Issue	Symptoms	Solution
Receive buffer too small	Data loss (UDP), slow throughput (TCP)	Increase SO_RCVBUF
Send buffer too small	Slow writes, high CPU (many small sends)	Increase SO_SNDBUF
Buffers too large	Wasted memory, increased latency (bufferbloat)	Use appropriate sizes

Modern kernels often auto-tune buffer sizes based on connection characteristics, but manual tuning may be needed for specialized applications.

The Bandwidth-Delay Product:

For optimal TCP throughput, the buffer should be at least as large as the bandwidth-delay product (BDP):

BDP = Bandwidth × Round-Trip Time

Example:
- 1 Gbps connection with 50ms RTT
- BDP = 1,000,000,000 bits/sec × 0.050 sec = 50,000,000 bits = 6.25 MB

This tells us the buffer needs to hold at least 6.25 MB of data to fully utilize a 1 Gbps link with 50ms RTT.

Summary: Socket Fundamentals

We've established a comprehensive foundation for understanding network sockets. Let's consolidate the essential concepts:

Key Takeaways

•Sockets are communication endpoints — They provide the interface between applications and the network stack, identified by (IP address, port) pairs.
•Socket abstraction hides complexity — Applications read and write data without managing packets, routing, or protocol state machines.
•Domain and type determine behavior — AF_INET/AF_INET6 for Internet, SOCK_STREAM for TCP, SOCK_DGRAM for UDP.
•Addressing requires proper structure and byte order — Use sockaddr_in/sockaddr_in6, always convert to network byte order.
•Lifecycle differs by protocol — TCP requires connection establishment; UDP is connectionless.
•Socket options enable fine-tuning — SO_REUSEADDR, TCP_NODELAY, buffer sizes, and timeouts are commonly configured.
•Buffers bridge applications and kernel — Understanding buffer behavior is crucial for performance and correctness.

What's Next:

Page Complete

1 / 5