If there is one component that most frequently becomes the bottleneck in growing systems, it is the database. Ask any engineer who has scaled a system from thousands to millions of users, and they will tell you stories of database struggles—queries that suddenly take seconds instead of milliseconds, connection pools exhausted, replication lag causing data inconsistencies, and the dreaded 3 AM page because the primary database is overloaded.
The database is often the bottleneck because it sits at the intersection of multiple constraints:
- It must persist every write durably to disk
- It must uphold ACID guarantees through locking and coordination
- It must serve consistent, up-to-date reads
- It must execute arbitrary queries efficiently
- It acts as the single source of truth that every application instance depends on
Understanding why databases become bottlenecks—and how to address these constraints—is essential knowledge for any system designer.
By the end of this page, you will understand why databases are uniquely prone to becoming bottlenecks, how to identify database bottleneck symptoms, the fundamental constraints databases face, and architectural patterns for scaling databases horizontally and vertically. This knowledge is critical for designing systems that don't collapse under load.
To understand why databases so frequently become the constraint, we need to examine the fundamental properties that make databases valuable—and why those same properties create scaling challenges.
1. Durability Requires Disk I/O
Databases must persist data reliably. This means writing to disk (or multiple disks, for redundancy). Even with SSDs, disk I/O is orders of magnitude slower than memory operations. Every write operation ultimately requires:
- A write to the write-ahead log (WAL)
- An fsync (or equivalent) to force the data onto durable storage
- Often, replication of the change to one or more standbys
This creates a fundamental throughput limit. A database can only commit as many transactions as the disk subsystem can durably store.
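The ceiling this imposes can be sketched with back-of-envelope arithmetic. The 1 ms fsync latency and the group-commit batch size below are illustrative assumptions, not measurements of any particular system:

```python
# Back-of-envelope commit throughput ceiling imposed by fsync.
# fsync_ms and group_commit_size are illustrative assumptions.

def max_commits_per_sec(fsync_ms: float, group_commit_size: int = 1) -> float:
    """Each fsync durably stores one batch of transactions, so the ceiling
    is (fsyncs per second) * (transactions per batch)."""
    fsyncs_per_sec = 1000.0 / fsync_ms
    return fsyncs_per_sec * group_commit_size

# A disk taking 1 ms per fsync caps a naive design at ~1,000 commits/s;
# group commit (batching many transactions per fsync) raises the ceiling.
print(max_commits_per_sec(fsync_ms=1.0))                        # 1000.0
print(max_commits_per_sec(fsync_ms=1.0, group_commit_size=50))  # 50000.0
```

This is why databases batch transactions into a single fsync (group commit): the disk, not the CPU, sets the commit rate.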
2. ACID Guarantees Require Coordination
The properties that make relational databases reliable—Atomicity, Consistency, Isolation, Durability—all require coordination mechanisms:
- Atomicity: transaction logs and undo records so partial changes can be rolled back
- Consistency: constraint and integrity checks on every change
- Isolation: locks or multi-version concurrency control (MVCC) between concurrent transactions
- Durability: fsync and replication before a commit is acknowledged
Coordination means contention. Multiple operations competing for the same resources (rows, tables, locks) cannot proceed in parallel. As concurrency increases, contention increases, and throughput plateaus or even decreases.
3. The Single Source of Truth Problem
In most architectures, the database is the authoritative source of truth. Everything flows through it:
- User-facing reads and writes
- Background jobs and batch processing
- Analytics and reporting queries
This creates a fan-in pattern where potentially thousands of application instances all contend for the same database resources. The database becomes a centralized bottleneck in an otherwise distributed system.
| Property | Why It's Valuable | Why It Creates Bottlenecks |
|---|---|---|
| Durability | Data survives crashes and power failures | Requires disk I/O, fsync, replication |
| ACID Transactions | Data integrity, predictable behavior | Requires locking, coordination, serialization |
| Consistency | Queries return correct, up-to-date data | Limits read scaling (reads from primary or sync replicas) |
| Query Flexibility | SQL allows arbitrary queries | Complex queries consume CPU, memory, I/O; hard to optimize |
| Single Source of Truth | No data conflicts, clear authority | Creates centralized contention point |
Database scaling challenges are deeply connected to the CAP theorem. To maintain strong Consistency and tolerate Partitions, you sacrifice Availability (or vice versa). Most relational databases prioritize CP, which inherently limits horizontal scaling. This is why NoSQL databases—which often relax consistency guarantees—can scale more easily for certain workloads.
Database bottlenecks manifest in specific, recognizable patterns. Learning to identify these patterns quickly is a critical skill for any engineer operating production systems.
Symptom 1: Increasing Query Latency
The most obvious symptom: queries that once took 10ms now take 500ms or more. This can happen gradually (as data grows) or suddenly (when load spikes). Causes include:
- Missing or no-longer-selective indexes as data grows
- Lock contention between concurrent transactions
- Resource saturation (CPU, memory, disk I/O)
Symptom 2: Connection Pool Saturation
Application logs show errors like 'unable to acquire connection' or 'connection timeout.' The database itself might not be overloaded, but the maximum connection limit is reached. Causes:
- Too many application instances, each holding its own connections
- Connection leaks (connections acquired but never released)
- Long-running transactions holding connections open
Symptom 3: Replication Lag
For systems with read replicas, replication lag is a critical metric. High lag means replicas are falling behind the primary, causing:
- Stale reads: users see outdated data from replicas
- Read-after-write anomalies: a user's own change appears to be missing
Symptom 4: Lock Wait Timeouts
Queries failing with 'lock wait timeout exceeded' indicate severe contention. This typically means:
- Hot rows that many transactions update concurrently
- Long-running transactions holding locks
- Missing indexes forcing locks over wider row ranges
| Symptom | Metrics to Check | Common Causes | Immediate Action |
|---|---|---|---|
| Slow queries | Query latency percentiles, slow query log | Lock contention, missing indexes, resource saturation | Identify slow queries, add indexes, kill long transactions |
| Connection exhaustion | Active connections, pool utilization | Too many clients, connection leaks, long transactions | Increase pool size, fix leaks, add connection pooler (PgBouncer) |
| Replication lag | Seconds behind primary, WAL lag | Write volume exceeds replica apply rate, network issues | Upgrade replica hardware, reduce write volume |
| Lock timeouts | Lock wait time, deadlock count | Hot rows, long transactions, missing indexes | Optimize hot paths, reduce transaction scope |
| High I/O wait | Disk utilization, IOPS consumption | Data exceeds memory, write-heavy workload | Add RAM, upgrade to SSD, scale writes |
Monitoring Tools and Queries:
Every database platform has specific tools for bottleneck identification:
PostgreSQL:
- pg_stat_activity — Active queries and their states (waiting, active, idle)
- pg_stat_user_tables — Table-level statistics (scans, tuples read/written)
- pg_locks — Current lock information
- pg_stat_replication — Replication status and lag
- EXPLAIN ANALYZE — Query execution plans and actual timings

MySQL:
- SHOW PROCESSLIST — Active connections and queries
- performance_schema — Detailed performance metrics
- SHOW ENGINE INNODB STATUS — InnoDB internals, deadlock info
- Slow Query Log — Queries exceeding threshold
- EXPLAIN ANALYZE — Query execution plans (MySQL 8.0+)

General Principles:
Enable slow query logging in production with a reasonable threshold (100ms-1s) and review it regularly: a single poorly-optimized query running thousands of times can be your entire bottleneck, and the slow query log is usually where you find it.
Not all database bottlenecks are the same. Understanding whether you're read-bottlenecked or write-bottlenecked dramatically changes your scaling strategy.
Read-Heavy Workloads:
Most applications are read-heavy—often 90%+ reads. Common examples:
- Content sites and blogs (many views per post)
- E-commerce catalogs (many product views per purchase)
- Social feeds (many reads per write)
Characteristics of read bottlenecks:
- Read latency degrades while write latency stays acceptable
- CPU and I/O are consumed mostly by SELECTs
- Caching and read replicas provide immediate relief
Write-Heavy Workloads:
Some applications are write-intensive:
- Logging and metrics ingestion
- IoT and telemetry data collection
- Event tracking and analytics pipelines
Characteristics of write bottlenecks:
- Disk I/O and WAL throughput saturate first
- Replication lag grows under sustained load
- Caching and read replicas provide little relief, because all writes still funnel to the primary
Identifying Your Workload Type:
Use database metrics to characterize your workload:
PostgreSQL:
SELECT
sum(tup_returned + tup_fetched) as reads,
sum(tup_inserted + tup_updated + tup_deleted) as writes,
round(sum(tup_returned + tup_fetched)::numeric /
NULLIF(sum(tup_inserted + tup_updated + tup_deleted), 0), 2) as read_write_ratio
FROM pg_stat_user_tables;
Key insight: Read replicas only help read-heavy workloads. Adding 10 read replicas to a write-heavy system provides minimal benefit because all writes still go to the single primary. Understanding your read/write ratio is essential for choosing the right scaling strategy.
At 90% reads / 10% writes: read replicas and caching are highly effective. At 50% reads / 50% writes: you need both read scaling AND write scaling strategies. At 90% writes / 10% reads: sharding and write-optimized databases become necessary. Know your ratio before choosing your approach.
Before implementing complex scaling solutions, ensure you're using connections efficiently. Connection pooling is often the single highest-ROI fix for database bottlenecks.
Why Connection Pooling Matters:
Database connections are expensive:
- Each one consumes server memory (in PostgreSQL, a dedicated backend process)
- Establishing one costs a TCP handshake, optional TLS negotiation, and authentication
- Each active connection adds CPU overhead for context switching
Without pooling, an application that handles 1000 concurrent requests might try to open 1000 database connections simultaneously—far exceeding typical limits.
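A minimal sketch of such a pool, using only the Python standard library and sqlite3 as a stand-in for a real database driver. Production pools (HikariCP, SQLAlchemy's pool, Go's database/sql) add health checks, leak detection, and connection recycling on top of this core idea:

```python
# Minimal connection pool sketch: a fixed set of connections is opened
# up front and shared; callers block instead of opening new connections.
import queue
import sqlite3
from contextlib import contextmanager

class ConnectionPool:
    def __init__(self, factory, size: int):
        self._pool = queue.Queue(maxsize=size)
        for _ in range(size):           # open a fixed number of connections up front
            self._pool.put(factory())

    @contextmanager
    def connection(self, timeout: float = 5.0):
        conn = self._pool.get(timeout=timeout)   # block rather than open a new one
        try:
            yield conn
        finally:
            self._pool.put(conn)                 # return to the pool, never close

# 1000 concurrent requests would share these 10 connections instead of
# each opening its own.
pool = ConnectionPool(lambda: sqlite3.connect(":memory:"), size=10)
with pool.connection() as conn:
    print(conn.execute("SELECT 1").fetchone())   # (1,)
```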
Application-Level Connection Pooling:
Most application frameworks include connection pools:
- HikariCP (Java), SQLAlchemy's engine pool (Python), database/sql (Go), ActiveRecord's pool (Rails)
External Connection Poolers:
For PostgreSQL, external poolers like PgBouncer or Pgpool-II sit between applications and the database:
- They multiplex thousands of client connections onto a small pool of actual database connections
- Pool modes (session, transaction, statement) trade feature compatibility for multiplexing efficiency
Example Configuration (PgBouncer):
```ini
[databases]
myapp = host=db.internal.example.com port=5432 dbname=myapp

[pgbouncer]
pool_mode = transaction
; accept up to 5000 application connections
max_client_conn = 5000
; but use only 50 actual database connections per database
default_pool_size = 50
; emergency reserve connections, used after a 5-second wait
reserve_pool_size = 10
reserve_pool_timeout = 5
```
This configuration allows 5000 application connections to be served by just 50 database connections—a 100x multiplier.
| Application Instances | DB Max Connections | Pool Size per Instance | Recommended Pooler |
|---|---|---|---|
| 5-10 | 100-200 | 10-20 | Application-level pool usually sufficient |
| 10-50 | 200-500 | 5-10 | Consider external pooler |
| 50-200 | 500-1000 | 2-5 | External pooler highly recommended |
| 200+ | 1000+ | N/A | External pooler mandatory (PgBouncer, ProxySQL) |
More connections is not better. Each connection consumes memory and CPU for context switching. Beyond a point, adding connections degrades performance due to contention. For a typical PostgreSQL instance, 100-200 active connections is often optimal. Monitor queries per second, not just connection count.
The most effective way to scale reads is to avoid hitting the database entirely. Caching puts frequently-accessed data in faster storage (memory) closer to the application.
Cache Hit Rate Is Everything:
The value of caching depends entirely on hit rate:
- At a 90% hit rate, the database serves only 10% of reads
- At 99%, only 1%—each additional point of hit rate matters enormously
- A low hit rate (under ~50%) may add latency and complexity for little benefit
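The arithmetic is worth making concrete. In this sketch the latency figures are illustrative assumptions (roughly 1 ms for a cache hit, 20 ms for a database read), not benchmarks:

```python
# How cache hit rate translates into residual database load and
# effective read latency. cache_ms and db_ms are illustrative assumptions.

def db_read_fraction(hit_rate: float) -> float:
    """Fraction of reads that still reach the database."""
    return round(1.0 - hit_rate, 4)

def effective_read_ms(hit_rate: float, cache_ms: float = 1.0, db_ms: float = 20.0) -> float:
    """Expected latency: weighted average of hits and misses."""
    return round(hit_rate * cache_ms + (1.0 - hit_rate) * db_ms, 3)

# Going from a 90% to a 99% hit rate cuts database reads by 10x:
print(db_read_fraction(0.90), effective_read_ms(0.90))  # 0.1 2.9
print(db_read_fraction(0.99), effective_read_ms(0.99))  # 0.01 1.19
```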
Before implementing caching, analyze your access patterns. Caching works well for:
- Frequently read, rarely updated data (profiles, catalogs, configuration)
- Skewed access patterns where a small set of hot keys dominates
- Reads that tolerate briefly stale data
Caching works poorly for:
- Frequently updated data (every write forces an invalidation)
- Uniformly random access over a large keyspace (low hit rate)
- Reads that require strong consistency
Caching Patterns:
Cache-Aside (Lazy Loading):
- The application checks the cache first and returns the value on a hit
- On a miss, it reads from the database and populates the cache, typically with a TTL
- Writes go to the database; the cached entry is invalidated or left to expire

Best for: Read-heavy workloads with tolerance for brief stale data
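A minimal cache-aside sketch. The in-memory dict and the `load_user` function are stand-ins for Redis and a real database query:

```python
# Cache-aside: check the cache, fall back to the database on a miss,
# then populate the cache with a TTL.
import time

CACHE: dict = {}          # key -> (value, expires_at); stand-in for Redis
TTL_SECONDS = 60

def load_user(user_id):   # stand-in for SELECT ... FROM users WHERE id = ...
    return {"id": user_id, "name": f"user-{user_id}"}

def get_user(user_id):
    entry = CACHE.get(user_id)
    if entry and entry[1] > time.monotonic():
        return entry[0]                       # cache hit
    value = load_user(user_id)                # cache miss: hit the database
    CACHE[user_id] = (value, time.monotonic() + TTL_SECONDS)
    return value

def update_user(user_id, fields):
    # Database UPDATE elided in this sketch; then invalidate the cache
    # entry so the next read reloads fresh data.
    CACHE.pop(user_id, None)

print(get_user(42))   # miss -> {'id': 42, 'name': 'user-42'}, now cached
print(get_user(42))   # hit  -> served from cache
```

Note the invalidate-on-write in `update_user`: deleting the entry is safer than updating it in place, because a concurrent stale read cannot then overwrite fresh data with an old value for long.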
Write-Through:
- Every write updates the cache and the database synchronously, as one operation
- Reads are served from the cache, which is never stale

Best for: Read-heavy workloads requiring strong consistency
Write-Behind (Write-Back):
- Writes go to the cache first and are acknowledged immediately
- The cache flushes changes to the database asynchronously, often in batches
- A cache failure before the flush loses acknowledged writes

Best for: Write-heavy workloads with tolerance for potential data loss during cache failure
| Pattern | Read Performance | Write Performance | Consistency | Complexity |
|---|---|---|---|---|
| Cache-Aside | Excellent (after warm) | Good | Eventual (TTL-based) | Simple |
| Write-Through | Excellent | Moderate (sync write) | Strong | Moderate |
| Write-Behind | Excellent | Excellent | Weak (risk of loss) | Complex |
| Read-Through | Excellent (after warm) | Good | Eventual | Moderate |
Cache Invalidation Strategies:
Phil Karlton famously said, 'There are only two hard things in Computer Science: cache invalidation and naming things.' Deciding when to remove or update cached data is genuinely hard:
- TTL-based expiration: simplest; staleness is bounded by the TTL
- Explicit invalidation: delete or update the cache entry on every write; precise, but every write path must remember to do it
- Event-based invalidation: consume change events (e.g., from change data capture) and invalidate centrally
Common Caching Technologies:
- Redis: in-memory data store with rich data structures, optional persistence, and clustering
- Memcached: simple, fast, multi-threaded in-memory cache
- In-process caches (e.g., Caffeine, Guava): no network hop, but per-instance and harder to invalidate consistently
Begin with the cache-aside pattern and TTL-based expiration. Measure hit rate, invalidation accuracy, and database load reduction. Only add complexity (event-based invalidation, write-through) if the simple approach is insufficient. Over-engineered caching is a major source of bugs and operational complexity.
When caching isn't sufficient (or practical), read replicas allow you to scale read capacity horizontally by adding copies of your database that can serve read queries.
How Read Replication Works:
- The primary handles all writes and streams its change log (WAL in PostgreSQL, binlog in MySQL) to replicas
- Replicas apply the changes and serve read-only queries
- The application (or a proxy) spreads read traffic across the replicas
This multiplies your read capacity proportionally to the number of replicas—3 replicas means ~3x read throughput (minus replication overhead).
Replication Types:
Synchronous Replication:
- The primary waits for the replica to confirm the write before acknowledging the commit
- No data loss on failover, but every write pays the replica round trip
Asynchronous Replication:
- The primary acknowledges commits immediately and ships changes in the background
- Low write latency, but replicas lag and a primary failure can lose recent writes
Semi-Synchronous (MySQL):
- The primary waits for at least one replica to acknowledge receipt (not application) of the change
- A middle ground: bounded data-loss risk at a moderate latency cost
Routing Read Traffic:
Applications must decide which queries go to primary vs. replicas:
- Writes and transactional reads must go to the primary
- Reads that tolerate slight staleness can go to replicas
- Reads that must reflect a user's own recent writes need special handling
Many applications implement 'sticky reads'—after a write, subsequent reads for that session go to primary for a brief period (e.g., 5 seconds) to ensure the user sees their own changes.
| Configuration | Read Scale | Consistency | Write Latency | Failure Tolerance |
|---|---|---|---|---|
| Single Primary, No Replicas | 1x | Strong | Baseline | Data loss if primary fails |
| Primary + Async Replicas | Nx | Eventual | Baseline | Potential data loss |
| Primary + Sync Replica | 2x | Strong | Higher | No data loss |
| Primary + Mixed Replicas | Nx | Configurable | Depends | Configurable |
Practical Considerations:
Replica Promotion on Failure: If the primary fails, a replica can be promoted to become the new primary. This requires:
- Failure detection (deciding the primary is actually down, not just slow)
- Selecting and promoting the most up-to-date replica
- Redirecting application traffic to the new primary
- Fencing the old primary to prevent split-brain
Cross-Region Replicas: Replicas can be placed in different geographic regions:
- Lower read latency for users near the replica
- A disaster-recovery option if the primary region fails
- Higher replication lag over wide-area links
Replica Sizing: Replicas don't need identical specs to the primary:
- Analytics replicas can be sized for heavy scan workloads
- Replicas that are promotion candidates should match the primary's specs
- Under-sized replicas will fall behind under sustained write load
With asynchronous replication, replicas will always be slightly behind the primary. Under heavy write load, this lag can grow to seconds or minutes. Design your application to handle this—either by tolerating stale reads, implementing read-after-write consistency explicitly, or using synchronous replication with its performance tradeoffs.
When you've exhausted vertical scaling, caching, and read replicas, and writes are still the bottleneck, sharding becomes necessary. Sharding distributes data across multiple primary databases, each handling a subset of the total dataset.
How Sharding Works:
- Choose a shard key that appears in most queries (for example, user_id)
- Apply a routing function that maps each key to exactly one shard
- Each shard is an independent primary database holding only its subset of the data

Example: For a social network, you might shard by user_id across four databases, routing each user with shard = hash(user_id) % 4.
Now, four primary databases share the write load. Each user's data lives entirely on one shard.
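The routing function can be sketched in a few lines. A stable hash (rather than Python's per-process randomized `hash()`) keeps routing consistent across application instances; md5 here is for distribution, not security:

```python
# Hash-sharding the user_id example above across four primaries.
import hashlib

NUM_SHARDS = 4

def shard_for(user_id: int) -> int:
    """Map a user_id deterministically to one of the shards."""
    digest = hashlib.md5(str(user_id).encode()).hexdigest()
    return int(digest, 16) % NUM_SHARDS

# Every operation for a given user routes to the same shard, so that
# user's data lives entirely on one primary.
assert shard_for(12345) == shard_for(12345)     # deterministic
print({u: shard_for(u) for u in range(8)})      # each user maps to shard 0-3
```

Note the resharding cost implied by `% NUM_SHARDS`: changing the shard count re-maps most keys, which is why the table below lists resharding as the weak point of hash sharding.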
Sharding Strategies:
Hash Sharding:
- Route with shard = hash(key) mod N
- Spreads load evenly, but range queries must hit every shard and changing N re-maps most keys
Range Sharding:
- Assign contiguous key ranges to shards (e.g., one shard per time window or alphabetical range)
- Range queries stay on one shard, but popular ranges become hotspots
Directory/Lookup Sharding:
- A lookup table maps each key (or tenant) to its shard
- Maximum flexibility for placement and migration, at the cost of maintaining the directory itself
| Strategy | Distribution | Range Queries | Resharding | Hotspots |
|---|---|---|---|---|
| Hash | Even (if hash is good) | Scatter-gather required | Complex (rehash all) | Unlikely |
| Range | Depends on key distribution | Efficient | Split ranges | Possible on popular ranges |
| Directory | Controllable | Depends on layout | Update directory | Avoidable with care |
Sharding Challenges:
Cross-Shard Queries: Queries that need data from multiple shards are expensive:
- The query must be sent to every shard (scatter-gather) and the results merged in the application
- Cross-shard joins and transactions require application-level logic or distributed transaction protocols
Resharding: As you grow, you'll need more shards. Resharding is operationally complex:
- Data must be moved between shards while serving live traffic
- Routing changes, backfills, and cutover must be carefully coordinated
- Hash-sharded systems may need to re-map most keys
No Global Secondary Indexes: An index that spans all shards is impractical. If your shard key is user_id but you need to search by email:
- Option 1: scatter-gather, querying every shard and discarding the misses
- Option 2: maintain a separate lookup mapping (email to user_id), resolve the shard key first, then route normally
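The lookup-table option can be sketched as follows. The dict stands in for what would, in production, be its own replicated table or key-value store maintained on signup and email change; the shard function is deliberately simplistic:

```python
# Lookup-table routing for a non-shard-key attribute: resolve email to
# user_id first, then route by the real shard key.

EMAIL_TO_USER_ID = {"alice@example.com": 12345}   # maintained on signup/change
NUM_SHARDS = 4

def shard_for(user_id: int) -> int:
    return user_id % NUM_SHARDS          # simplistic shard function for the sketch

def find_user_by_email(email: str):
    user_id = EMAIL_TO_USER_ID.get(email)
    if user_id is None:
        return None                      # or fall back to scatter-gather
    return ("shard-%d" % shard_for(user_id), user_id)

print(find_user_by_email("alice@example.com"))   # ('shard-1', 12345)
```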
When to Shard: Sharding introduces massive complexity. Only shard when:
- A single primary cannot absorb your write volume even on the largest available hardware
- Caching, read replicas, and query optimization are already in place
- Your data model has a natural shard key
Sharding should be one of the last tools you reach for. It adds enormous complexity: cross-shard queries, distributed transactions, resharding operations, and operational overhead. Many companies avoid sharding by using managed database services like Aurora, Spanner, or CockroachDB that provide scaling without manual sharding. Only shard when you've exhausted alternatives.
The database is often the most challenging bottleneck to address because it sits at the center of most architectures and must balance competing demands: durability, consistency, performance, and scalability. Let's consolidate the key insights:
Scaling Toolkit Summary:
- Connection pooling: often the highest-ROI fix; do this first
- Caching: absorbs read traffic before it ever reaches the database
- Read replicas: scale reads horizontally; design for replication lag
- Sharding: scales writes, at significant complexity; a last resort after the alternatives are exhausted
What's Next:
With database bottlenecks covered in depth, the final page of this module explores network bottlenecks—the constraints introduced by moving data between components in distributed systems.
You now understand why databases become bottlenecks, how to identify database constraints, and the architectural patterns for addressing them—from connection pooling through caching, read replicas, and sharding. This knowledge is essential for any system that stores persistent data at scale. Next, we'll examine network bottlenecks in distributed systems.