System Design (HLD)High-Level Design

High-Level Design: From Requirements to Architecture

LevelIntermediate

Duration180 mins

TopicHigh-Level Design

1 / 5

Component Identification: Decomposing Systems into Building Blocks

The Architecture Layer Cake

You've gathered requirements, estimated scale, and analyzed tradeoffs. Now comes the pivotal moment in system design: translating abstract requirements into concrete architectural components. This is where design becomes tangible—where 'the system should support millions of users' transforms into specific services, databases, queues, and caches that collectively deliver that capability.

Component identification is simultaneously the most creative and the most disciplined phase of system design. It requires imagination to envision how pieces might fit together, and rigor to ensure those pieces are well-defined, appropriately scoped, and cohesively integrated.

This page establishes the foundational skill of decomposing systems into building blocks. You'll learn systematic approaches for identifying components, techniques for defining component boundaries, and patterns that guide decomposition across different system types.

What You Will Learn

By the end of this page, you will understand how to systematically identify architectural components from requirements, define clear component boundaries and responsibilities, apply decomposition patterns across different domains, and recognize common components that appear in most distributed systems.

What Is a Component?

Before diving into identification techniques, we need precision about what we mean by component in system design. This term is used loosely across software engineering, so let's establish clear definitions.

In the context of high-level design, a component is an autonomous unit of functionality with:

Clear boundaries: Well-defined inputs, outputs, and interfaces
Single responsibility: Focused purpose that can be described in one sentence
Independent deployability: Can be deployed, scaled, and updated without requiring changes to other components
Encapsulated state: Owns and manages its own data and internal state

Components are abstract—they can be implemented as microservices, serverless functions, monolith modules, background workers, or even third-party services. At the high-level design stage, we care about what functions they serve, not how they're implemented.

Component Types in System Design
Component Type	Description	Examples
Compute Components	Execute business logic and handle requests	API servers, background workers, stream processors
Data Components	Store and serve persistent data	Databases, caches, blob stores, search indices
Messaging Components	Enable asynchronous communication between services	Message queues, event buses, pub/sub systems
Gateway Components	Manage external access and routing	API gateways, load balancers, CDNs
Infrastructure Components	Support observability, security, and operations	Logging services, monitoring, secret managers

Components Are Not Microservices

A common misconception is that every component must become a separate microservice. In reality, multiple components might live in a single deployment (modular monolith), or a single logical component might span multiple processes (sharded database). Component identification is about logical decomposition—implementation decisions come later.

The Decomposition Challenge

Component identification seems straightforward in theory but presents substantial challenges in practice. The core tension lies between two opposing forces:

Under-decomposition (too few components):

Creates monolithic structures that are hard to scale independently
Leads to tight coupling where changes ripple across the entire system
Makes it difficult to assign clear ownership to teams
Forces all parts of the system to scale together, even when only one part needs more capacity

Over-decomposition (too many components):

Increases operational complexity and deployment overhead
Creates distributed system challenges: network latency, partial failures, data consistency
Requires sophisticated coordination mechanisms
Can lead to 'distributed monolith' anti-pattern where tightly coupled services lose benefits of separation

The goal is to find the appropriate granularity that balances autonomy, cohesion, and operational complexity. This is deeply context-dependent—there's no universal 'right' number of components.

Signs of Poor Decomposition

•Circular dependencies: Component A calls B, which calls C, which calls A. Design is tangled.
•Shared databases: Multiple components directly access the same tables. No clear data ownership.
•Change amplification: Modifying one feature requires changing multiple components.
•Phantom components: Components that exist but have unclear purpose or duplicated functionality.
•Synchronized deployments: Components must be deployed together, despite being 'separate'.
•Cross-component transactions: Business operations span multiple components atomically.

The Distributed Monolith Trap

The worst outcome is achieving the complexity of microservices without their benefits. If your 'independent' services require coordinated deployments, share databases, and have chatty communication, you've built a distributed monolith. It's slower than a monolith and harder to operate than well-designed microservices. Thoughtful component identification helps avoid this trap.

Decomposition Strategies

Several proven strategies guide component identification. These aren't mutually exclusive—production systems typically employ a combination based on context.

Strategy 1: Domain-Driven Decomposition

Align components with business domains using Bounded Contexts from Domain-Driven Design (DDD). Each bounded context represents a coherent area of the business with its own ubiquitous language, models, and rules.

Example: E-commerce platform decomposed by domain:

Catalog Context: Product information, categories, search
Inventory Context: Stock levels, warehouse locations, reservations
Order Context: Order lifecycle, fulfillment tracking
Payment Context: Payment processing, refunds, invoicing
Customer Context: User profiles, preferences, authentication

Each context owns its data and logic, communicating with others through well-defined interfaces. This mirrors organizational structure and enables Conway's Law to work in your favor.

Strategy 2: Capability-Based Decomposition

Organize components around technical capabilities that can be reused across domains. This approach extracts cross-cutting concerns into shared infrastructure.

Example: Platform capabilities:

Notification Service: Email, SMS, push notifications for all domains
File Storage Service: Binary storage and retrieval shared across features
Search Service: Unified search infrastructure for products, users, content
Analytics Service: Event collection and reporting across all domains
Authentication Service: Identity and access management for entire platform

Capability-based components become internal platforms that other components consume, enabling consistency and reducing duplication.

Strategy 3: Use Case-Based Decomposition

Group functionality around primary user journeys or use cases. This is particularly effective for systems with distinct user flows that share limited data.

Example: Video platform use cases:

Upload Flow: Video ingestion, transcoding, thumbnail generation
Watch Flow: Stream serving, adaptive bitrate, CDN distribution
Social Flow: Comments, likes, shares, subscriptions
Creator Flow: Analytics dashboard, monetization, channel management
Discovery Flow: Recommendations, search, trending algorithms

Each flow can evolve independently based on its specific optimization goals and user needs.

Strategy 4: Data-Oriented Decomposition

Partition components based on data ownership and access patterns. Components are organized around the data they own, with clear rules for data access.

Example: Social network by data ownership:

User Service: Owns user profiles and credentials
Graph Service: Owns social connections (follows, friends)
Content Service: Owns posts, media references
Feed Service: Owns user timelines and feed caches
Interaction Service: Owns likes, comments, reactions

This strategy naturally prevents shared databases and makes data ownership explicit, crucial for maintaining consistency in distributed systems.

Decomposition Strategy Comparison
Strategy	Best For	Watch Out For
Domain-Driven	Complex business domains with clear boundaries	Domains that overlap or have ambiguous ownership
Capability-Based	Cross-cutting technical concerns, platform teams	Over-extracting and creating too many shared services
Use Case-Based	Distinct user journeys with different optimization needs	Shared data and logic across use cases requiring coordination
Data-Oriented	Systems where data ownership is critical (GDPR, compliance)	Business logic that spans multiple data entities

The Component Identification Process

Component identification is iterative, not linear. However, a structured approach increases the likelihood of a coherent design. Here's a systematic process that works across most system types:

Step 1: Extract Nouns and Verbs from Requirements

Start with your functional requirements and use cases. Identify:

Nouns: Entities the system manages (Users, Orders, Products, Messages)
Verbs: Actions the system performs (Create, Search, Notify, Validate)

Nouns often become data components or domain services. Verbs often become operations within components or separate processing services.

Example: 'Users can upload videos which are then transcoded and distributed to viewers'

Nouns: Users, Videos, Viewers
Verbs: Upload, Transcode, Distribute
Initial components: User Service, Video Storage, Transcoder, Content Distribution

Step 2: Identify Domain Boundaries

Group related nouns and verbs into coherent domains. Look for:

Entities that change together: If Product and Inventory always update simultaneously, they might belong together
Consistent language: Where terminology shifts, boundaries likely exist
Team ownership: Who would own and develop each area?

Draw tentative boundaries around clusters. These become candidate components.

Step 3: Apply the Single Responsibility Test

For each candidate component, articulate its purpose in one sentence. If you can't, the component is probably too broad.

✅ Good: 'The Notification Service handles delivering messages to users across email, SMS, and push channels'

❌ Poor: 'The Core Service handles user management, order processing, and notification delivery'

Components with multiple responsibilities should be split.

Step 4: Analyze Data Ownership

For each candidate component, define:

What data does it own? (Authoritative source of truth)
What data does it need from others? (Dependencies on other components)
How will it share data? (Synchronous calls, events, data replication)

Ideally, each piece of data has exactly one owner. If multiple components need write access to the same data, reconsider boundaries.

Step 5: Identify Synchronous vs. Asynchronous Interactions

Determine how components will communicate:

Synchronous (request-response): When immediate response is required for the operation
Asynchronous (events/messages): When temporal decoupling is acceptable or preferred

Heavy synchronous coupling suggests components might belong together. Natural async boundaries suggest good separation points.

Step 6: Evaluate Against Non-Functional Requirements

Test your decomposition against system requirements:

Scalability: Can each component scale independently? If video transcoding needs 100x more compute than user management, separation helps.
Availability: Can failure in one component be isolated? If payment fails, should catalog browsing also fail?
Latency: Does decomposition add unacceptable network hops? Critical paths might need co-location.
Security: Do security boundaries align with component boundaries? Sensitive components (payment, auth) might need isolation.

Step 7: Iterate and Refine

Component identification is rarely correct on the first pass. As you diagram the system and trace data flows, you'll discover:

Missing components needed for coordination
Over-specified components that should merge
Unclear boundaries that need sharper definition

Iterate until the design feels cohesive and each component has a clear, singular purpose.

Start Coarse, Refine Later

When in doubt, start with fewer, larger components. It's easier to split a well-designed component later than to merge poorly conceived microservices. Many successful systems start as modular monoliths, extracting services only when the need becomes clear.

Common Component Patterns

Certain components appear repeatedly across different system designs. Recognizing these patterns accelerates identification and leverages proven solutions.

Pattern: API Gateway

A single entry point for client requests that handles:

Request routing to appropriate backend services
Authentication and authorization
Rate limiting and throttling
Request/response transformation
Protocol translation (REST to gRPC)

Appears in: Nearly every client-facing system at scale

Pattern: Authentication/Identity Service

Dedicated component for identity management:

User registration and credential storage
Session management and token issuance
OAuth integration and social login
Password reset and account recovery
Multi-factor authentication

Appears in: Any system with user accounts

Pattern: Notification Hub

Centralized notification delivery across channels:

Channel abstraction (email, SMS, push, in-app)
Template management and personalization
Delivery tracking and retry logic
User preference management
Rate limiting and batching

Appears in: E-commerce, social platforms, SaaS applications

Pattern: Media Processing Pipeline

Asynchronous processing for binary content:

Upload handling and validation
Format conversion and optimization
Metadata extraction
Storage orchestration
CDN integration

Appears in: Video platforms, image-heavy apps, document management

Pattern: Search Service

Dedicated search infrastructure:

Index management and updates
Query parsing and optimization
Ranking and relevance scoring
Faceting and filtering
Typeahead and suggestions

Appears in: E-commerce, content platforms, enterprise search

Pattern: Workflow Orchestrator

Coordinator for multi-step processes:

State machine management
Step execution and transitions
Compensation and rollback
Timeout handling
Progress tracking

Appears in: Order fulfillment, approval workflows, data pipelines

Pattern: Cache Layer

Performance optimization through caching:

Frequently accessed data caching
Session storage
Computation result memoization
Rate limit counters
Distributed locking

Appears in: High-traffic read-heavy systems

Pattern: Event Bus / Message Broker

Asynchronous communication backbone:

Event publishing and subscription
Message queuing and delivery
Topic-based routing
Dead letter handling
Event replay capability

Appears in: Event-driven architectures, decoupled microservices

Reusable Component Checklist

•Gateway/Ingress: Do you need a single entry point for routing, auth, and rate limiting?
•Identity/Auth: Does your system have users requiring authentication?
•Notification: Do you need to deliver messages across multiple channels?
•Media Processing: Does your system handle images, videos, or documents?
•Search: Do users need to search content, products, or entities?
•Workflow: Are there multi-step processes requiring coordination?
•Cache: Do you have hot data or expensive computations?
•Message Bus: Do components need asynchronous, decoupled communication?
•Scheduler: Do you need to run periodic or delayed tasks?
•Analytics: Do you need to collect, aggregate, and report on events?

Case Study: Identifying Components for a Food Delivery System

Let's apply component identification to a realistic system: a food delivery platform like DoorDash or Uber Eats.

Functional Requirements Summary:

Customers can browse restaurants, view menus, and place orders
Restaurants manage menus, accept/reject orders, and confirm readiness
Drivers accept deliveries, navigate routes, and confirm delivery
Real-time order tracking for customers
Payment processing with driver payouts
Ratings and reviews for restaurants and drivers

Step 1: Extract Nouns and Verbs

Key Nouns: Customers, Restaurants, Drivers, Orders, Menus, Items, Payments, Reviews, Locations

Key Verbs: Browse, Search, Order, Accept, Reject, Dispatch, Track, Pay, Rate, Navigate

Step 2: Identify Domain Boundaries

Grouping by coherent domains:

Customer Domain:

Customer registration and profiles
Address management
Order history and preferences

Restaurant Domain:

Restaurant profiles and operating hours
Menu management (items, pricing, availability)
Order acceptance and preparation tracking

Driver Domain:

Driver registration and verification
Availability and location tracking
Delivery history and performance

Order Domain:

Order creation and lifecycle
Cart management
Order status tracking

Dispatch Domain:

Driver-order matching
Route optimization
Real-time tracking

Payment Domain:

Customer charges
Restaurant settlements
Driver payouts
Refund processing

Review Domain:

Customer reviews for restaurants and drivers
Restaurant responses
Aggregate ratings

Step 3: Single Responsibility Test

✅ Order Service: Manages order lifecycle from creation to completion ✅ Dispatch Service: Matches available drivers with orders based on location and optimization ✅ Payment Service: Handles all monetary transactions and settlements ✅ Location Service: Tracks and serves real-time driver positions ✅ Notification Service: Delivers status updates across channels

Step 4: Identify Additional Components

Analyzing the design reveals needs for:

API Gateway: Single entry point for mobile apps and web clients
Search Service: Restaurant and menu search with filters
Media Service: Restaurant photos, menu images
Analytics Service: Business intelligence, operational metrics
Pricing Service: Dynamic pricing, promotions, fees calculation

Step 5: Final Component List

Customer Service - User accounts, preferences, addresses
Restaurant Service - Restaurant profiles, menus, availability
Driver Service - Driver profiles, vehicles, documents
Order Service - Order lifecycle management
Dispatch Service - Driver assignment and optimization
Location Service - Real-time position tracking
Payment Service - Transactions, settlements, payouts
Review Service - Ratings and reviews
Search Service - Restaurant and menu search
Notification Service - Push, SMS, email delivery
Pricing Service - Fees, promotions, dynamic pricing
API Gateway - Routing, auth, rate limiting

Component Boundaries Evolve

This decomposition represents one valid approach. Real-world systems evolve: Location Service might merge with Dispatch Service if they're always deployed together, or Order Service might split into Cart Service and Fulfillment Service as complexity grows. Start with clear boundaries and adjust based on actual operational needs.

Documenting Components

Component identification doesn't end with a list. Each component should be documented with enough detail for the design to be implemented and evaluated.

Component Specification Template

For each component, define:

1. Purpose Statement One sentence describing what this component does and why it exists.

2. Responsibilities

What this component is responsible for (scope-in)
What this component is NOT responsible for (scope-out)

3. Data Ownership

What data entities does this component own?
What is the expected data volume and growth?

4. Key Operations

Primary operations/APIs this component exposes
Expected latency and throughput requirements

5. Dependencies

Other components this component calls synchronously
Events this component subscribes to

6. Outputs

Events this component publishes
Data this component exposes to others

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
# Order Service Component Specification
 
purpose: |
  Manages the complete lifecycle of customer orders from
  creation through fulfillment and completion.
 
responsibilities:
  in_scope:
    - Order creation and cart management
    - Order status transitions and validation
    - Order history storage and retrieval
    - Coordinating order events to other services
  out_of_scope:
    - Payment processing (Payment Service)
    - Driver assignment (Dispatch Service)
    - Customer data management (Customer Service)
 
data_ownership:
  entities:
    - Orders (primary entity)
    - OrderItems (line items within orders)
    - OrderStatusHistory (audit trail)
  volumes:
    - Expected: 100K orders/day initially
    - Growth: 10x over 2 years
 
key_operations:
  - CreateOrder: Creates new order from cart
    latency: < 200ms p99
    throughput: 500 req/sec peak
  - GetOrder: Retrieves order details
    latency: < 50ms p99
    throughput: 2000 req/sec peak
  - UpdateOrderStatus: Transitions order state
    latency: < 100ms p99
    throughput: 1000 req/sec peak
 
dependencies:
  synchronous:
    - CustomerService: Validate customer, get address
    - RestaurantService: Validate menu items, prices
    - PricingService: Calculate fees, apply promotions
  asynchronous:
    - PaymentCompleted event from PaymentService
    - DeliveryCompleted event from DispatchService
 
outputs:
  events_published:
    - OrderCreated
    - OrderConfirmed
    - OrderReadyForPickup
    - OrderDelivered
    - OrderCancelled

Documentation Depth

In interview settings, you won't write formal specifications. But articulating these details verbally demonstrates thoroughness. For real projects, this documentation prevents ambiguity and enables parallel implementation by different teams.

Common Pitfalls in Component Identification

Even experienced architects make mistakes during component identification. Recognizing these patterns helps avoid them.

Pitfall 1: Entity-Based Decomposition

Mistake: Creating one service per database table (UserService, OrderService, AddressService, CartService, CartItemService...)

Problem: This fragments related functionality. A single operation requires orchestrating many services. Operational complexity explodes.

Solution: Group entities by domain boundary, not by table. Cart and CartItem belong together in Order context.

Pitfall 2: Technology-Based Decomposition

Mistake: Organizing by technology (Frontend, Backend, Database, Cache, Queue)

Problem: This doesn't reflect business domains. Changes to a single feature touch every 'component'. No independent evolution.

Solution: Each component should own its full stack from API to data storage. Technology is implementation, not architecture.

Pitfall 3: Premature Extraction

Mistake: Creating a separate service for every potential reuse opportunity (StringUtilsService, ValidationService, LoggingService)

Problem: Overhead of network calls, deployment, and monitoring for trivial functionality. Libraries work better for utilities.

Solution: Extract services for capabilities that require independence (different scaling, different data, different teams), not for code reuse.

Pitfall 4: Ignoring Data

Mistake: Defining components without considering data ownership and access patterns

Problem: Multiple services end up reading/writing the same database, creating hidden coupling and consistency issues.

Solution: Data ownership must be a primary consideration. If two components need the same data, reconsider boundaries.

Red Flags in Component Design

•Service names ending in 'Manager', 'Handler', 'Processor' without specific domain context
•Components that are always deployed together but defined separately
•Components that can't be explained in one sentence without using 'and'
•Services that require database joins across ownership boundaries
•Components named after technical functions rather than business capabilities
•More than 10-12 synchronous hops in any request path

Summary: Component Identification

Component identification transforms requirements into architectural building blocks. Let's consolidate the key principles:

Key Takeaways

•Components are logical units — Autonomous, single-responsibility building blocks with clear boundaries and interfaces.
•Decomposition requires balance — Too few components create monoliths; too many create distributed complexity. Find appropriate granularity.
•Multiple strategies apply — Domain-driven, capability-based, use case-based, and data-oriented decomposition each have strengths.
•The process is iterative — Extract nouns/verbs, identify boundaries, test responsibilities, analyze data, and refine continuously.
•Patterns accelerate identification — Common components (Gateway, Auth, Notification, Search) appear across most systems.
•Documentation clarifies intent — Specify purpose, responsibilities, data, operations, dependencies, and outputs for each component.
•Avoid common pitfalls — Entity-based, technology-based, and premature extraction lead to poor designs.

What's next:

With components identified, our next step is visualizing their relationships through system architecture diagrams. The next page covers how to effectively communicate system structure through diagrams that serve as blueprints for implementation.

Page Complete

You now understand how to systematically identify architectural components from system requirements. This skill forms the foundation of high-level design, enabling you to translate abstract requirements into concrete, implementable building blocks.

1 / 5

Loading learning content...

System Design (HLD)High-Level Design

High-Level Design: From Requirements to Architecture

LevelIntermediate

Duration180 mins

TopicHigh-Level Design

1 / 5

Component Identification: Decomposing Systems into Building Blocks

The Architecture Layer Cake

What You Will Learn

What Is a Component?

In the context of high-level design, a component is an autonomous unit of functionality with:

Clear boundaries: Well-defined inputs, outputs, and interfaces
Single responsibility: Focused purpose that can be described in one sentence
Independent deployability: Can be deployed, scaled, and updated without requiring changes to other components
Encapsulated state: Owns and manages its own data and internal state

Component Types in System Design
Component Type	Description	Examples
Compute Components	Execute business logic and handle requests	API servers, background workers, stream processors
Data Components	Store and serve persistent data	Databases, caches, blob stores, search indices
Messaging Components	Enable asynchronous communication between services	Message queues, event buses, pub/sub systems
Gateway Components	Manage external access and routing	API gateways, load balancers, CDNs
Infrastructure Components	Support observability, security, and operations	Logging services, monitoring, secret managers

Components Are Not Microservices

The Decomposition Challenge

Component identification seems straightforward in theory but presents substantial challenges in practice. The core tension lies between two opposing forces:

Under-decomposition (too few components):

Creates monolithic structures that are hard to scale independently
Leads to tight coupling where changes ripple across the entire system
Makes it difficult to assign clear ownership to teams
Forces all parts of the system to scale together, even when only one part needs more capacity

Over-decomposition (too many components):

Increases operational complexity and deployment overhead
Creates distributed system challenges: network latency, partial failures, data consistency
Requires sophisticated coordination mechanisms
Can lead to 'distributed monolith' anti-pattern where tightly coupled services lose benefits of separation

The goal is to find the appropriate granularity that balances autonomy, cohesion, and operational complexity. This is deeply context-dependent—there's no universal 'right' number of components.

Signs of Poor Decomposition

•Circular dependencies: Component A calls B, which calls C, which calls A. Design is tangled.
•Shared databases: Multiple components directly access the same tables. No clear data ownership.
•Change amplification: Modifying one feature requires changing multiple components.
•Phantom components: Components that exist but have unclear purpose or duplicated functionality.
•Synchronized deployments: Components must be deployed together, despite being 'separate'.
•Cross-component transactions: Business operations span multiple components atomically.

The Distributed Monolith Trap

Decomposition Strategies

Several proven strategies guide component identification. These aren't mutually exclusive—production systems typically employ a combination based on context.

Strategy 1: Domain-Driven Decomposition

Example: E-commerce platform decomposed by domain:

Catalog Context: Product information, categories, search
Inventory Context: Stock levels, warehouse locations, reservations
Order Context: Order lifecycle, fulfillment tracking
Payment Context: Payment processing, refunds, invoicing
Customer Context: User profiles, preferences, authentication

Each context owns its data and logic, communicating with others through well-defined interfaces. This mirrors organizational structure and enables Conway's Law to work in your favor.

Strategy 2: Capability-Based Decomposition

Organize components around technical capabilities that can be reused across domains. This approach extracts cross-cutting concerns into shared infrastructure.

Example: Platform capabilities:

Notification Service: Email, SMS, push notifications for all domains
File Storage Service: Binary storage and retrieval shared across features
Search Service: Unified search infrastructure for products, users, content
Analytics Service: Event collection and reporting across all domains
Authentication Service: Identity and access management for entire platform

Capability-based components become internal platforms that other components consume, enabling consistency and reducing duplication.

Strategy 3: Use Case-Based Decomposition

Group functionality around primary user journeys or use cases. This is particularly effective for systems with distinct user flows that share limited data.

Example: Video platform use cases:

Upload Flow: Video ingestion, transcoding, thumbnail generation
Watch Flow: Stream serving, adaptive bitrate, CDN distribution
Social Flow: Comments, likes, shares, subscriptions
Creator Flow: Analytics dashboard, monetization, channel management
Discovery Flow: Recommendations, search, trending algorithms

Each flow can evolve independently based on its specific optimization goals and user needs.

Strategy 4: Data-Oriented Decomposition

Partition components based on data ownership and access patterns. Components are organized around the data they own, with clear rules for data access.

Example: Social network by data ownership:

User Service: Owns user profiles and credentials
Graph Service: Owns social connections (follows, friends)
Content Service: Owns posts, media references
Feed Service: Owns user timelines and feed caches
Interaction Service: Owns likes, comments, reactions

This strategy naturally prevents shared databases and makes data ownership explicit, crucial for maintaining consistency in distributed systems.

Decomposition Strategy Comparison
Strategy	Best For	Watch Out For
Domain-Driven	Complex business domains with clear boundaries	Domains that overlap or have ambiguous ownership
Capability-Based	Cross-cutting technical concerns, platform teams	Over-extracting and creating too many shared services
Use Case-Based	Distinct user journeys with different optimization needs	Shared data and logic across use cases requiring coordination
Data-Oriented	Systems where data ownership is critical (GDPR, compliance)	Business logic that spans multiple data entities

The Component Identification Process

Component identification is iterative, not linear. However, a structured approach increases the likelihood of a coherent design. Here's a systematic process that works across most system types:

Step 1: Extract Nouns and Verbs from Requirements

Start with your functional requirements and use cases. Identify:

Nouns: Entities the system manages (Users, Orders, Products, Messages)
Verbs: Actions the system performs (Create, Search, Notify, Validate)

Nouns often become data components or domain services. Verbs often become operations within components or separate processing services.

Example: 'Users can upload videos which are then transcoded and distributed to viewers'

Nouns: Users, Videos, Viewers
Verbs: Upload, Transcode, Distribute
Initial components: User Service, Video Storage, Transcoder, Content Distribution

Step 2: Identify Domain Boundaries

Group related nouns and verbs into coherent domains. Look for:

Entities that change together: If Product and Inventory always update simultaneously, they might belong together
Consistent language: Where terminology shifts, boundaries likely exist
Team ownership: Who would own and develop each area?

Draw tentative boundaries around clusters. These become candidate components.

Step 3: Apply the Single Responsibility Test

For each candidate component, articulate its purpose in one sentence. If you can't, the component is probably too broad.

✅ Good: 'The Notification Service handles delivering messages to users across email, SMS, and push channels'

❌ Poor: 'The Core Service handles user management, order processing, and notification delivery'

Components with multiple responsibilities should be split.

Step 4: Analyze Data Ownership

For each candidate component, define:

What data does it own? (Authoritative source of truth)
What data does it need from others? (Dependencies on other components)
How will it share data? (Synchronous calls, events, data replication)

Ideally, each piece of data has exactly one owner. If multiple components need write access to the same data, reconsider boundaries.

Step 5: Identify Synchronous vs. Asynchronous Interactions

Determine how components will communicate:

Synchronous (request-response): When immediate response is required for the operation
Asynchronous (events/messages): When temporal decoupling is acceptable or preferred

Heavy synchronous coupling suggests components might belong together. Natural async boundaries suggest good separation points.

Step 6: Evaluate Against Non-Functional Requirements

Test your decomposition against system requirements:

Scalability: Can each component scale independently? If video transcoding needs 100x more compute than user management, separation helps.
Availability: Can failure in one component be isolated? If payment fails, should catalog browsing also fail?
Latency: Does decomposition add unacceptable network hops? Critical paths might need co-location.
Security: Do security boundaries align with component boundaries? Sensitive components (payment, auth) might need isolation.

Step 7: Iterate and Refine

Component identification is rarely correct on the first pass. As you diagram the system and trace data flows, you'll discover:

Missing components needed for coordination
Over-specified components that should merge
Unclear boundaries that need sharper definition

Iterate until the design feels cohesive and each component has a clear, singular purpose.

Start Coarse, Refine Later

Common Component Patterns

Certain components appear repeatedly across different system designs. Recognizing these patterns accelerates identification and leverages proven solutions.

Pattern: API Gateway

A single entry point for client requests that handles:

Request routing to appropriate backend services
Authentication and authorization
Rate limiting and throttling
Request/response transformation
Protocol translation (REST to gRPC)

Appears in: Nearly every client-facing system at scale

Pattern: Authentication/Identity Service

Dedicated component for identity management:

User registration and credential storage
Session management and token issuance
OAuth integration and social login
Password reset and account recovery
Multi-factor authentication

Appears in: Any system with user accounts

Pattern: Notification Hub

Centralized notification delivery across channels:

Channel abstraction (email, SMS, push, in-app)
Template management and personalization
Delivery tracking and retry logic
User preference management
Rate limiting and batching

Appears in: E-commerce, social platforms, SaaS applications

Pattern: Media Processing Pipeline

Asynchronous processing for binary content:

Upload handling and validation
Format conversion and optimization
Metadata extraction
Storage orchestration
CDN integration

Appears in: Video platforms, image-heavy apps, document management

Pattern: Search Service

Dedicated search infrastructure:

Index management and updates
Query parsing and optimization
Ranking and relevance scoring
Faceting and filtering
Typeahead and suggestions

Appears in: E-commerce, content platforms, enterprise search

Pattern: Workflow Orchestrator

Coordinator for multi-step processes:

State machine management
Step execution and transitions
Compensation and rollback
Timeout handling
Progress tracking

Appears in: Order fulfillment, approval workflows, data pipelines

Pattern: Cache Layer

Performance optimization through caching:

Frequently accessed data caching
Session storage
Computation result memoization
Rate limit counters
Distributed locking

Appears in: High-traffic read-heavy systems

Pattern: Event Bus / Message Broker

Asynchronous communication backbone:

Event publishing and subscription
Message queuing and delivery
Topic-based routing
Dead letter handling
Event replay capability

Appears in: Event-driven architectures, decoupled microservices

Reusable Component Checklist

•Gateway/Ingress: Do you need a single entry point for routing, auth, and rate limiting?
•Identity/Auth: Does your system have users requiring authentication?
•Notification: Do you need to deliver messages across multiple channels?
•Media Processing: Does your system handle images, videos, or documents?
•Search: Do users need to search content, products, or entities?
•Workflow: Are there multi-step processes requiring coordination?
•Cache: Do you have hot data or expensive computations?
•Message Bus: Do components need asynchronous, decoupled communication?
•Scheduler: Do you need to run periodic or delayed tasks?
•Analytics: Do you need to collect, aggregate, and report on events?

Case Study: Identifying Components for a Food Delivery System

Let's apply component identification to a realistic system: a food delivery platform like DoorDash or Uber Eats.

Functional Requirements Summary:

Customers can browse restaurants, view menus, and place orders
Restaurants manage menus, accept/reject orders, and confirm readiness
Drivers accept deliveries, navigate routes, and confirm delivery
Real-time order tracking for customers
Payment processing with driver payouts
Ratings and reviews for restaurants and drivers

Step 1: Extract Nouns and Verbs

Key Nouns: Customers, Restaurants, Drivers, Orders, Menus, Items, Payments, Reviews, Locations

Key Verbs: Browse, Search, Order, Accept, Reject, Dispatch, Track, Pay, Rate, Navigate

Step 2: Identify Domain Boundaries

Grouping by coherent domains:

Customer Domain:

Customer registration and profiles
Address management
Order history and preferences

Restaurant Domain:

Restaurant profiles and operating hours
Menu management (items, pricing, availability)
Order acceptance and preparation tracking

Driver Domain:

Driver registration and verification
Availability and location tracking
Delivery history and performance

Order Domain:

Order creation and lifecycle
Cart management
Order status tracking

Dispatch Domain:

Driver-order matching
Route optimization
Real-time tracking

Payment Domain:

Customer charges
Restaurant settlements
Driver payouts
Refund processing

Review Domain:

Customer reviews for restaurants and drivers
Restaurant responses
Aggregate ratings

Step 3: Single Responsibility Test

Step 4: Identify Additional Components

Analyzing the design reveals needs for:

API Gateway: Single entry point for mobile apps and web clients
Search Service: Restaurant and menu search with filters
Media Service: Restaurant photos, menu images
Analytics Service: Business intelligence, operational metrics
Pricing Service: Dynamic pricing, promotions, fees calculation

Step 5: Final Component List

Customer Service - User accounts, preferences, addresses
Restaurant Service - Restaurant profiles, menus, availability
Driver Service - Driver profiles, vehicles, documents
Order Service - Order lifecycle management
Dispatch Service - Driver assignment and optimization
Location Service - Real-time position tracking
Payment Service - Transactions, settlements, payouts
Review Service - Ratings and reviews
Search Service - Restaurant and menu search
Notification Service - Push, SMS, email delivery
Pricing Service - Fees, promotions, dynamic pricing
API Gateway - Routing, auth, rate limiting

Component Boundaries Evolve

Documenting Components

Component identification doesn't end with a list. Each component should be documented with enough detail for the design to be implemented and evaluated.

Component Specification Template

For each component, define:

1. Purpose Statement One sentence describing what this component does and why it exists.

2. Responsibilities

What this component is responsible for (scope-in)
What this component is NOT responsible for (scope-out)

3. Data Ownership

What data entities does this component own?
What is the expected data volume and growth?

4. Key Operations

Primary operations/APIs this component exposes
Expected latency and throughput requirements

5. Dependencies

Other components this component calls synchronously
Events this component subscribes to

6. Outputs

Events this component publishes
Data this component exposes to others

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
# Order Service Component Specification
 
purpose: |
  Manages the complete lifecycle of customer orders from
  creation through fulfillment and completion.
 
responsibilities:
  in_scope:
    - Order creation and cart management
    - Order status transitions and validation
    - Order history storage and retrieval
    - Coordinating order events to other services
  out_of_scope:
    - Payment processing (Payment Service)
    - Driver assignment (Dispatch Service)
    - Customer data management (Customer Service)
 
data_ownership:
  entities:
    - Orders (primary entity)
    - OrderItems (line items within orders)
    - OrderStatusHistory (audit trail)
  volumes:
    - Expected: 100K orders/day initially
    - Growth: 10x over 2 years
 
key_operations:
  - CreateOrder: Creates new order from cart
    latency: < 200ms p99
    throughput: 500 req/sec peak
  - GetOrder: Retrieves order details
    latency: < 50ms p99
    throughput: 2000 req/sec peak
  - UpdateOrderStatus: Transitions order state
    latency: < 100ms p99
    throughput: 1000 req/sec peak
 
dependencies:
  synchronous:
    - CustomerService: Validate customer, get address
    - RestaurantService: Validate menu items, prices
    - PricingService: Calculate fees, apply promotions
  asynchronous:
    - PaymentCompleted event from PaymentService
    - DeliveryCompleted event from DispatchService
 
outputs:
  events_published:
    - OrderCreated
    - OrderConfirmed
    - OrderReadyForPickup
    - OrderDelivered
    - OrderCancelled

Documentation Depth

Common Pitfalls in Component Identification

Even experienced architects make mistakes during component identification. Recognizing these patterns helps avoid them.

Pitfall 1: Entity-Based Decomposition

Mistake: Creating one service per database table (UserService, OrderService, AddressService, CartService, CartItemService...)

Problem: This fragments related functionality. A single operation requires orchestrating many services. Operational complexity explodes.

Solution: Group entities by domain boundary, not by table. Cart and CartItem belong together in Order context.

Pitfall 2: Technology-Based Decomposition

Mistake: Organizing by technology (Frontend, Backend, Database, Cache, Queue)

Problem: This doesn't reflect business domains. Changes to a single feature touch every 'component'. No independent evolution.

Solution: Each component should own its full stack from API to data storage. Technology is implementation, not architecture.

Pitfall 3: Premature Extraction

Mistake: Creating a separate service for every potential reuse opportunity (StringUtilsService, ValidationService, LoggingService)

Problem: Overhead of network calls, deployment, and monitoring for trivial functionality. Libraries work better for utilities.

Solution: Extract services for capabilities that require independence (different scaling, different data, different teams), not for code reuse.

Pitfall 4: Ignoring Data

Mistake: Defining components without considering data ownership and access patterns

Problem: Multiple services end up reading/writing the same database, creating hidden coupling and consistency issues.

Solution: Data ownership must be a primary consideration. If two components need the same data, reconsider boundaries.

Red Flags in Component Design

•Service names ending in 'Manager', 'Handler', 'Processor' without specific domain context
•Components that are always deployed together but defined separately
•Components that can't be explained in one sentence without using 'and'
•Services that require database joins across ownership boundaries
•Components named after technical functions rather than business capabilities
•More than 10-12 synchronous hops in any request path

Summary: Component Identification

Component identification transforms requirements into architectural building blocks. Let's consolidate the key principles:

Key Takeaways

•Components are logical units — Autonomous, single-responsibility building blocks with clear boundaries and interfaces.
•Decomposition requires balance — Too few components create monoliths; too many create distributed complexity. Find appropriate granularity.
•Multiple strategies apply — Domain-driven, capability-based, use case-based, and data-oriented decomposition each have strengths.
•The process is iterative — Extract nouns/verbs, identify boundaries, test responsibilities, analyze data, and refine continuously.
•Patterns accelerate identification — Common components (Gateway, Auth, Notification, Search) appear across most systems.
•Documentation clarifies intent — Specify purpose, responsibilities, data, operations, dependencies, and outputs for each component.
•Avoid common pitfalls — Entity-based, technology-based, and premature extraction lead to poor designs.

What's next:

Page Complete

1 / 5