System Design (LLD)The Cost of Wrong Abstractions

The Cost of Wrong Abstractions

LevelIntermediate

Duration90 mins

TopicThe Cost of Wrong Abstractions

2 / 4

Under-Abstraction — The Hidden Tax of Concrete Code

The Opposite Extreme

If over-abstraction is the disease of the intermediately skilled engineer, under-abstraction is frequently the disease of the pragmatic hacker—and sometimes of teams that overcorrected from previous over-abstraction. Under-abstraction occurs when code lacks the generalizations needed to prevent duplication, inconsistency, and rigidity.

The symptoms are familiar: copy-pasted logic scattered across files, subtle bugs where one copy was fixed but others weren't, and a growing sense that every small change requires hunting through the entire codebase.

What You Will Learn

By the end of this page, you will understand what under-abstraction is, why engineers avoid necessary abstraction, the concrete costs it imposes on software projects, and practical heuristics to identify when abstraction is genuinely needed.

Defining Under-Abstraction

Under-abstraction occurs when the absence of abstraction creates costs that would be eliminated by appropriate generalization. Unlike over-abstraction, which adds unnecessary complexity, under-abstraction fails to capture commonalities that exist in the problem domain.

Under-abstraction manifests in characteristic patterns:

Forms of Under-Abstraction

•Duplicated Logic — The same algorithm or business rule appears in multiple places, each slightly different, each a potential source of divergent bugs.
•Primitive Obsession — Complex domain concepts represented as strings, numbers, or raw data structures instead of meaningful types.
•Missing Domain Concepts — Business logic scattered across procedures without cohesive representation of the entities involved.
•Hardcoded Variations — Conditional branches that could be polymorphism, switch statements that could be strategy patterns, but aren't.
•Incomplete Encapsulation — Data structures with their manipulation logic separated and spread across consumers.
•Inline Everything — Functions that do too much because extracting common patterns was "too much work."

The Maintenance Trap

Under-abstracted code often seems simpler initially—no layers to traverse, no patterns to learn. But this simplicity is illusory. The real complexity is distributed across duplications, each of which must be maintained separately. What looks simple is actually just scattered.

Why Under-Abstraction Happens

Understanding why under-abstraction occurs helps prevent it. Several forces push engineers away from necessary abstraction:

Root Causes of Under-Abstraction

•Time Pressure — "I don't have time to refactor, I'll just copy this and modify." Short-term speed creates long-term debt.
•Fear of Premature Abstraction — Having been burned by over-abstraction (or read articles about it), engineers overcorrect and avoid abstraction entirely.
•Lack of Domain Understanding — Without clear understanding of the problem space, engineers can't see the abstractions that should exist.
•"It's Only Two Places" — Duplication seems acceptable at first. By the time it's five places, the abstraction seems too hard to retrofit.
•Local Optimization — Each developer solves their immediate problem without seeing the broader pattern across the codebase.
•Unfamiliarity with Abstraction Techniques — Some engineers lack experience with composition, strategy patterns, or other abstraction mechanisms.
•Copy-Paste Culture — Teams that normalize copying code perpetuate under-abstraction through social norms.

The "just ship it" trap:

In fast-moving environments, there's constant pressure to deliver quickly. Abstraction feels like gold-plating—unnecessary polish that slows delivery. But this perspective confuses premature abstraction with appropriate abstraction.

Premature abstraction is harmful because it guesses at patterns that may not exist. Appropriate abstraction extracts patterns that already exist and are already causing problems. The former is speculation; the latter is debt repayment.

Teams that never abstract because "we'll clean it up later" often never do. The debt accumulates until the codebase becomes a maintenance nightmare, and eventually a rewrite seems easier than rehabilitation.

The Refactoring Window

The best time to abstract is immediately after you identify the pattern. The second duplication is the signal; don't wait for the third, fourth, or fifth. When you copy code and think "this is similar to...", that's the moment to stop and extract the abstraction.

The Concrete Costs of Under-Abstraction

Under-abstraction imposes severe costs on software projects, often worse than the abstractions it avoids would have cost:

The Multi-Dimensional Costs of Under-Abstraction
Cost Category	Impact	Example
Duplicated Bug Fixes	Every bug must be found and fixed in multiple locations	Security vulnerability patched in 2 of 5 copies; 3 remain exploitable
Inconsistent Behavior	Same concept behaves differently in different contexts	Order total calculation differs between cart and checkout
Change Amplification	Single logical change requires multiple code changes	Updating tax calculation requires changes in 12 files
Knowledge Loss	Logic purpose unclear when scattered and duplicated	"Why do we add 0.5 before rounding? No one remembers"
Test Burden	Same logic tested repeatedly, often inconsistently	Cart total has 50 tests; checkout total has 3
Onboarding Confusion	New developers can't find 'the' implementation	"Which of these 4 validation functions should I use?"
Type Safety Erosion	Primitive types can't prevent semantic errors	Mixing up orderId and customerId (both strings)

The shotgun surgery problem:

Under-abstraction's signature pain is shotgun surgery: a single logical change requires modifications in many different places. If you need to update how prices are formatted, and price formatting logic is copy-pasted in 15 places, you're performing shotgun surgery on every such change.

The risk isn't just the effort—it's the probability of missing one location. Every incomplete shotgun surgery creates an inconsistency. Over time, the codebase diverges from itself, with subtly different behaviors depending on which code path is executed.

The Consistency Tax

Under-abstracted codebases require constant vigilance to maintain consistency. Every change must be cross-referenced against other locations. This vigilance is an invisible tax on all development—and taxes are often unpaid, leading to bugs.

Case Study: Duplicated Validation Logic

Let's examine a concrete example of under-abstraction and its consequences. Consider a system that validates email addresses in multiple places:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
// user-registration.ts
function registerUser(email: string, password: string) {
    // Inline email validation
    if (!email || !email.includes('@') || email.length < 5) {
        throw new Error('Invalid email');
    }
    // ... registration logic
}
 
// newsletter-signup.ts  
function subscribeToNewsletter(email: string) {
    // Similar but slightly different validation
    if (!email.includes('@') || !email.includes('.')) {
        throw new Error('Please enter a valid email');
    }
    // ... subscription logic
}
 
// contact-form.ts
function submitContactForm(email: string, message: string) {
    // Yet another variation
    const emailRegex = /^[^\s@]+@[^\s@]+\.[^\s@]+$/;
    if (!emailRegex.test(email)) {
        throw new Error('Email format is invalid');
    }
    // ... form submission logic
}
 
// password-reset.ts
function requestPasswordReset(email: string) {
    // Copy-pasted with modifications
    if (email.indexOf('@') === -1) {
        throw new Error('Not a valid email address');
    }
    // ... reset logic
}
 
// Problems:
// 1. Four different validation rules for the same concept
// 2. Different error messages cause inconsistent UX
// 3. Bug in one (missing null check) doesn't affect others
// 4. Updating validation requires finding and changing all copies

Now let's see the properly abstracted version:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
// email.ts - Single source of truth for email concept
 
/**
 * Email value object - represents a validated email address.
 * If an Email instance exists, it's guaranteed to be valid.
 */
class Email {
    private static readonly EMAIL_REGEX = /^[^\s@]+@[^\s@]+\.[^\s@]+$/;
    
    private constructor(private readonly value: string) {}
    
    /**
     * Factory method that validates and creates Email.
     * Throws if email is invalid, ensuring all Email instances are valid.
     */
    static create(input: string): Email {
        if (!input || typeof input !== 'string') {
            throw new InvalidEmailError('Email is required');
        }
        
        const trimmed = input.trim().toLowerCase();
        
        if (!Email.EMAIL_REGEX.test(trimmed)) {
            throw new InvalidEmailError(`'${input}' is not a valid email address`);
        }
        
        return new Email(trimmed);
    }
    
    static isValid(input: string): boolean {
        try {
            Email.create(input);
            return true;
        } catch {
            return false;
        }
    }
    
    toString(): string {
        return this.value;
    }
    
    equals(other: Email): boolean {
        return this.value === other.value;
    }
    
    getDomain(): string {
        return this.value.split('@')[1];
    }
}
 
class InvalidEmailError extends Error {
    constructor(message: string) {
        super(message);
        this.name = 'InvalidEmailError';
    }
}
 
// Now all usages are consistent:
 
// user-registration.ts
function registerUser(email: Email, password: string) {
    // Email is already validated by type system
    // ... registration logic
}
 
// newsletter-signup.ts  
function subscribeToNewsletter(email: Email) {
    // Guaranteed valid
    // ... subscription logic
}
 
// Usage at API boundary:
const email = Email.create(request.body.email); // Validates once
registerUser(email, password);
 
// Benefits:
// 1. Single validation rule, enforced everywhere
// 2. Consistent error handling
// 3. Type safety prevents mixing up strings
// 4. Rich domain model (getDomain, equals, etc.)
// 5. Changes in one place affect entire system

Value Objects

The Email class is a value object—it represents a concept from the domain with specific semantics. Using value objects instead of primitives eliminates entire categories of bugs and creates self-documenting code. If a function takes an Email parameter, you know it requires a valid email without checking the implementation.

Recognizing Under-Abstraction in Your Codebase

Identifying under-abstraction requires examining your codebase for patterns of duplication and scattered logic. Here are concrete signals:

Warning Signs of Under-Abstraction

•Grep Duplication — If you can grep for a code pattern and find it in multiple files with minor variations, you have duplication that might benefit from abstraction.
•Shotgun Surgery — If recent changes required modifications in many files for a single logical change, the changed code likely represents an under-abstracted concept.
•Primitive Parameters — Functions that take (string, string, number, boolean) instead of meaningful domain types suffer from primitive obsession.
•Long Parameter Lists — When functions need many parameters, it often indicates missing objects that group related data.
•Similar Bug Fixes — If the same bug class recurs in different parts of the codebase, the shared logic should likely be consolidated.
•Inconsistent Behavior Reports — When QA reports "it works here but not there" for what should be the same feature, duplication has diverged.
•Comment-Heavy Procedures — Excessive comments explaining what code does often signal missing abstractions that would make the code self-documenting.

Signs of Under-Abstraction

•Copy-paste detected by linters
•Same calculation in multiple places
•Business rules scattered across files
•Strings used for everything
•Long switch statements on types
•Inconsistent formatting/validation

Signs of Appropriate Abstraction

•Single source of truth for logic
•Calculations defined once, used many times
•Business rules in domain objects
•Domain types (Money, Email, UserId)
•Polymorphism replaces conditionals
•Consistent behavior guaranteed by type

The Similarity Test

Review recent pull requests. If any PR fixed a bug by changing similar code in multiple files, you've identified under-abstraction. Track these occurrences—they're a roadmap for where abstractions are needed.

Strategies to Address Under-Abstraction

Addressing under-abstraction requires both immediate refactoring and cultural changes to prevent recurrence:

Refactoring Techniques for Under-Abstraction

•Extract Method/Function — When the same logic appears inline in multiple places, extract it to a named function. This is the simplest form of abstraction.
•Extract Class — When multiple functions operate on the same data, group them into a class with that data. This creates a home for related logic.
•Replace Primitive with Object — When a primitive (string, number) represents a domain concept, create a value object. Money, Email, PhoneNumber, OrderId are better than raw types.
•Introduce Parameter Object — When multiple parameters travel together, group them into a single object. This simplifies signatures and reveals domain concepts.
•Replace Conditional with Polymorphism — When switch statements or if-else chains select behavior based on type, use polymorphism instead.
•Pull Up Method — When subclasses share similar methods, move the common logic to the parent class.
•Form Template Method — When algorithms share structure but vary in steps, extract the skeleton and make steps overridable.

The Boy Scout Rule:

"Always leave the code better than you found it." When you encounter duplication while working on a feature, take time to extract the abstraction. Don't create a separate "refactoring sprint"—integrate refactoring into daily work.

This incremental approach has several advantages:

Each refactoring is small and low-risk
You refactor code you're already understanding
The codebase gradually improves
Refactoring becomes habitual, not exceptional

The Second Occurrence Rule

When you see the second occurrence of similar code, extract the abstraction immediately. Don't wait for a third. The marginal cost of extracting on the second occurrence is minimal; the marginal cost of tracking down a fifth occurrence later is substantial.

Primitive Obsession: Under-Abstraction's Most Common Form

Primitive Obsession deserves special attention because it's the most pervasive form of under-abstraction. It occurs when developers use built-in types (strings, numbers, booleans) to represent domain concepts that deserve their own types.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
// Primitive Obsession - Using strings for everything
 
function processOrder(
    orderId: string,         // Could be any string
    customerId: string,      // Easy to mix up with orderId
    productCode: string,     // No validation
    quantity: number,        // Could be negative
    priceInCents: number     // Unit unclear, could be dollars
): void {
    // What happens if we pass customerId where orderId is expected?
    // The compiler won't catch it - they're both strings
}
 
// Realistic bug: arguments swapped
processOrder(
    customer.id,    // Oops, should be orderId
    order.id,       // Oops, should be customerId
    "ABC123",
    5,
    1999
);
 
// ---
 
// Proper Domain Types - Self-documenting and type-safe
 
class OrderId {
    private constructor(private readonly value: string) {}
    static create(value: string): OrderId {
        if (!value.match(/^ORD-\d{6}$/)) {
            throw new Error('Invalid order ID format');
        }
        return new OrderId(value);
    }
    toString(): string { return this.value; }
}
 
class CustomerId {
    private constructor(private readonly value: string) {}
    static create(value: string): CustomerId {
        if (!value.match(/^CUS-\d{8}$/)) {
            throw new Error('Invalid customer ID format');
        }
        return new CustomerId(value);
    }
    toString(): string { return this.value; }
}
 
class Money {
    private constructor(
        private readonly cents: number,
        private readonly currency: string
    ) {}
    
    static usd(dollars: number): Money {
        return new Money(Math.round(dollars * 100), 'USD');
    }
    
    static cents(cents: number): Money {
        if (!Number.isInteger(cents)) {
            throw new Error('Cents must be integer');
        }
        return new Money(cents, 'USD');
    }
    
    add(other: Money): Money {
        if (this.currency !== other.currency) {
            throw new Error('Cannot add different currencies');
        }
        return new Money(this.cents + other.cents, this.currency);
    }
    
    toDisplayString(): string {
        return `$${(this.cents / 100).toFixed(2)}`;
    }
}
 
class Quantity {
    private constructor(private readonly value: number) {}
    static create(value: number): Quantity {
        if (!Number.isInteger(value) || value < 1) {
            throw new Error('Quantity must be positive integer');
        }
        return new Quantity(value);
    }
    getValue(): number { return this.value; }
}
 
// Now the function signature prevents bugs
function processOrder(
    orderId: OrderId,      // Can only be OrderId
    customerId: CustomerId, // Can only be CustomerId
    product: ProductCode,
    quantity: Quantity,
    price: Money
): void {
    // Impossible to swap orderId and customerId
    // quantity cannot be negative
    // price has clear semantics
}
 
// Compiler catches the mistake!
processOrder(
    customerId,  // Error: CustomerId not assignable to OrderId
    orderId,     // Error: OrderId not assignable to CustomerId
    productCode,
    quantity,
    price
);

When to Create Domain Types

Create a domain type when: (1) the concept has validation rules, (2) it appears in multiple places, (3) it has associated operations, or (4) mixing it with similar primitives would be a bug. Most domain concepts meet at least one of these criteria.

Summary: Escaping the Under-Abstraction Trap

Under-abstraction may seem like simplicity, but it's actually complexity scattered across the codebase. Let's consolidate the essential insights:

Key Takeaways

•Under-abstraction creates scattered complexity — Logic duplicated across files isn't simple; it's complexity hiding in plain sight.
•Duplication breeds inconsistency — Every copy is a chance for divergent bug fixes, subtle behavioral differences, and maintenance burden.
•Shotgun surgery is the signature symptom — If single logical changes require multi-file modifications, abstraction is needed.
•Primitive obsession is the most common form — Using strings for OrderId, CustomerId, Email, etc. is under-abstraction that types can solve.
•Extract on the second occurrence — Don't wait for extensive duplication. When you copy code, immediately consider extracting.
•Value objects capture domain concepts — Classes like Money, Email, and UserId make code self-documenting and prevent entire bug categories.
•The Boy Scout Rule prevents accumulation — Refactor incrementally as part of normal work, not in separate sprints.

What's Next:

We've explored over-abstraction and under-abstraction—the two extremes of the abstraction spectrum. Both cause harm, but a particularly dangerous form occurs when abstraction is created at the wrong time: premature abstraction. The next page examines this timing problem and develops heuristics for knowing when to abstract.

Page Complete

You now understand under-abstraction: its definition, causes, costs, recognition signals, and remediation strategies. The key insight is that avoiding abstraction isn't simplicity—it's deferred complexity that accumulates interest over time.

2 / 4

Loading learning content...

System Design (LLD)The Cost of Wrong Abstractions

The Cost of Wrong Abstractions

LevelIntermediate

Duration90 mins

TopicThe Cost of Wrong Abstractions

2 / 4

Under-Abstraction — The Hidden Tax of Concrete Code

The Opposite Extreme

What You Will Learn

Defining Under-Abstraction

Under-abstraction manifests in characteristic patterns:

Forms of Under-Abstraction

•Duplicated Logic — The same algorithm or business rule appears in multiple places, each slightly different, each a potential source of divergent bugs.
•Primitive Obsession — Complex domain concepts represented as strings, numbers, or raw data structures instead of meaningful types.
•Missing Domain Concepts — Business logic scattered across procedures without cohesive representation of the entities involved.
•Hardcoded Variations — Conditional branches that could be polymorphism, switch statements that could be strategy patterns, but aren't.
•Incomplete Encapsulation — Data structures with their manipulation logic separated and spread across consumers.
•Inline Everything — Functions that do too much because extracting common patterns was "too much work."

The Maintenance Trap

Why Under-Abstraction Happens

Understanding why under-abstraction occurs helps prevent it. Several forces push engineers away from necessary abstraction:

Root Causes of Under-Abstraction

•Time Pressure — "I don't have time to refactor, I'll just copy this and modify." Short-term speed creates long-term debt.
•Fear of Premature Abstraction — Having been burned by over-abstraction (or read articles about it), engineers overcorrect and avoid abstraction entirely.
•Lack of Domain Understanding — Without clear understanding of the problem space, engineers can't see the abstractions that should exist.
•"It's Only Two Places" — Duplication seems acceptable at first. By the time it's five places, the abstraction seems too hard to retrofit.
•Local Optimization — Each developer solves their immediate problem without seeing the broader pattern across the codebase.
•Unfamiliarity with Abstraction Techniques — Some engineers lack experience with composition, strategy patterns, or other abstraction mechanisms.
•Copy-Paste Culture — Teams that normalize copying code perpetuate under-abstraction through social norms.

The "just ship it" trap:

The Refactoring Window

The Concrete Costs of Under-Abstraction

Under-abstraction imposes severe costs on software projects, often worse than the abstractions it avoids would have cost:

The Multi-Dimensional Costs of Under-Abstraction
Cost Category	Impact	Example
Duplicated Bug Fixes	Every bug must be found and fixed in multiple locations	Security vulnerability patched in 2 of 5 copies; 3 remain exploitable
Inconsistent Behavior	Same concept behaves differently in different contexts	Order total calculation differs between cart and checkout
Change Amplification	Single logical change requires multiple code changes	Updating tax calculation requires changes in 12 files
Knowledge Loss	Logic purpose unclear when scattered and duplicated	"Why do we add 0.5 before rounding? No one remembers"
Test Burden	Same logic tested repeatedly, often inconsistently	Cart total has 50 tests; checkout total has 3
Onboarding Confusion	New developers can't find 'the' implementation	"Which of these 4 validation functions should I use?"
Type Safety Erosion	Primitive types can't prevent semantic errors	Mixing up orderId and customerId (both strings)

The shotgun surgery problem:

The Consistency Tax

Case Study: Duplicated Validation Logic

Let's examine a concrete example of under-abstraction and its consequences. Consider a system that validates email addresses in multiple places:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
// user-registration.ts
function registerUser(email: string, password: string) {
    // Inline email validation
    if (!email || !email.includes('@') || email.length < 5) {
        throw new Error('Invalid email');
    }
    // ... registration logic
}
 
// newsletter-signup.ts  
function subscribeToNewsletter(email: string) {
    // Similar but slightly different validation
    if (!email.includes('@') || !email.includes('.')) {
        throw new Error('Please enter a valid email');
    }
    // ... subscription logic
}
 
// contact-form.ts
function submitContactForm(email: string, message: string) {
    // Yet another variation
    const emailRegex = /^[^\s@]+@[^\s@]+\.[^\s@]+$/;
    if (!emailRegex.test(email)) {
        throw new Error('Email format is invalid');
    }
    // ... form submission logic
}
 
// password-reset.ts
function requestPasswordReset(email: string) {
    // Copy-pasted with modifications
    if (email.indexOf('@') === -1) {
        throw new Error('Not a valid email address');
    }
    // ... reset logic
}
 
// Problems:
// 1. Four different validation rules for the same concept
// 2. Different error messages cause inconsistent UX
// 3. Bug in one (missing null check) doesn't affect others
// 4. Updating validation requires finding and changing all copies

Now let's see the properly abstracted version:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
// email.ts - Single source of truth for email concept
 
/**
 * Email value object - represents a validated email address.
 * If an Email instance exists, it's guaranteed to be valid.
 */
class Email {
    private static readonly EMAIL_REGEX = /^[^\s@]+@[^\s@]+\.[^\s@]+$/;
    
    private constructor(private readonly value: string) {}
    
    /**
     * Factory method that validates and creates Email.
     * Throws if email is invalid, ensuring all Email instances are valid.
     */
    static create(input: string): Email {
        if (!input || typeof input !== 'string') {
            throw new InvalidEmailError('Email is required');
        }
        
        const trimmed = input.trim().toLowerCase();
        
        if (!Email.EMAIL_REGEX.test(trimmed)) {
            throw new InvalidEmailError(`'${input}' is not a valid email address`);
        }
        
        return new Email(trimmed);
    }
    
    static isValid(input: string): boolean {
        try {
            Email.create(input);
            return true;
        } catch {
            return false;
        }
    }
    
    toString(): string {
        return this.value;
    }
    
    equals(other: Email): boolean {
        return this.value === other.value;
    }
    
    getDomain(): string {
        return this.value.split('@')[1];
    }
}
 
class InvalidEmailError extends Error {
    constructor(message: string) {
        super(message);
        this.name = 'InvalidEmailError';
    }
}
 
// Now all usages are consistent:
 
// user-registration.ts
function registerUser(email: Email, password: string) {
    // Email is already validated by type system
    // ... registration logic
}
 
// newsletter-signup.ts  
function subscribeToNewsletter(email: Email) {
    // Guaranteed valid
    // ... subscription logic
}
 
// Usage at API boundary:
const email = Email.create(request.body.email); // Validates once
registerUser(email, password);
 
// Benefits:
// 1. Single validation rule, enforced everywhere
// 2. Consistent error handling
// 3. Type safety prevents mixing up strings
// 4. Rich domain model (getDomain, equals, etc.)
// 5. Changes in one place affect entire system

Value Objects

Recognizing Under-Abstraction in Your Codebase

Identifying under-abstraction requires examining your codebase for patterns of duplication and scattered logic. Here are concrete signals:

Warning Signs of Under-Abstraction

•Grep Duplication — If you can grep for a code pattern and find it in multiple files with minor variations, you have duplication that might benefit from abstraction.
•Shotgun Surgery — If recent changes required modifications in many files for a single logical change, the changed code likely represents an under-abstracted concept.
•Primitive Parameters — Functions that take (string, string, number, boolean) instead of meaningful domain types suffer from primitive obsession.
•Long Parameter Lists — When functions need many parameters, it often indicates missing objects that group related data.
•Similar Bug Fixes — If the same bug class recurs in different parts of the codebase, the shared logic should likely be consolidated.
•Inconsistent Behavior Reports — When QA reports "it works here but not there" for what should be the same feature, duplication has diverged.
•Comment-Heavy Procedures — Excessive comments explaining what code does often signal missing abstractions that would make the code self-documenting.

Signs of Under-Abstraction

•Copy-paste detected by linters
•Same calculation in multiple places
•Business rules scattered across files
•Strings used for everything
•Long switch statements on types
•Inconsistent formatting/validation

Signs of Appropriate Abstraction

•Single source of truth for logic
•Calculations defined once, used many times
•Business rules in domain objects
•Domain types (Money, Email, UserId)
•Polymorphism replaces conditionals
•Consistent behavior guaranteed by type

The Similarity Test

Strategies to Address Under-Abstraction

Addressing under-abstraction requires both immediate refactoring and cultural changes to prevent recurrence:

Refactoring Techniques for Under-Abstraction

•Extract Method/Function — When the same logic appears inline in multiple places, extract it to a named function. This is the simplest form of abstraction.
•Extract Class — When multiple functions operate on the same data, group them into a class with that data. This creates a home for related logic.
•Replace Primitive with Object — When a primitive (string, number) represents a domain concept, create a value object. Money, Email, PhoneNumber, OrderId are better than raw types.
•Introduce Parameter Object — When multiple parameters travel together, group them into a single object. This simplifies signatures and reveals domain concepts.
•Replace Conditional with Polymorphism — When switch statements or if-else chains select behavior based on type, use polymorphism instead.
•Pull Up Method — When subclasses share similar methods, move the common logic to the parent class.
•Form Template Method — When algorithms share structure but vary in steps, extract the skeleton and make steps overridable.

The Boy Scout Rule:

This incremental approach has several advantages:

Each refactoring is small and low-risk
You refactor code you're already understanding
The codebase gradually improves
Refactoring becomes habitual, not exceptional

The Second Occurrence Rule

Primitive Obsession: Under-Abstraction's Most Common Form

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
// Primitive Obsession - Using strings for everything
 
function processOrder(
    orderId: string,         // Could be any string
    customerId: string,      // Easy to mix up with orderId
    productCode: string,     // No validation
    quantity: number,        // Could be negative
    priceInCents: number     // Unit unclear, could be dollars
): void {
    // What happens if we pass customerId where orderId is expected?
    // The compiler won't catch it - they're both strings
}
 
// Realistic bug: arguments swapped
processOrder(
    customer.id,    // Oops, should be orderId
    order.id,       // Oops, should be customerId
    "ABC123",
    5,
    1999
);
 
// ---
 
// Proper Domain Types - Self-documenting and type-safe
 
class OrderId {
    private constructor(private readonly value: string) {}
    static create(value: string): OrderId {
        if (!value.match(/^ORD-\d{6}$/)) {
            throw new Error('Invalid order ID format');
        }
        return new OrderId(value);
    }
    toString(): string { return this.value; }
}
 
class CustomerId {
    private constructor(private readonly value: string) {}
    static create(value: string): CustomerId {
        if (!value.match(/^CUS-\d{8}$/)) {
            throw new Error('Invalid customer ID format');
        }
        return new CustomerId(value);
    }
    toString(): string { return this.value; }
}
 
class Money {
    private constructor(
        private readonly cents: number,
        private readonly currency: string
    ) {}
    
    static usd(dollars: number): Money {
        return new Money(Math.round(dollars * 100), 'USD');
    }
    
    static cents(cents: number): Money {
        if (!Number.isInteger(cents)) {
            throw new Error('Cents must be integer');
        }
        return new Money(cents, 'USD');
    }
    
    add(other: Money): Money {
        if (this.currency !== other.currency) {
            throw new Error('Cannot add different currencies');
        }
        return new Money(this.cents + other.cents, this.currency);
    }
    
    toDisplayString(): string {
        return `$${(this.cents / 100).toFixed(2)}`;
    }
}
 
class Quantity {
    private constructor(private readonly value: number) {}
    static create(value: number): Quantity {
        if (!Number.isInteger(value) || value < 1) {
            throw new Error('Quantity must be positive integer');
        }
        return new Quantity(value);
    }
    getValue(): number { return this.value; }
}
 
// Now the function signature prevents bugs
function processOrder(
    orderId: OrderId,      // Can only be OrderId
    customerId: CustomerId, // Can only be CustomerId
    product: ProductCode,
    quantity: Quantity,
    price: Money
): void {
    // Impossible to swap orderId and customerId
    // quantity cannot be negative
    // price has clear semantics
}
 
// Compiler catches the mistake!
processOrder(
    customerId,  // Error: CustomerId not assignable to OrderId
    orderId,     // Error: OrderId not assignable to CustomerId
    productCode,
    quantity,
    price
);

When to Create Domain Types

Summary: Escaping the Under-Abstraction Trap

Under-abstraction may seem like simplicity, but it's actually complexity scattered across the codebase. Let's consolidate the essential insights:

Key Takeaways

•Under-abstraction creates scattered complexity — Logic duplicated across files isn't simple; it's complexity hiding in plain sight.
•Duplication breeds inconsistency — Every copy is a chance for divergent bug fixes, subtle behavioral differences, and maintenance burden.
•Shotgun surgery is the signature symptom — If single logical changes require multi-file modifications, abstraction is needed.
•Primitive obsession is the most common form — Using strings for OrderId, CustomerId, Email, etc. is under-abstraction that types can solve.
•Extract on the second occurrence — Don't wait for extensive duplication. When you copy code, immediately consider extracting.
•Value objects capture domain concepts — Classes like Money, Email, and UserId make code self-documenting and prevent entire bug categories.
•The Boy Scout Rule prevents accumulation — Refactor incrementally as part of normal work, not in separate sprints.

What's Next:

Page Complete

2 / 4