System Design (LLD)Refactoring from Inheritance to Composition

Refactoring from Inheritance to Composition

LevelAdvanced

Duration60 mins

TopicRefactoring from Inheritance to Composition

3 / 4

Maintaining Behavior During Refactoring

The Invisible Contract

Every line of production code carries an implicit contract with its users: "I will behave in this specific way." These contracts extend far beyond documented APIs—they include error handling quirks, timing behaviors, edge case responses, and countless other details that users (whether human or other code) have come to depend on.

When refactoring from inheritance to composition, your primary obligation is preserving these contracts. A refactoring that changes behavior—even fixing apparent "bugs"—can cause production incidents. Users who depended on the old behavior now find their assumptions violated.

The Behavioral Preservation Mandate

Refactoring is structure change, not behavior change. If you discover bugs or suboptimal behavior during refactoring, document them but DO NOT fix them as part of the refactoring. Fix them in separate, targeted changes after the refactoring is complete.

What You Will Learn

By the end of this page, you will understand techniques for ensuring behavioral equivalence during refactoring: contract documentation, characterization testing, shadow execution, behavioral diff detection, and strategies for handling discovered edge cases.

Understanding Behavioral Contracts

Before we can preserve behavior, we must understand what behavior actually means in the context of software systems. Behavior is multi-dimensional:

1.1 Explicit Contracts

These are the documented, intentional behaviors:

Method signatures and return types
Documented pre-conditions and post-conditions
Specified error conditions and exceptions
Performance guarantees (SLAs)

1.2 Implicit Contracts

These are undocumented behaviors that users nonetheless depend on:

The order in which side effects occur
Specific exception types thrown for specific failures
Null vs. empty collection returns
Caching and memoization behavior
Timing and ordering of asynchronous operations

1.3 Emergent Contracts

These arise from the interaction of multiple components:

The specific sequence of database calls
Thread safety characteristics
Resource consumption patterns
Interaction with global state

behavioral_contract_examples.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
// Examples of behavioral contracts (explicit and implicit)
 
class OrderProcessor {
    
    // EXPLICIT: Documented to throw on invalid order
    /**
     * Processes the order and returns confirmation number.
     * @throws InvalidOrderException if order validation fails
     */
    public String processOrder(Order order) throws InvalidOrderException {
        // Implementation...
    }
    
    // IMPLICIT: Users depend on specific exception type
    // Even though not documented, changing to a different exception
    // would break callers who catch InvalidOrderException specifically
    
    // IMPLICIT: Users may depend on empty string vs null
    public String getOrderNotes(Order order) {
        if (order.getNotes() == null) {
            return "";  // Returning null would break callers doing .length()
        }
        return order.getNotes();
    }
    
    // EMERGENT: The specific ordering of these calls matters
    // because downstream systems expect this sequence
    public void fulfillOrder(Order order) {
        inventoryService.reserve(order);    // Must happen first
        paymentService.charge(order);       // Must happen second
        shippingService.schedule(order);    // Must happen third
        notificationService.notify(order);  // Must happen last
        // Reordering would violate contracts even if each call succeeds
    }
}

Hyrum's Law

"With a sufficient number of users of an API, it does not matter what you promise in the contract: all observable behaviors of your system will be depended on by somebody." — Hyrum Wright, Google. This is why preserving ALL observable behavior, not just documented behavior, is critical.

Characterization Testing

Characterization tests (also called Golden Master tests or Approval tests) capture the current behavior of a system without asserting that the behavior is correct—only that it is consistent.

2.1 The Philosophy of Characterization Tests

Traditional unit tests assert expected behavior:

assertEquals(expected, actual);

Characterization tests assert consistent behavior:

assertEquals(previouslyRecordedOutput, actual);

The previously recorded output becomes the "golden master"—any deviation triggers a test failure, requiring explicit acknowledgment of the behavior change.

2.2 Building a Characterization Test Suite

To build characterization tests for refactoring:

Identify the surface area — What methods/behaviors are being refactored?
Gather diverse inputs — Include normal cases, edge cases, error cases
Record current outputs — Capture return values, side effects, exceptions
Create tests that compare — New runs must match recorded outputs

characterization_test_example.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
// Characterization test framework for refactoring validation
 
public class NotificationBehaviorCharacterization {
    
    private static final Path GOLDEN_MASTER_DIR = 
        Paths.get("src/test/resources/golden-masters/notifications");
    
    @Test
    void emailNotification_standardCase() throws Exception {
        EmailNotification email = createEmailNotification(
            "user@example.com",
            "Test Subject",
            "This is the body content"
        );
        
        // Capture all observable behaviors
        CharacterizationResult result = new CharacterizationResult();
        result.formattedContent = email.formatContent();
        result.validationResult = captureValidation(email);
        result.sideEffects = captureSideEffects(() -> email.send());
        result.thrownException = captureThrownException(() -> email.send());
        
        // Compare against golden master
        assertMatchesGoldenMaster("email-standard", result);
    }
    
    @Test
    void emailNotification_nullSubject() throws Exception {
        EmailNotification email = createEmailNotification(
            "user@example.com",
            null,  // Edge case: null subject
            "Body content"
        );
        
        CharacterizationResult result = captureAllBehaviors(email);
        assertMatchesGoldenMaster("email-null-subject", result);
    }
    
    @Test
    void emailNotification_emptyBody() throws Exception {
        EmailNotification email = createEmailNotification(
            "user@example.com",
            "Subject",
            ""  // Edge case: empty body
        );
        
        CharacterizationResult result = captureAllBehaviors(email);
        assertMatchesGoldenMaster("email-empty-body", result);
    }
    
    @Test
    void emailNotification_invalidRecipient() throws Exception {
        EmailNotification email = createEmailNotification(
            "not-an-email",  // Edge case: invalid recipient
            "Subject",
            "Body"
        );
        
        CharacterizationResult result = captureAllBehaviors(email);
        // This might capture an exception, which is still valid behavior
        assertMatchesGoldenMaster("email-invalid-recipient", result);
    }
    
    // Helper to capture all observable behaviors
    private CharacterizationResult captureAllBehaviors(EmailNotification email) {
        CharacterizationResult result = new CharacterizationResult();
        
        // Capture formatting behavior
        try {
            result.formattedContent = email.formatContent();
        } catch (Exception e) {
            result.formatException = e.getClass().getName() + ": " + e.getMessage();
        }
        
        // Capture validation behavior
        try {
            result.validationPassed = email.validate();
            result.validationMessages = email.getValidationMessages();
        } catch (Exception e) {
            result.validationException = e.getClass().getName() + ": " + e.getMessage();
        }
        
        // Capture send behavior (mocked)
        try (MockedStatic<SmtpClient> mockedSmtp = mockStatic(SmtpClient.class)) {
            List<Object[]> calls = new ArrayList<>();
            mockedSmtp.when(() -> SmtpClient.send(any(), any(), any()))
                .thenAnswer(inv -> {
                    calls.add(inv.getArguments());
                    return "MSG-ID-123";
                });
            
            email.send();
            result.smtpCalls = calls;
        } catch (Exception e) {
            result.sendException = e.getClass().getName() + ": " + e.getMessage();
        }
        
        return result;
    }
    
    private void assertMatchesGoldenMaster(String testName, CharacterizationResult result) 
            throws Exception {
        Path goldenPath = GOLDEN_MASTER_DIR.resolve(testName + ".json");
        String resultJson = toJson(result);
        
        if (Files.exists(goldenPath)) {
            // Compare with existing golden master
            String goldenJson = Files.readString(goldenPath);
            assertEquals(goldenJson, resultJson, 
                "Behavior changed! If this is intentional, update golden master.");
        } else {
            // First run: create golden master
            Files.writeString(goldenPath, resultJson);
            System.out.println("Created new golden master: " + goldenPath);
        }
    }
    
    @Data
    static class CharacterizationResult {
        String formattedContent;
        String formatException;
        boolean validationPassed;
        List<String> validationMessages;
        String validationException;
        List<Object[]> smtpCalls;
        String sendException;
    }
}

Automated Golden Master Generation

Use production logs and monitoring data to identify real-world input patterns. The most valuable characterization tests use inputs that actually occur in production, not just theoretical edge cases.

Shadow Execution Strategy

Shadow execution (also called dark launching or parallel running) is a production verification technique where both old and new implementations run simultaneously, but only the old implementation's results are used. This catches discrepancies on real production traffic without impacting users.

3.1 Shadow Execution Architecture

shadow_execution_wrapper.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
// Shadow execution wrapper for safe production verification
 
public class ShadowExecutionNotificationService implements NotificationService {
    
    private final NotificationService legacyService;  // Old inheritance-based
    private final NotificationService newService;     // New composition-based
    private final ShadowMetrics metrics;
    private final Logger logger;
    
    // Configuration
    private final double shadowTrafficPercentage;  // e.g., 0.1 = 10%
    private final boolean logDiscrepancies;
    
    public ShadowExecutionNotificationService(
            NotificationService legacyService,
            NotificationService newService,
            ShadowMetrics metrics,
            double shadowTrafficPercentage) {
        this.legacyService = legacyService;
        this.newService = newService;
        this.metrics = metrics;
        this.shadowTrafficPercentage = shadowTrafficPercentage;
        this.logDiscrepancies = true;
        this.logger = LoggerFactory.getLogger(getClass());
    }
    
    @Override
    public SendResult send(Notification notification) {
        // Always execute legacy path (this is production)
        SendResult legacyResult;
        Exception legacyException = null;
        long legacyStart = System.nanoTime();
        
        try {
            legacyResult = legacyService.send(notification);
        } catch (Exception e) {
            legacyException = e;
            legacyResult = null;
        }
        long legacyDuration = System.nanoTime() - legacyStart;
        
        // Conditionally execute shadow path
        if (shouldRunShadow()) {
            runShadowAsync(notification, legacyResult, legacyException, legacyDuration);
        }
        
        // Return legacy result (production behavior unchanged)
        if (legacyException != null) {
            throw new RuntimeException(legacyException);
        }
        return legacyResult;
    }
    
    private boolean shouldRunShadow() {
        return Math.random() < shadowTrafficPercentage;
    }
    
    private void runShadowAsync(
            Notification notification,
            SendResult legacyResult,
            Exception legacyException,
            long legacyDuration) {
        
        CompletableFuture.runAsync(() -> {
            SendResult newResult = null;
            Exception newException = null;
            long newStart = System.nanoTime();
            
            try {
                newResult = newService.send(notification);
            } catch (Exception e) {
                newException = e;
            }
            long newDuration = System.nanoTime() - newStart;
            
            // Compare and record
            compareBehaviors(
                notification,
                legacyResult, legacyException, legacyDuration,
                newResult, newException, newDuration
            );
        });
    }
    
    private void compareBehaviors(
            Notification notification,
            SendResult legacyResult, Exception legacyException, long legacyDuration,
            SendResult newResult, Exception newException, long newDuration) {
        
        boolean outcomeMatches = compareOutcomes(
            legacyResult, legacyException,
            newResult, newException
        );
        
        // Record metrics
        metrics.recordShadowExecution(
            notification.getType(),
            outcomeMatches,
            legacyDuration,
            newDuration
        );
        
        if (!outcomeMatches && logDiscrepancies) {
            logger.warn("Shadow execution discrepancy detected. " +
                "Notification: {}, Legacy: {}/{}, New: {}/{}", 
                notification.getId(),
                legacyResult, legacyException,
                newResult, newException);
            
            // Store for later analysis
            metrics.recordDiscrepancy(new DiscrepancyRecord(
                notification,
                legacyResult, legacyException,
                newResult, newException
            ));
        }
    }
    
    private boolean compareOutcomes(
            SendResult legacy, Exception legacyEx,
            SendResult newR, Exception newEx) {
        
        // Both exceptions
        if (legacyEx != null && newEx != null) {
            return compareExceptions(legacyEx, newEx);
        }
        
        // One exception, one success = mismatch
        if ((legacyEx != null) != (newEx != null)) {
            return false;
        }
        
        // Both success: compare results
        return compareResults(legacy, newR);
    }
    
    private boolean compareExceptions(Exception legacy, Exception newEx) {
        // Same exception type?
        return legacy.getClass().equals(newEx.getClass());
    }
    
    private boolean compareResults(SendResult legacy, SendResult newR) {
        // Compare relevant fields (not things like timestamps)
        return legacy.isSuccess() == newR.isSuccess() &&
               legacy.getRecipient().equals(newR.getRecipient());
    }
}

3.2 Shadow Execution Metrics Dashboard

Monitor shadow execution to build confidence before switching:

Shadow Execution Metrics to Track
Metric	Description	Target Before Cutover
Match Rate	Percentage of requests with identical outcomes	99.9%
Latency Comparison	New vs. legacy execution time	New ≤ Legacy + 10%
Exception Parity	Same exceptions thrown for same inputs	100%
Discrepancy Categories	Classification of mismatches by type	All explained
Shadow Volume	Percentage of traffic shadow-executed	Gradually increase to 100%

Shadow Execution Pitfalls

Be careful with side effects! If your notification service actually sends emails, the shadow path should use a mock or dev endpoint. Shadow execution should never cause duplicate side effects in production.

Behavioral Diff Detection

When shadow execution reveals discrepancies, you need systematic techniques to identify the root cause and determine whether the difference is a bug in the new implementation, a bug in the old implementation, or acceptable variance.

4.1 Diff Analysis Workflow

Collect discrepancies with full context
Categorize by type (result mismatch, exception mismatch, timing difference)
Investigate root cause for each category
Decide disposition (fix new, document old bug, accept variance)
Adjust implementation or tests accordingly

discrepancy_analysis.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
// Automated discrepancy analysis system
 
public class DiscrepancyAnalyzer {
    
    public AnalysisReport analyze(List<DiscrepancyRecord> discrepancies) {
        AnalysisReport report = new AnalysisReport();
        
        // Categorize discrepancies
        Map<DiscrepancyCategory, List<DiscrepancyRecord>> categorized = 
            discrepancies.stream()
                .collect(Collectors.groupingBy(this::categorize));
        
        // Analyze each category
        for (var entry : categorized.entrySet()) {
            CategoryAnalysis analysis = analyzeCategory(entry.getKey(), entry.getValue());
            report.addCategoryAnalysis(analysis);
        }
        
        // Generate recommendations
        report.setRecommendations(generateRecommendations(categorized));
        
        return report;
    }
    
    private DiscrepancyCategory categorize(DiscrepancyRecord record) {
        // Exception vs. success mismatch
        if (record.legacySucceeded() != record.newSucceeded()) {
            return record.legacySucceeded() 
                ? DiscrepancyCategory.NEW_FAILS_WHERE_LEGACY_SUCCEEDS
                : DiscrepancyCategory.NEW_SUCCEEDS_WHERE_LEGACY_FAILS;
        }
        
        // Both threw exceptions but different types
        if (record.legacyException() != null && 
            !record.legacyException().getClass()
                .equals(record.newException().getClass())) {
            return DiscrepancyCategory.EXCEPTION_TYPE_MISMATCH;
        }
        
        // Both succeeded but different results
        if (record.legacyResult() != null && record.newResult() != null) {
            return DiscrepancyCategory.RESULT_VALUE_MISMATCH;
        }
        
        return DiscrepancyCategory.OTHER;
    }
    
    private CategoryAnalysis analyzeCategory(
            DiscrepancyCategory category, 
            List<DiscrepancyRecord> records) {
        
        CategoryAnalysis analysis = new CategoryAnalysis(category);
        analysis.setCount(records.size());
        analysis.setExamples(records.stream().limit(5).toList());
        
        // Look for patterns
        analysis.setInputPatterns(findInputPatterns(records));
        analysis.setTimePatterns(findTimePatterns(records));
        
        // Severity assessment
        analysis.setSeverity(assessSeverity(category, records));
        
        return analysis;
    }
    
    private List<String> findInputPatterns(List<DiscrepancyRecord> records) {
        // Identify common characteristics of inputs that cause discrepancies
        List<String> patterns = new ArrayList<>();
        
        long nullRecipients = records.stream()
            .filter(r -> r.getNotification().getRecipient() == null)
            .count();
        if (nullRecipients > records.size() * 0.5) {
            patterns.add("50%+ have null recipient");
        }
        
        long emptyContent = records.stream()
            .filter(r -> r.getNotification().getContent().isEmpty())
            .count();
        if (emptyContent > records.size() * 0.5) {
            patterns.add("50%+ have empty content");
        }
        
        // Add more pattern detection as needed
        return patterns;
    }
    
    enum DiscrepancyCategory {
        NEW_FAILS_WHERE_LEGACY_SUCCEEDS,      // Critical: new impl more strict
        NEW_SUCCEEDS_WHERE_LEGACY_FAILS,      // Often OK: new impl more lenient
        EXCEPTION_TYPE_MISMATCH,              // May matter for catch blocks
        RESULT_VALUE_MISMATCH,                // Investigate case by case
        OTHER
    }
}

4.2 Disposition Decisions

For each discrepancy category, you must decide the disposition:

Disposition	When to Use	Action Required
Fix New	New implementation has a bug	Modify new code to match legacy
Document Legacy Bug	Legacy has a known bug being preserved	Add comment explaining the preserved bug
Accept Variance	Difference is acceptable (e.g., timing)	Adjust comparison logic to allow this
Upgrade Behavior	Both should change	Create follow-up ticket for post-refactor fix
Investigate Further	Root cause unclear	Gather more data before deciding

Handling Discovered Edge Cases

Refactoring often surfaces edge cases that the original developers didn't consciously design for—behavior that emerged from specific implementation details rather than intentional design.

5.1 The Edge Case Decision Framework

When you discover an edge case during refactoring, work through these questions:

Is this behavior documented? If yes, preserve it exactly.
Do users depend on this behavior? If uncertain, assume yes.
Is this a security or correctness issue? If yes, flag for immediate fix (separate from refactoring).
Can this behavior be preserved in the new design? If not, what's the migration path?

edge_case_documentation.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
// Documenting preserved edge case behavior
 
public class SmsFormatter implements ContentFormatter {
    
    /*
     * EDGE CASE DOCUMENTATION:
     * 
     * Legacy Behavior: When content contains a null character (\0), the SMS
     * gateway truncates at that point. We preserve this behavior even though
     * it's technically a gateway bug, because:
     * 1. Some integrations might rely on this for content termination
     * 2. The new formatter should be behaviorally identical during refactoring
     * 
     * Post-Refactoring: Consider sanitizing null characters. See JIRA-4521.
     */
    @Override
    public String format(String rawContent, FormattingContext context) {
        // Preserve legacy null-character truncation behavior
        int nullIndex = rawContent.indexOf('\0');
        if (nullIndex >= 0) {
            rawContent = rawContent.substring(0, nullIndex);
        }
        
        // Normal formatting continues...
        return truncateToSmsLimit(rawContent);
    }
}

5.2 Edge Case Regression Tests

Every discovered edge case should become a test case. This serves multiple purposes:

Documents the edge case behavior explicitly
Prevents future regressions
Provides context for future maintainers
Creates a catalog of known quirks

edge_case_tests.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
// Explicit edge case tests from discoveries during refactoring
 
class SmsFormatterEdgeCaseTests {
    
    @Test
    @DisplayName("Edge Case: Null character in content causes truncation")
    void format_contentWithNullCharacter_truncatesAtNullChar() {
        // DISCOVERED: 2024-01-15 during refactoring
        // ROOT CAUSE: SMS gateway behavior, not intentional design
        // DECISION: Preserve for backward compatibility
        
        SmsFormatter formatter = new SmsFormatter(" STOP");
        String contentWithNull = "Hello\0World";
        
        String result = formatter.format(contentWithNull, context());
        
        assertEquals("Hello STOP", result);  // "World" is truncated
    }
    
    @Test
    @DisplayName("Edge Case: Unicode emoji counts as 2 characters for length limit")
    void format_contentWithEmoji_countsEmojiAsMultipleChars() {
        // DISCOVERED: 2024-01-16 during shadow execution
        // ROOT CAUSE: GSM-7 encoding limitation
        // DECISION: Preserve; carrier limitation, not our bug
        
        SmsFormatter formatter = new SmsFormatter("");
        String contentWithEmoji = "Test 😀 message";  // 😀 = 2 chars
        
        String result = formatter.format(contentWithEmoji, context());
        
        // Emoji should count toward the 160 limit as 2 characters
        assertTrue(contentWithEmoji.length() < 160);  // Looks short
        // But effective length includes emoji multi-char counting
    }
    
    @Test
    @DisplayName("Edge Case: Leading whitespace in content is preserved")
    void format_contentWithLeadingWhitespace_preservesWhitespace() {
        // DISCOVERED: 2024-01-17 during code review
        // ROOT CAUSE: Intentional design (some clients use for alignment)
        // DECISION: Document and preserve
        
        SmsFormatter formatter = new SmsFormatter("");
        String content = "   Indented message";
        
        String result = formatter.format(content, context());
        
        assertTrue(result.startsWith("   "));
    }
}

Edge Case Catalog

Maintain a living document cataloging all discovered edge cases. This becomes invaluable for future maintainers and helps when similar refactoring is done elsewhere in the system.

Contract Testing Between Components

When breaking apart an inheritance hierarchy into composed components, you're introducing new boundaries. These boundaries need contract tests to ensure components continue to work together correctly.

6.1 Interface Contract Tests

For each interface created during extraction, write tests that verify any implementation meets the interface's behavioral contract:

interface_contract_tests.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
// Contract tests that any DeliveryChannel implementation must pass
 
public abstract class DeliveryChannelContractTest {
    
    // Subclasses provide the implementation under test
    protected abstract DeliveryChannel createChannel();
    
    @Test
    void deliver_validNotification_returnsSuccessResult() {
        DeliveryChannel channel = createChannel();
        FormattedNotification notification = createValidNotification();
        
        DeliveryResult result = channel.deliver(notification);
        
        assertTrue(result.success());
        assertNotNull(result.messageId());
        assertNull(result.error());
    }
    
    @Test
    void deliver_validNotification_doesNotThrow() {
        DeliveryChannel channel = createChannel();
        FormattedNotification notification = createValidNotification();
        
        assertDoesNotThrow(() -> channel.deliver(notification));
    }
    
    @Test
    void deliver_nullNotification_throwsNullPointerException() {
        DeliveryChannel channel = createChannel();
        
        assertThrows(NullPointerException.class, 
            () -> channel.deliver(null));
    }
    
    @Test
    void supportsRecipient_validForChannel_returnsTrue() {
        DeliveryChannel channel = createChannel();
        String validRecipient = getValidRecipientForChannel();
        
        assertTrue(channel.supportsRecipient(validRecipient));
    }
    
    @Test
    void supportsRecipient_invalidForChannel_returnsFalse() {
        DeliveryChannel channel = createChannel();
        String invalidRecipient = getInvalidRecipientForChannel();
        
        assertFalse(channel.supportsRecipient(invalidRecipient));
    }
    
    // Abstract methods for subclass configuration
    protected abstract FormattedNotification createValidNotification();
    protected abstract String getValidRecipientForChannel();
    protected abstract String getInvalidRecipientForChannel();
}
 
// Concrete test for SMTP channel
class SmtpDeliveryChannelContractTest extends DeliveryChannelContractTest {
    
    private SmtpClient mockClient;
    
    @BeforeEach
    void setUp() {
        mockClient = mock(SmtpClient.class);
        when(mockClient.send(any(), any(), any())).thenReturn("MSG-123");
    }
    
    @Override
    protected DeliveryChannel createChannel() {
        return new SmtpDeliveryChannel(mockClient);
    }
    
    @Override
    protected FormattedNotification createValidNotification() {
        return new FormattedNotification(
            "Content",
            Map.of("to", "user@example.com", "subject", "Test")
        );
    }
    
    @Override
    protected String getValidRecipientForChannel() {
        return "user@example.com";  // Valid email
    }
    
    @Override
    protected String getInvalidRecipientForChannel() {
        return "+1234567890";  // Phone number, not email
    }
}

6.2 Integration Contract Tests

Test that composed components work correctly together:

integration_contract_tests.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
// Integration tests for composed notification system
 
class NotificationSystemIntegrationTest {
    
    @Test
    void fullPipeline_validEmail_sendsFormattedContent() {
        // Compose the full system
        TemplateEngine templateEngine = new InMemoryTemplateEngine();
        templateEngine.register("email-template.html", "<html>{{content}}</html>");
        
        SmtpClient mockSmtp = mock(SmtpClient.class);
        when(mockSmtp.send(any(), any(), any())).thenReturn("MSG-123");
        
        ContentFormatter formatter = new HtmlEmailFormatter(templateEngine);
        DeliveryChannel channel = new SmtpDeliveryChannel(mockSmtp);
        NotificationValidator validator = new CompositeValidator();
        RetryPolicy retryPolicy = new NoRetryPolicy();
        
        // Create notification using factory
        EmailNotification notification = new EmailNotification(
            "user@example.com",
            "Hello, World!",
            "Test Subject",
            formatter,
            channel,
            validator,
            retryPolicy
        );
        
        // Execute
        notification.send();
        
        // Verify the formatted content reached the SMTP client
        verify(mockSmtp).send(
            eq("user@example.com"),
            eq("Test Subject"),
            eq("<html>Hello, World!</html>")
        );
    }
    
    @Test
    void fullPipeline_validationFails_doesNotSend() {
        // Compose with a validator that rejects
        NotificationValidator rejectingValidator = new NotificationValidator() {
            @Override
            public ValidationResult validate(NotificationData data) {
                return new ValidationResult(false, List.of("Invalid recipient"));
            }
        };
        
        SmtpClient mockSmtp = mock(SmtpClient.class);
        
        EmailNotification notification = new EmailNotification(
            "invalid",
            "Content",
            "Subject",
            new HtmlEmailFormatter(new DummyTemplateEngine()),
            new SmtpDeliveryChannel(mockSmtp),
            rejectingValidator,
            new NoRetryPolicy()
        );
        
        assertThrows(InvalidNotificationException.class, notification::send);
        verifyNoInteractions(mockSmtp);  // Should not reach delivery
    }
}

Summary: Maintaining Behavioral Correctness

Key Takeaways

•Behavior includes implicit contracts — Users depend on undocumented behaviors; preserve them all
•Characterization tests capture reality — Test what IS, not just what should be; golden masters catch changes
•Shadow execution validates in production — Real traffic reveals edge cases that tests miss
•Systematically analyze discrepancies — Categorize, investigate, and decide disposition for each difference
•Document discovered edge cases — Every edge case becomes a test; maintain an edge case catalog
•Contract tests protect new boundaries — Composed components need explicit behavioral contracts

What's Next

The final page covers testing the refactored design—comprehensive testing strategies that validate not just behavioral correctness, but also that the new composition-based design achieves its goals of flexibility, maintainability, and extensibility.

3 / 4

Loading learning content...

System Design (LLD)Refactoring from Inheritance to Composition

Refactoring from Inheritance to Composition

LevelAdvanced

Duration60 mins

TopicRefactoring from Inheritance to Composition

3 / 4

Maintaining Behavior During Refactoring

The Invisible Contract

The Behavioral Preservation Mandate

What You Will Learn

Understanding Behavioral Contracts

Before we can preserve behavior, we must understand what behavior actually means in the context of software systems. Behavior is multi-dimensional:

1.1 Explicit Contracts

These are the documented, intentional behaviors:

Method signatures and return types
Documented pre-conditions and post-conditions
Specified error conditions and exceptions
Performance guarantees (SLAs)

1.2 Implicit Contracts

These are undocumented behaviors that users nonetheless depend on:

The order in which side effects occur
Specific exception types thrown for specific failures
Null vs. empty collection returns
Caching and memoization behavior
Timing and ordering of asynchronous operations

1.3 Emergent Contracts

These arise from the interaction of multiple components:

The specific sequence of database calls
Thread safety characteristics
Resource consumption patterns
Interaction with global state

behavioral_contract_examples.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
// Examples of behavioral contracts (explicit and implicit)
 
class OrderProcessor {
    
    // EXPLICIT: Documented to throw on invalid order
    /**
     * Processes the order and returns confirmation number.
     * @throws InvalidOrderException if order validation fails
     */
    public String processOrder(Order order) throws InvalidOrderException {
        // Implementation...
    }
    
    // IMPLICIT: Users depend on specific exception type
    // Even though not documented, changing to a different exception
    // would break callers who catch InvalidOrderException specifically
    
    // IMPLICIT: Users may depend on empty string vs null
    public String getOrderNotes(Order order) {
        if (order.getNotes() == null) {
            return "";  // Returning null would break callers doing .length()
        }
        return order.getNotes();
    }
    
    // EMERGENT: The specific ordering of these calls matters
    // because downstream systems expect this sequence
    public void fulfillOrder(Order order) {
        inventoryService.reserve(order);    // Must happen first
        paymentService.charge(order);       // Must happen second
        shippingService.schedule(order);    // Must happen third
        notificationService.notify(order);  // Must happen last
        // Reordering would violate contracts even if each call succeeds
    }
}

Hyrum's Law

Characterization Testing

Characterization tests (also called Golden Master tests or Approval tests) capture the current behavior of a system without asserting that the behavior is correct—only that it is consistent.

2.1 The Philosophy of Characterization Tests

Traditional unit tests assert expected behavior:

assertEquals(expected, actual);

Characterization tests assert consistent behavior:

assertEquals(previouslyRecordedOutput, actual);

The previously recorded output becomes the "golden master"—any deviation triggers a test failure, requiring explicit acknowledgment of the behavior change.

2.2 Building a Characterization Test Suite

To build characterization tests for refactoring:

Identify the surface area — What methods/behaviors are being refactored?
Gather diverse inputs — Include normal cases, edge cases, error cases
Record current outputs — Capture return values, side effects, exceptions
Create tests that compare — New runs must match recorded outputs

characterization_test_example.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
// Characterization test framework for refactoring validation
 
public class NotificationBehaviorCharacterization {
    
    private static final Path GOLDEN_MASTER_DIR = 
        Paths.get("src/test/resources/golden-masters/notifications");
    
    @Test
    void emailNotification_standardCase() throws Exception {
        EmailNotification email = createEmailNotification(
            "user@example.com",
            "Test Subject",
            "This is the body content"
        );
        
        // Capture all observable behaviors
        CharacterizationResult result = new CharacterizationResult();
        result.formattedContent = email.formatContent();
        result.validationResult = captureValidation(email);
        result.sideEffects = captureSideEffects(() -> email.send());
        result.thrownException = captureThrownException(() -> email.send());
        
        // Compare against golden master
        assertMatchesGoldenMaster("email-standard", result);
    }
    
    @Test
    void emailNotification_nullSubject() throws Exception {
        EmailNotification email = createEmailNotification(
            "user@example.com",
            null,  // Edge case: null subject
            "Body content"
        );
        
        CharacterizationResult result = captureAllBehaviors(email);
        assertMatchesGoldenMaster("email-null-subject", result);
    }
    
    @Test
    void emailNotification_emptyBody() throws Exception {
        EmailNotification email = createEmailNotification(
            "user@example.com",
            "Subject",
            ""  // Edge case: empty body
        );
        
        CharacterizationResult result = captureAllBehaviors(email);
        assertMatchesGoldenMaster("email-empty-body", result);
    }
    
    @Test
    void emailNotification_invalidRecipient() throws Exception {
        EmailNotification email = createEmailNotification(
            "not-an-email",  // Edge case: invalid recipient
            "Subject",
            "Body"
        );
        
        CharacterizationResult result = captureAllBehaviors(email);
        // This might capture an exception, which is still valid behavior
        assertMatchesGoldenMaster("email-invalid-recipient", result);
    }
    
    // Helper to capture all observable behaviors
    private CharacterizationResult captureAllBehaviors(EmailNotification email) {
        CharacterizationResult result = new CharacterizationResult();
        
        // Capture formatting behavior
        try {
            result.formattedContent = email.formatContent();
        } catch (Exception e) {
            result.formatException = e.getClass().getName() + ": " + e.getMessage();
        }
        
        // Capture validation behavior
        try {
            result.validationPassed = email.validate();
            result.validationMessages = email.getValidationMessages();
        } catch (Exception e) {
            result.validationException = e.getClass().getName() + ": " + e.getMessage();
        }
        
        // Capture send behavior (mocked)
        try (MockedStatic<SmtpClient> mockedSmtp = mockStatic(SmtpClient.class)) {
            List<Object[]> calls = new ArrayList<>();
            mockedSmtp.when(() -> SmtpClient.send(any(), any(), any()))
                .thenAnswer(inv -> {
                    calls.add(inv.getArguments());
                    return "MSG-ID-123";
                });
            
            email.send();
            result.smtpCalls = calls;
        } catch (Exception e) {
            result.sendException = e.getClass().getName() + ": " + e.getMessage();
        }
        
        return result;
    }
    
    private void assertMatchesGoldenMaster(String testName, CharacterizationResult result) 
            throws Exception {
        Path goldenPath = GOLDEN_MASTER_DIR.resolve(testName + ".json");
        String resultJson = toJson(result);
        
        if (Files.exists(goldenPath)) {
            // Compare with existing golden master
            String goldenJson = Files.readString(goldenPath);
            assertEquals(goldenJson, resultJson, 
                "Behavior changed! If this is intentional, update golden master.");
        } else {
            // First run: create golden master
            Files.writeString(goldenPath, resultJson);
            System.out.println("Created new golden master: " + goldenPath);
        }
    }
    
    @Data
    static class CharacterizationResult {
        String formattedContent;
        String formatException;
        boolean validationPassed;
        List<String> validationMessages;
        String validationException;
        List<Object[]> smtpCalls;
        String sendException;
    }
}

Automated Golden Master Generation

Use production logs and monitoring data to identify real-world input patterns. The most valuable characterization tests use inputs that actually occur in production, not just theoretical edge cases.

Shadow Execution Strategy

3.1 Shadow Execution Architecture

shadow_execution_wrapper.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
// Shadow execution wrapper for safe production verification
 
public class ShadowExecutionNotificationService implements NotificationService {
    
    private final NotificationService legacyService;  // Old inheritance-based
    private final NotificationService newService;     // New composition-based
    private final ShadowMetrics metrics;
    private final Logger logger;
    
    // Configuration
    private final double shadowTrafficPercentage;  // e.g., 0.1 = 10%
    private final boolean logDiscrepancies;
    
    public ShadowExecutionNotificationService(
            NotificationService legacyService,
            NotificationService newService,
            ShadowMetrics metrics,
            double shadowTrafficPercentage) {
        this.legacyService = legacyService;
        this.newService = newService;
        this.metrics = metrics;
        this.shadowTrafficPercentage = shadowTrafficPercentage;
        this.logDiscrepancies = true;
        this.logger = LoggerFactory.getLogger(getClass());
    }
    
    @Override
    public SendResult send(Notification notification) {
        // Always execute legacy path (this is production)
        SendResult legacyResult;
        Exception legacyException = null;
        long legacyStart = System.nanoTime();
        
        try {
            legacyResult = legacyService.send(notification);
        } catch (Exception e) {
            legacyException = e;
            legacyResult = null;
        }
        long legacyDuration = System.nanoTime() - legacyStart;
        
        // Conditionally execute shadow path
        if (shouldRunShadow()) {
            runShadowAsync(notification, legacyResult, legacyException, legacyDuration);
        }
        
        // Return legacy result (production behavior unchanged)
        if (legacyException != null) {
            throw new RuntimeException(legacyException);
        }
        return legacyResult;
    }
    
    private boolean shouldRunShadow() {
        return Math.random() < shadowTrafficPercentage;
    }
    
    private void runShadowAsync(
            Notification notification,
            SendResult legacyResult,
            Exception legacyException,
            long legacyDuration) {
        
        CompletableFuture.runAsync(() -> {
            SendResult newResult = null;
            Exception newException = null;
            long newStart = System.nanoTime();
            
            try {
                newResult = newService.send(notification);
            } catch (Exception e) {
                newException = e;
            }
            long newDuration = System.nanoTime() - newStart;
            
            // Compare and record
            compareBehaviors(
                notification,
                legacyResult, legacyException, legacyDuration,
                newResult, newException, newDuration
            );
        });
    }
    
    private void compareBehaviors(
            Notification notification,
            SendResult legacyResult, Exception legacyException, long legacyDuration,
            SendResult newResult, Exception newException, long newDuration) {
        
        boolean outcomeMatches = compareOutcomes(
            legacyResult, legacyException,
            newResult, newException
        );
        
        // Record metrics
        metrics.recordShadowExecution(
            notification.getType(),
            outcomeMatches,
            legacyDuration,
            newDuration
        );
        
        if (!outcomeMatches && logDiscrepancies) {
            logger.warn("Shadow execution discrepancy detected. " +
                "Notification: {}, Legacy: {}/{}, New: {}/{}", 
                notification.getId(),
                legacyResult, legacyException,
                newResult, newException);
            
            // Store for later analysis
            metrics.recordDiscrepancy(new DiscrepancyRecord(
                notification,
                legacyResult, legacyException,
                newResult, newException
            ));
        }
    }
    
    private boolean compareOutcomes(
            SendResult legacy, Exception legacyEx,
            SendResult newR, Exception newEx) {
        
        // Both exceptions
        if (legacyEx != null && newEx != null) {
            return compareExceptions(legacyEx, newEx);
        }
        
        // One exception, one success = mismatch
        if ((legacyEx != null) != (newEx != null)) {
            return false;
        }
        
        // Both success: compare results
        return compareResults(legacy, newR);
    }
    
    private boolean compareExceptions(Exception legacy, Exception newEx) {
        // Same exception type?
        return legacy.getClass().equals(newEx.getClass());
    }
    
    private boolean compareResults(SendResult legacy, SendResult newR) {
        // Compare relevant fields (not things like timestamps)
        return legacy.isSuccess() == newR.isSuccess() &&
               legacy.getRecipient().equals(newR.getRecipient());
    }
}

3.2 Shadow Execution Metrics Dashboard

Monitor shadow execution to build confidence before switching:

Shadow Execution Metrics to Track
Metric	Description	Target Before Cutover
Match Rate	Percentage of requests with identical outcomes	99.9%
Latency Comparison	New vs. legacy execution time	New ≤ Legacy + 10%
Exception Parity	Same exceptions thrown for same inputs	100%
Discrepancy Categories	Classification of mismatches by type	All explained
Shadow Volume	Percentage of traffic shadow-executed	Gradually increase to 100%

Shadow Execution Pitfalls

Behavioral Diff Detection

4.1 Diff Analysis Workflow

Collect discrepancies with full context
Categorize by type (result mismatch, exception mismatch, timing difference)
Investigate root cause for each category
Decide disposition (fix new, document old bug, accept variance)
Adjust implementation or tests accordingly

discrepancy_analysis.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
// Automated discrepancy analysis system
 
public class DiscrepancyAnalyzer {
    
    public AnalysisReport analyze(List<DiscrepancyRecord> discrepancies) {
        AnalysisReport report = new AnalysisReport();
        
        // Categorize discrepancies
        Map<DiscrepancyCategory, List<DiscrepancyRecord>> categorized = 
            discrepancies.stream()
                .collect(Collectors.groupingBy(this::categorize));
        
        // Analyze each category
        for (var entry : categorized.entrySet()) {
            CategoryAnalysis analysis = analyzeCategory(entry.getKey(), entry.getValue());
            report.addCategoryAnalysis(analysis);
        }
        
        // Generate recommendations
        report.setRecommendations(generateRecommendations(categorized));
        
        return report;
    }
    
    private DiscrepancyCategory categorize(DiscrepancyRecord record) {
        // Exception vs. success mismatch
        if (record.legacySucceeded() != record.newSucceeded()) {
            return record.legacySucceeded() 
                ? DiscrepancyCategory.NEW_FAILS_WHERE_LEGACY_SUCCEEDS
                : DiscrepancyCategory.NEW_SUCCEEDS_WHERE_LEGACY_FAILS;
        }
        
        // Both threw exceptions but different types
        if (record.legacyException() != null && 
            !record.legacyException().getClass()
                .equals(record.newException().getClass())) {
            return DiscrepancyCategory.EXCEPTION_TYPE_MISMATCH;
        }
        
        // Both succeeded but different results
        if (record.legacyResult() != null && record.newResult() != null) {
            return DiscrepancyCategory.RESULT_VALUE_MISMATCH;
        }
        
        return DiscrepancyCategory.OTHER;
    }
    
    private CategoryAnalysis analyzeCategory(
            DiscrepancyCategory category, 
            List<DiscrepancyRecord> records) {
        
        CategoryAnalysis analysis = new CategoryAnalysis(category);
        analysis.setCount(records.size());
        analysis.setExamples(records.stream().limit(5).toList());
        
        // Look for patterns
        analysis.setInputPatterns(findInputPatterns(records));
        analysis.setTimePatterns(findTimePatterns(records));
        
        // Severity assessment
        analysis.setSeverity(assessSeverity(category, records));
        
        return analysis;
    }
    
    private List<String> findInputPatterns(List<DiscrepancyRecord> records) {
        // Identify common characteristics of inputs that cause discrepancies
        List<String> patterns = new ArrayList<>();
        
        long nullRecipients = records.stream()
            .filter(r -> r.getNotification().getRecipient() == null)
            .count();
        if (nullRecipients > records.size() * 0.5) {
            patterns.add("50%+ have null recipient");
        }
        
        long emptyContent = records.stream()
            .filter(r -> r.getNotification().getContent().isEmpty())
            .count();
        if (emptyContent > records.size() * 0.5) {
            patterns.add("50%+ have empty content");
        }
        
        // Add more pattern detection as needed
        return patterns;
    }
    
    enum DiscrepancyCategory {
        NEW_FAILS_WHERE_LEGACY_SUCCEEDS,      // Critical: new impl more strict
        NEW_SUCCEEDS_WHERE_LEGACY_FAILS,      // Often OK: new impl more lenient
        EXCEPTION_TYPE_MISMATCH,              // May matter for catch blocks
        RESULT_VALUE_MISMATCH,                // Investigate case by case
        OTHER
    }
}

4.2 Disposition Decisions

For each discrepancy category, you must decide the disposition:

Disposition	When to Use	Action Required
Fix New	New implementation has a bug	Modify new code to match legacy
Document Legacy Bug	Legacy has a known bug being preserved	Add comment explaining the preserved bug
Accept Variance	Difference is acceptable (e.g., timing)	Adjust comparison logic to allow this
Upgrade Behavior	Both should change	Create follow-up ticket for post-refactor fix
Investigate Further	Root cause unclear	Gather more data before deciding

Handling Discovered Edge Cases

Refactoring often surfaces edge cases that the original developers didn't consciously design for—behavior that emerged from specific implementation details rather than intentional design.

5.1 The Edge Case Decision Framework

When you discover an edge case during refactoring, work through these questions:

Is this behavior documented? If yes, preserve it exactly.
Do users depend on this behavior? If uncertain, assume yes.
Is this a security or correctness issue? If yes, flag for immediate fix (separate from refactoring).
Can this behavior be preserved in the new design? If not, what's the migration path?

edge_case_documentation.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
// Documenting preserved edge case behavior
 
public class SmsFormatter implements ContentFormatter {
    
    /*
     * EDGE CASE DOCUMENTATION:
     * 
     * Legacy Behavior: When content contains a null character (\0), the SMS
     * gateway truncates at that point. We preserve this behavior even though
     * it's technically a gateway bug, because:
     * 1. Some integrations might rely on this for content termination
     * 2. The new formatter should be behaviorally identical during refactoring
     * 
     * Post-Refactoring: Consider sanitizing null characters. See JIRA-4521.
     */
    @Override
    public String format(String rawContent, FormattingContext context) {
        // Preserve legacy null-character truncation behavior
        int nullIndex = rawContent.indexOf('\0');
        if (nullIndex >= 0) {
            rawContent = rawContent.substring(0, nullIndex);
        }
        
        // Normal formatting continues...
        return truncateToSmsLimit(rawContent);
    }
}

5.2 Edge Case Regression Tests

Every discovered edge case should become a test case. This serves multiple purposes:

Documents the edge case behavior explicitly
Prevents future regressions
Provides context for future maintainers
Creates a catalog of known quirks

edge_case_tests.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
// Explicit edge case tests from discoveries during refactoring
 
class SmsFormatterEdgeCaseTests {
    
    @Test
    @DisplayName("Edge Case: Null character in content causes truncation")
    void format_contentWithNullCharacter_truncatesAtNullChar() {
        // DISCOVERED: 2024-01-15 during refactoring
        // ROOT CAUSE: SMS gateway behavior, not intentional design
        // DECISION: Preserve for backward compatibility
        
        SmsFormatter formatter = new SmsFormatter(" STOP");
        String contentWithNull = "Hello\0World";
        
        String result = formatter.format(contentWithNull, context());
        
        assertEquals("Hello STOP", result);  // "World" is truncated
    }
    
    @Test
    @DisplayName("Edge Case: Unicode emoji counts as 2 characters for length limit")
    void format_contentWithEmoji_countsEmojiAsMultipleChars() {
        // DISCOVERED: 2024-01-16 during shadow execution
        // ROOT CAUSE: GSM-7 encoding limitation
        // DECISION: Preserve; carrier limitation, not our bug
        
        SmsFormatter formatter = new SmsFormatter("");
        String contentWithEmoji = "Test 😀 message";  // 😀 = 2 chars
        
        String result = formatter.format(contentWithEmoji, context());
        
        // Emoji should count toward the 160 limit as 2 characters
        assertTrue(contentWithEmoji.length() < 160);  // Looks short
        // But effective length includes emoji multi-char counting
    }
    
    @Test
    @DisplayName("Edge Case: Leading whitespace in content is preserved")
    void format_contentWithLeadingWhitespace_preservesWhitespace() {
        // DISCOVERED: 2024-01-17 during code review
        // ROOT CAUSE: Intentional design (some clients use for alignment)
        // DECISION: Document and preserve
        
        SmsFormatter formatter = new SmsFormatter("");
        String content = "   Indented message";
        
        String result = formatter.format(content, context());
        
        assertTrue(result.startsWith("   "));
    }
}

Edge Case Catalog

Maintain a living document cataloging all discovered edge cases. This becomes invaluable for future maintainers and helps when similar refactoring is done elsewhere in the system.

Contract Testing Between Components

6.1 Interface Contract Tests

For each interface created during extraction, write tests that verify any implementation meets the interface's behavioral contract:

interface_contract_tests.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
// Contract tests that any DeliveryChannel implementation must pass
 
public abstract class DeliveryChannelContractTest {
    
    // Subclasses provide the implementation under test
    protected abstract DeliveryChannel createChannel();
    
    @Test
    void deliver_validNotification_returnsSuccessResult() {
        DeliveryChannel channel = createChannel();
        FormattedNotification notification = createValidNotification();
        
        DeliveryResult result = channel.deliver(notification);
        
        assertTrue(result.success());
        assertNotNull(result.messageId());
        assertNull(result.error());
    }
    
    @Test
    void deliver_validNotification_doesNotThrow() {
        DeliveryChannel channel = createChannel();
        FormattedNotification notification = createValidNotification();
        
        assertDoesNotThrow(() -> channel.deliver(notification));
    }
    
    @Test
    void deliver_nullNotification_throwsNullPointerException() {
        DeliveryChannel channel = createChannel();
        
        assertThrows(NullPointerException.class, 
            () -> channel.deliver(null));
    }
    
    @Test
    void supportsRecipient_validForChannel_returnsTrue() {
        DeliveryChannel channel = createChannel();
        String validRecipient = getValidRecipientForChannel();
        
        assertTrue(channel.supportsRecipient(validRecipient));
    }
    
    @Test
    void supportsRecipient_invalidForChannel_returnsFalse() {
        DeliveryChannel channel = createChannel();
        String invalidRecipient = getInvalidRecipientForChannel();
        
        assertFalse(channel.supportsRecipient(invalidRecipient));
    }
    
    // Abstract methods for subclass configuration
    protected abstract FormattedNotification createValidNotification();
    protected abstract String getValidRecipientForChannel();
    protected abstract String getInvalidRecipientForChannel();
}
 
// Concrete test for SMTP channel
class SmtpDeliveryChannelContractTest extends DeliveryChannelContractTest {
    
    private SmtpClient mockClient;
    
    @BeforeEach
    void setUp() {
        mockClient = mock(SmtpClient.class);
        when(mockClient.send(any(), any(), any())).thenReturn("MSG-123");
    }
    
    @Override
    protected DeliveryChannel createChannel() {
        return new SmtpDeliveryChannel(mockClient);
    }
    
    @Override
    protected FormattedNotification createValidNotification() {
        return new FormattedNotification(
            "Content",
            Map.of("to", "user@example.com", "subject", "Test")
        );
    }
    
    @Override
    protected String getValidRecipientForChannel() {
        return "user@example.com";  // Valid email
    }
    
    @Override
    protected String getInvalidRecipientForChannel() {
        return "+1234567890";  // Phone number, not email
    }
}

6.2 Integration Contract Tests

Test that composed components work correctly together:

integration_contract_tests.java
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
// Integration tests for composed notification system
 
class NotificationSystemIntegrationTest {
    
    @Test
    void fullPipeline_validEmail_sendsFormattedContent() {
        // Compose the full system
        TemplateEngine templateEngine = new InMemoryTemplateEngine();
        templateEngine.register("email-template.html", "<html>{{content}}</html>");
        
        SmtpClient mockSmtp = mock(SmtpClient.class);
        when(mockSmtp.send(any(), any(), any())).thenReturn("MSG-123");
        
        ContentFormatter formatter = new HtmlEmailFormatter(templateEngine);
        DeliveryChannel channel = new SmtpDeliveryChannel(mockSmtp);
        NotificationValidator validator = new CompositeValidator();
        RetryPolicy retryPolicy = new NoRetryPolicy();
        
        // Create notification using factory
        EmailNotification notification = new EmailNotification(
            "user@example.com",
            "Hello, World!",
            "Test Subject",
            formatter,
            channel,
            validator,
            retryPolicy
        );
        
        // Execute
        notification.send();
        
        // Verify the formatted content reached the SMTP client
        verify(mockSmtp).send(
            eq("user@example.com"),
            eq("Test Subject"),
            eq("<html>Hello, World!</html>")
        );
    }
    
    @Test
    void fullPipeline_validationFails_doesNotSend() {
        // Compose with a validator that rejects
        NotificationValidator rejectingValidator = new NotificationValidator() {
            @Override
            public ValidationResult validate(NotificationData data) {
                return new ValidationResult(false, List.of("Invalid recipient"));
            }
        };
        
        SmtpClient mockSmtp = mock(SmtpClient.class);
        
        EmailNotification notification = new EmailNotification(
            "invalid",
            "Content",
            "Subject",
            new HtmlEmailFormatter(new DummyTemplateEngine()),
            new SmtpDeliveryChannel(mockSmtp),
            rejectingValidator,
            new NoRetryPolicy()
        );
        
        assertThrows(InvalidNotificationException.class, notification::send);
        verifyNoInteractions(mockSmtp);  // Should not reach delivery
    }
}

Summary: Maintaining Behavioral Correctness

Key Takeaways

•Behavior includes implicit contracts — Users depend on undocumented behaviors; preserve them all
•Characterization tests capture reality — Test what IS, not just what should be; golden masters catch changes
•Shadow execution validates in production — Real traffic reveals edge cases that tests miss
•Systematically analyze discrepancies — Categorize, investigate, and decide disposition for each difference
•Document discovered edge cases — Every edge case becomes a test; maintain an edge case catalog
•Contract tests protect new boundaries — Composed components need explicit behavioral contracts

What's Next

3 / 4