Sudoku Solver - Learning Module

Loading content...

0/276

Validity Checking

The Guardian of Correctness

Every Sudoku placement must pass a gauntlet of constraints. Before we commit to placing a '7' in a cell, we must verify: Is there already a 7 in this row? In this column? In this 3×3 box? If any answer is 'yes', the placement is invalid. This seemingly simple check—repeated millions of times in a hard puzzle—is the guardian of correctness in our solver.

Validity checking is where the elegance of theory meets the demands of performance. A naïve implementation scans rows, columns, and boxes on every placement, performing 27 comparisons. A sophisticated implementation uses clever data structures to answer in O(1) time. The difference matters: in a solver exploring millions of nodes, shaving microseconds per check translates to seconds or minutes saved overall.

What You Will Learn

This page explores validity checking from first principles to optimized implementations. You'll understand: the three types of constraints that must be checked, multiple implementation strategies with different performance characteristics, data structures that enable O(1) validation, incremental checking during propagation, and common pitfalls that lead to incorrect solvers.

The Three Constraints

Sudoku's constraints are deceptively simple. Every complete valid Sudoku must satisfy three types of uniqueness constraints, all of which are instantiations of the same rule: no duplicates within a unit.

The three unit types:

Rows: Each of the 9 horizontal rows must contain digits 1-9 exactly once.
Columns: Each of the 9 vertical columns must contain digits 1-9 exactly once.
Boxes: Each of the 9 non-overlapping 3×3 boxes must contain digits 1-9 exactly once.

This gives us 27 constraints total: 9 rows × 1 constraint + 9 columns × 1 constraint + 9 boxes × 1 constraint = 27.

When checking if a value can be placed at position (row, col), we must verify that the value doesn't already exist in:

The row containing (row, col): cells (row, 0), (row, 1), ..., (row, 8)
The column containing (row, col): cells (0, col), (1, col), ..., (8, col)
The box containing (row, col): the 9 cells in the 3×3 region

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
     0   1   2   3   4   5   6   7   8
   ┌───────────┬───────────┬───────────┐
 0 │ .   .   . │ .   .   C │ .   .   . │   C = Column constraint
 1 │ .   .   . │ .   .   C │ .   .   . │
 2 │ .   .   . │ .   .   C │ .   .   . │
   ├───────────┼───────────┼───────────┤
 3 │ .   .   . │ B   B   X │ .   .   . │   B = Box constraint
 4 │ R   R   R │ R   R  [*] │ R   R   R │   R = Row constraint
 5 │ .   .   . │ B   B   X │ .   .   . │   [*] = Cell being checked
   ├───────────┼───────────┼───────────┤       X = Both Box and Column
 6 │ .   .   . │ .   .   C │ .   .   . │
 7 │ .   .   . │ .   .   C │ .   .   . │
 8 │ .   .   . │ .   .   C │ .   .   . │
   └───────────┴───────────┴───────────┘
 
Cell (4,5) is constrained by:
 - Row 4:    8 other cells (R)
 - Column 5: 8 other cells (C)
 - Box (3,3): 8 other cells (B), but 4 overlap with row/column
 
 Total unique constraining cells: 8 + 8 + 4 = 20 peers

The 20 Peers

Every cell has exactly 20 peer cells: 8 in its row, 8 in its column, and 4 additional cells in its box (the other 4 box cells are already counted in the row or column). When placing a value, we only need to verify the value doesn't appear among these 20 peers.

Naïve Validity Checking

The most straightforward approach to validity checking examines the relevant cells directly from the board. While not optimal, this approach is important to understand as a baseline and for its clarity.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
def is_valid_placement_naive(board, row, col, value):
    """
    Check if placing 'value' at position (row, col) is valid.
    
    Performs three separate scans:
    1. Row scan: Check all cells in the same row
    2. Column scan: Check all cells in the same column
    3. Box scan: Check all cells in the same 3×3 box
    
    Time Complexity: O(9 + 9 + 9) = O(27) = O(1) per check
    Space Complexity: O(1)
    """
    # Check row: does 'value' already exist in this row?
    for c in range(9):
        if c != col and board[row][c] == value:
            return False
    
    # Check column: does 'value' already exist in this column?
    for r in range(9):
        if r != row and board[r][col] == value:
            return False
    
    # Check 3×3 box: does 'value' already exist in this box?
    # First, find the top-left corner of the box
    box_row = (row // 3) * 3  # 0, 3, or 6
    box_col = (col // 3) * 3  # 0, 3, or 6
    
    for r in range(box_row, box_row + 3):
        for c in range(box_col, box_col + 3):
            if (r != row or c != col) and board[r][c] == value:
                return False
    
    return True  # Passed all checks
 
# Usage example:
# if is_valid_placement_naive(board, 4, 5, 7):
#     board[4][5] = 7  # Safe to place

Analysis of the naïve approach:

Time complexity: O(9 + 9 + 9) = O(27) per check. While this is technically O(1) for fixed-size Sudoku, the constant factor of 27 comparisons per placement matters when we make millions of placements.

Space complexity: O(1)—no additional data structures needed.

Pros:

Simple and easy to understand
No additional memory overhead
Works directly with the board representation
Easy to verify correctness

Cons:

Repeated scanning is wasteful
For hard puzzles with millions of validity checks, the 27× overhead adds up
No utilization of previously computed information

When Naïve Is Acceptable

For educational purposes, prototyping, or very easy puzzles, the naïve approach is perfectly fine. Its clarity makes it excellent for understanding and debugging. However, for competitive or production solvers, we need better approaches.

Auxiliary Data Structures for O(1) Checking

The key insight for efficient validity checking is precomputation: maintain data structures that track which values are present in each row, column, and box. Checking validity then becomes a simple lookup rather than a scan.

The approach:

Maintain three sets of tracking structures:

row_used[r]: set of values already used in row r
col_used[c]: set of values already used in column c
box_used[b]: set of values already used in box b

A placement is valid if and only if the value is not in any of the three relevant sets.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
class SudokuSolverWithSets:
    """
    Sudoku solver using set-based validity checking.
    Achieves O(1) validity checks through auxiliary data structures.
    """
    
    def __init__(self, puzzle):
        self.board = [row[:] for row in puzzle]
        
        # Tracking structures: which values are used in each unit
        self.row_used = [set() for _ in range(9)]
        self.col_used = [set() for _ in range(9)]
        self.box_used = [set() for _ in range(9)]
        
        # Initialize from initial puzzle state
        for r in range(9):
            for c in range(9):
                if self.board[r][c] != 0:
                    value = self.board[r][c]
                    self.row_used[r].add(value)
                    self.col_used[c].add(value)
                    self.box_used[self._box_index(r, c)].add(value)
    
    def _box_index(self, row, col):
        """Convert (row, col) to box index 0-8."""
        return (row // 3) * 3 + (col // 3)
    
    def is_valid(self, row, col, value):
        """
        O(1) validity check using precomputed sets.
        
        Simply check membership in three sets.
        """
        box = self._box_index(row, col)
        return (value not in self.row_used[row] and
                value not in self.col_used[col] and
                value not in self.box_used[box])
    
    def place(self, row, col, value):
        """Place a value and update tracking structures."""
        self.board[row][col] = value
        box = self._box_index(row, col)
        self.row_used[row].add(value)
        self.col_used[col].add(value)
        self.box_used[box].add(value)
    
    def remove(self, row, col, value):
        """Remove a value (backtrack) and update tracking structures."""
        self.board[row][col] = 0
        box = self._box_index(row, col)
        self.row_used[row].discard(value)
        self.col_used[col].discard(value)
        self.box_used[box].discard(value)

Analysis:

Time complexity:

is_valid(): O(1) — three set membership tests
place(): O(1) — three set insertions
remove(): O(1) — three set deletions

Space complexity: O(27 × 9) = O(243) = O(1) for fixed Sudoku — 27 sets, each holding at most 9 values.

Correctness invariant:

The tracking structures must always accurately reflect the board state. This means:

Every place() must add the value to all three relevant sets
Every remove() must remove the value from all three relevant sets
Initialization must correctly populate sets from given clues

A common bug is forgetting to update one of the three structures, leading to incorrect validity results.

Synchronization Is Critical

The board and the tracking structures must always be synchronized. If you modify the board directly without updating the sets, validity checking will give wrong answers. Always use the place() and remove() methods, never board[r][c] = value directly.

Bitset Optimization for Maximum Performance

For the fastest possible validity checking, we can replace sets with bitmasks. A 16-bit integer can represent the presence/absence of all 9 Sudoku digits (using bits 1-9, ignoring bit 0). This approach is:

Faster: Bitwise operations are among the fastest CPU operations
More cache-friendly: Integers fit in registers; no memory allocations for set nodes
Copy-efficient: Copying an integer is O(1) with minimal overhead
Parallelism-friendly: Multiple checks can be combined with bitwise AND

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
class SudokuSolverBitset:
    """
    Ultra-fast Sudoku solver using bitset-based validity checking.
    
    Each row/column/box is represented as a 16-bit bitmask.
    Bit i is set if digit i is present (we use bits 1-9).
    """
    
    def __init__(self, puzzle):
        self.board = [row[:] for row in puzzle]
        
        # Bitmasks: bit i is set if digit i is used
        self.row_mask = [0] * 9
        self.col_mask = [0] * 9
        self.box_mask = [0] * 9
        
        # Initialize from puzzle
        for r in range(9):
            for c in range(9):
                if self.board[r][c] != 0:
                    self._set_bit(r, c, self.board[r][c])
    
    def _box_index(self, row, col):
        return (row // 3) * 3 + (col // 3)
    
    def _set_bit(self, row, col, value):
        """Set the bit for 'value' in row, column, and box masks."""
        bit = 1 << value  # Bit position for this value
        self.row_mask[row] |= bit
        self.col_mask[col] |= bit
        self.box_mask[self._box_index(row, col)] |= bit
    
    def _clear_bit(self, row, col, value):
        """Clear the bit for 'value' in row, column, and box masks."""
        bit = ~(1 << value)  # Inverted bit mask
        self.row_mask[row] &= bit
        self.col_mask[col] &= bit
        self.box_mask[self._box_index(row, col)] &= bit
    
    def is_valid(self, row, col, value):
        """
        O(1) validity check using bitwise operations.
        
        Check if the bit for 'value' is set in any of the three masks.
        If any bit is set, the value is already used (invalid).
        """
        bit = 1 << value
        box = self._box_index(row, col)
        
        # Combine masks with OR, check if value's bit is set
        combined = self.row_mask[row] | self.col_mask[col] | self.box_mask[box]
        return (combined & bit) == 0
    
    def get_valid_values(self, row, col):
        """
        Return all valid values for a cell as a bitmask.
        Extremely efficient: single bitwise computation.
        """
        box = self._box_index(row, col)
        
        # Used values in row, column, and box
        used = self.row_mask[row] | self.col_mask[col] | self.box_mask[box]
        
        # Valid values are those NOT used (complement, masked to bits 1-9)
        valid_mask = (~used) & 0b1111111110  # Bits 1-9
        
        return valid_mask
    
    def count_valid_values(self, row, col):
        """Count valid values using population count (popcount)."""
        valid_mask = self.get_valid_values(row, col)
        return bin(valid_mask).count('1')
    
    def place(self, row, col, value):
        self.board[row][col] = value
        self._set_bit(row, col, value)
    
    def remove(self, row, col, value):
        self.board[row][col] = 0
        self._clear_bit(row, col, value)

Performance Comparison of Validity Check Methods
Method	Time per Check	Memory	Implementation Complexity
Naïve Scan	~27 comparisons	0 extra	Very simple
Set-Based	3 hash lookups	~243 set entries	Simple
Bitset-Based	3 bitwise ops	27 integers	Moderate
Bitset Combined	1-2 bitwise ops	27 integers	Moderate

Population Count Optimization

Modern CPUs have a native POPCNT instruction that counts set bits in a single cycle. In high-performance solvers, count_valid_values() becomes a single CPU instruction, making MRV computation extremely fast. Python's bin(x).count('1') is slower; use gmpy2.popcount() or similar for production code.

Incremental Validity Checking

So far, we've discussed checking validity before placement. An alternative paradigm is incremental checking: maintain invariants that guarantee the board is always valid by only allowing valid placements. This subtly changes how we structure the solver.

Incremental Checking Principles

•Domain-based filtering: Before attempting placement, filter choices to only valid values. Validity checking is baked into domain computation.
•Eager propagation: When a value is placed, immediately update all affected domains. Domains never contain invalid values.
•No explicit validity check: If a value is in a cell's domain, it's guaranteed valid. Placement proceeds without checking.
•Empty domain detection: If propagation empties any domain, we've detected invalidity implicitly—backtrack without explicit check.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
class SudokuSolverDomains:
    """
    Solver using domain-based incremental validity.
    
    Key insight: Instead of checking validity at placement time,
    we maintain domains such that every value in a domain is
    guaranteed valid. Validity is enforced through propagation.
    """
    
    def __init__(self, puzzle):
        self.board = [row[:] for row in puzzle]
        
        # Each cell's domain: set of valid values
        # Initialized to {1..9}, then constrained by given values
        self.domains = [[set(range(1, 10)) for _ in range(9)] for _ in range(9)]
        
        # Initialize: propagate all given clues
        for r in range(9):
            for c in range(9):
                if puzzle[r][c] != 0:
                    value = puzzle[r][c]
                    self.domains[r][c] = {value}  # Singleton domain
                    self._propagate_constraint(r, c, value)
    
    def _propagate_constraint(self, row, col, value):
        """
        Remove 'value' from domains of all peers.
        Called after placing 'value' at (row, col).
        """
        peers = self._get_peers(row, col)
        
        for peer_r, peer_c in peers:
            self.domains[peer_r][peer_c].discard(value)
    
    def _get_peers(self, row, col):
        """Get all 20 peers of (row, col)."""
        peers = []
        
        # Row peers
        for c in range(9):
            if c != col:
                peers.append((row, c))
        
        # Column peers
        for r in range(9):
            if r != row:
                peers.append((r, col))
        
        # Box peers (not already in row or column)
        box_r, box_c = (row // 3) * 3, (col // 3) * 3
        for r in range(box_r, box_r + 3):
            for c in range(box_c, box_c + 3):
                if r != row and c != col:
                    peers.append((r, c))
        
        return peers
    
    def solve(self):
        # Find cell with smallest non-singleton domain
        best_cell = None
        min_size = 10
        
        for r in range(9):
            for c in range(9):
                if self.board[r][c] == 0:
                    size = len(self.domains[r][c])
                    if size == 0:
                        return False  # Domain empty = contradiction
                    if size < min_size:
                        min_size = size
                        best_cell = (r, c)
        
        if best_cell is None:
            return True  # All cells filled = solved
        
        row, col = best_cell
        
        # Try each value in domain - ALL are guaranteed valid!
        for value in list(self.domains[row][col]):
            # Save state
            saved_board = [r[:] for r in self.board]
            saved_domains = [[d.copy() for d in row] for row in self.domains]
            
            # Place and propagate
            self.board[row][col] = value
            self.domains[row][col] = {value}
            self._propagate_constraint(row, col, value)
            
            # Recurse - note: no validity check needed!
            if self.solve():
                return True
            
            # Backtrack: restore state
            self.board = saved_board
            self.domains = saved_domains
        
        return False

Trade-offs of incremental checking:

Pros:

No per-placement validity checks during search
Domains provide built-in MRV information
Empty domain gives early contradiction detection
Natural integration with constraint propagation

Cons:

Higher overhead per placement (propagation to 20 peers)
More expensive state saving/restoration for backtracking
Domain bookkeeping complexity

When to use which:

For simple solvers, explicit validity checks (especially bitset-based) are straightforward and fast. For sophisticated solvers with extensive constraint propagation, domain-based approaches are more natural because propagation maintains domains anyway—validity comes 'for free' as a byproduct.

Full Board Validation

Sometimes we need to validate an entire board—perhaps to check a proposed solution, verify initial puzzle validity, or validate user input. This requires checking all 27 constraints comprehensively.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
def validate_sudoku_complete(board):
    """
    Validate a complete Sudoku board.
    Returns (is_valid, error_message) tuple.
    
    Checks:
    1. All cells contain values 1-9 (completeness)
    2. Each row contains 1-9 exactly once
    3. Each column contains 1-9 exactly once
    4. Each 3×3 box contains 1-9 exactly once
    """
    # Check cell values
    for r in range(9):
        for c in range(9):
            if board[r][c] not in range(1, 10):
                return False, f"Invalid value {board[r][c]} at ({r}, {c})"
    
    # Check rows
    for r in range(9):
        seen = set()
        for c in range(9):
            val = board[r][c]
            if val in seen:
                return False, f"Duplicate {val} in row {r}"
            seen.add(val)
    
    # Check columns
    for c in range(9):
        seen = set()
        for r in range(9):
            val = board[r][c]
            if val in seen:
                return False, f"Duplicate {val} in column {c}"
            seen.add(val)
    
    # Check boxes
    for box_r in range(3):
        for box_c in range(3):
            seen = set()
            for r in range(box_r * 3, box_r * 3 + 3):
                for c in range(box_c * 3, box_c * 3 + 3):
                    val = board[r][c]
                    if val in seen:
                        return False, f"Duplicate {val} in box ({box_r}, {box_c})"
                    seen.add(val)
    
    return True, "Valid complete Sudoku"
 
def validate_sudoku_partial(board):
    """
    Validate a partially filled Sudoku board.
    Empty cells (0) are allowed but filled cells must not violate constraints.
    """
    # Check rows (ignoring zeros)
    for r in range(9):
        seen = set()
        for c in range(9):
            val = board[r][c]
            if val == 0:
                continue
            if val not in range(1, 10):
                return False, f"Invalid value {val} at ({r}, {c})"
            if val in seen:
                return False, f"Duplicate {val} in row {r}"
            seen.add(val)
    
    # Check columns (ignoring zeros)
    for c in range(9):
        seen = set()
        for r in range(9):
            val = board[r][c]
            if val == 0:
                continue
            if val in seen:
                return False, f"Duplicate {val} in column {c}"
            seen.add(val)
    
    # Check boxes (ignoring zeros)
    for box_r in range(3):
        for box_c in range(3):
            seen = set()
            for r in range(box_r * 3, box_r * 3 + 3):
                for c in range(box_c * 3, box_c * 3 + 3):
                    val = board[r][c]
                    if val == 0:
                        continue
                    if val in seen:
                        return False, f"Duplicate {val} in box ({box_r}, {box_c})"
                    seen.add(val)
    
    return True, "Valid partial Sudoku"

Validation vs. Solving

Validation tells you if a configuration is valid—it doesn't tell you if it's solvable or has a unique solution. A valid partial board might have zero solutions (painted into a corner) or multiple solutions (underspecified). These are separate questions requiring different analysis.

Common Validity Checking Bugs

Validity checking seems simple, but subtle bugs can cause incorrect solver behavior. Here are the most common pitfalls and how to avoid them:

Common Bugs and Fixes

•Self-comparison bug: When checking if value v can be placed at (r,c), make sure not to compare against (r,c) itself if v is already there. Fix: Explicitly skip the target cell in loops.
•Box index calculation error: Common to miscompute (row // 3) * 3 as (row / 3) * 3 (floating-point division) or confuse box indexing. Fix: Test box_index() separately with edge cases like (2,5), (3,0), (8,8).
•Off-by-one in domains: Using sets {0..8} instead of {1..9}, or bitmasks with bit 0 instead of bits 1-9. Fix: Be consistent and document your convention.
•State synchronization failure: Updating board but forgetting to update row_mask/col_mask/box_mask (or vice versa). Fix: Always use place()/remove() methods, never raw assignment.
•Backtrack restoration bug: Failing to fully restore tracking structures when backtracking. Fix: Test that after place()+remove(), state equals original state.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
def test_validity_checker():
    """
    Comprehensive tests for validity checking implementation.
    Run these before trusting your solver!
    """
    # Empty board: everything should be valid
    empty = [[0] * 9 for _ in range(9)]
    for r in range(9):
        for c in range(9):
            for v in range(1, 10):
                assert is_valid(empty, r, c, v), f"Empty board: ({r},{c})={v} should be valid"
    
    # Single value: only same row/col/box should be affected
    board = [[0] * 9 for _ in range(9)]
    board[4][4] = 5  # Center cell
    
    # Same value in same row should be invalid
    assert not is_valid(board, 4, 0, 5), "Same row should be invalid"
    assert not is_valid(board, 4, 8, 5), "Same row should be invalid"
    
    # Same value in same column should be invalid
    assert not is_valid(board, 0, 4, 5), "Same column should be invalid"
    assert not is_valid(board, 8, 4, 5), "Same column should be invalid"
    
    # Same value in same box should be invalid
    assert not is_valid(board, 3, 3, 5), "Same box should be invalid"
    assert not is_valid(board, 5, 5, 5), "Same box should be invalid"
    
    # Different value should be valid (no conflict)
    assert is_valid(board, 4, 0, 7), "Different value should be valid"
    
    # Outside row/col/box with same value should be valid
    assert is_valid(board, 0, 0, 5), "Outside scope should be valid"
    assert is_valid(board, 8, 8, 5), "Outside scope should be valid"
    
    # Box boundary tests
    board2 = [[0] * 9 for _ in range(9)]
    board2[2][2] = 9  # Bottom-right of top-left box
    
    assert not is_valid(board2, 0, 0, 9), "Top-left of same box should be invalid"
    assert is_valid(board2, 3, 0, 9), "Different box (below) should be valid"
    assert is_valid(board2, 0, 3, 9), "Different box (right) should be valid"
    
    print("All validity tests passed!")

Test Before You Trust

A bug in validity checking can cause your solver to either accept invalid placements (producing wrong 'solutions') or reject valid placements (failing to find solutions that exist). Always run comprehensive tests before using your solver for anything important.

Summary: Validity Checking Mastered

We've explored validity checking from basic concepts to highly optimized implementations. Let's consolidate the key insights:

Key Takeaways

•Three constraints must be satisfied: row uniqueness, column uniqueness, and box uniqueness. Each cell has 20 peers across these constraints.
•Naïve checking scans 27 cells per check. Simple and correct, but the constant factor matters at scale.
•Auxiliary data structures (sets or bitmasks) enable O(1) validity checks by precomputing which values are used in each unit.
•Bitset representation is optimal: uses minimal memory, executes in a few CPU cycles, and enables efficient operations like counting valid values.
•Incremental checking integrates validity into domain maintenance—if a value is in a domain, it's guaranteed valid. More complex but natural for advanced solvers.
•Full board validation is needed for verifying solutions and initial puzzles. Handle both complete and partial boards appropriately.
•Testing is essential—subtle bugs in validity checking corrupt solver correctness. Test edge cases thoroughly.

Looking Ahead

With constraint satisfaction, cell-by-cell exploration, and validity checking mastered, we have a working Sudoku solver. The final page focuses on optimization strategies—techniques that transform a working solver into a lightning-fast one, capable of solving even the hardest puzzles in milliseconds.

The role of validity checking:

Think of validity checking as the immune system of your solver. It prevents invalid configurations from propagating, ensuring that every partial solution explored is potentially completable. Without robust validity checking, the search would waste time exploring impossible paths. With it, every step moves purposefully toward a solution.

Validity Checking

The Guardian of Correctness

What You Will Learn

The Three Constraints

The three unit types:

Rows: Each of the 9 horizontal rows must contain digits 1-9 exactly once.
Columns: Each of the 9 vertical columns must contain digits 1-9 exactly once.
Boxes: Each of the 9 non-overlapping 3×3 boxes must contain digits 1-9 exactly once.

This gives us 27 constraints total: 9 rows × 1 constraint + 9 columns × 1 constraint + 9 boxes × 1 constraint = 27.

When checking if a value can be placed at position (row, col), we must verify that the value doesn't already exist in:

The row containing (row, col): cells (row, 0), (row, 1), ..., (row, 8)
The column containing (row, col): cells (0, col), (1, col), ..., (8, col)
The box containing (row, col): the 9 cells in the 3×3 region

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
     0   1   2   3   4   5   6   7   8
   ┌───────────┬───────────┬───────────┐
 0 │ .   .   . │ .   .   C │ .   .   . │   C = Column constraint
 1 │ .   .   . │ .   .   C │ .   .   . │
 2 │ .   .   . │ .   .   C │ .   .   . │
   ├───────────┼───────────┼───────────┤
 3 │ .   .   . │ B   B   X │ .   .   . │   B = Box constraint
 4 │ R   R   R │ R   R  [*] │ R   R   R │   R = Row constraint
 5 │ .   .   . │ B   B   X │ .   .   . │   [*] = Cell being checked
   ├───────────┼───────────┼───────────┤       X = Both Box and Column
 6 │ .   .   . │ .   .   C │ .   .   . │
 7 │ .   .   . │ .   .   C │ .   .   . │
 8 │ .   .   . │ .   .   C │ .   .   . │
   └───────────┴───────────┴───────────┘
 
Cell (4,5) is constrained by:
 - Row 4:    8 other cells (R)
 - Column 5: 8 other cells (C)
 - Box (3,3): 8 other cells (B), but 4 overlap with row/column
 
 Total unique constraining cells: 8 + 8 + 4 = 20 peers

The 20 Peers

Naïve Validity Checking

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
def is_valid_placement_naive(board, row, col, value):
    """
    Check if placing 'value' at position (row, col) is valid.
    
    Performs three separate scans:
    1. Row scan: Check all cells in the same row
    2. Column scan: Check all cells in the same column
    3. Box scan: Check all cells in the same 3×3 box
    
    Time Complexity: O(9 + 9 + 9) = O(27) = O(1) per check
    Space Complexity: O(1)
    """
    # Check row: does 'value' already exist in this row?
    for c in range(9):
        if c != col and board[row][c] == value:
            return False
    
    # Check column: does 'value' already exist in this column?
    for r in range(9):
        if r != row and board[r][col] == value:
            return False
    
    # Check 3×3 box: does 'value' already exist in this box?
    # First, find the top-left corner of the box
    box_row = (row // 3) * 3  # 0, 3, or 6
    box_col = (col // 3) * 3  # 0, 3, or 6
    
    for r in range(box_row, box_row + 3):
        for c in range(box_col, box_col + 3):
            if (r != row or c != col) and board[r][c] == value:
                return False
    
    return True  # Passed all checks
 
# Usage example:
# if is_valid_placement_naive(board, 4, 5, 7):
#     board[4][5] = 7  # Safe to place

Analysis of the naïve approach:

Space complexity: O(1)—no additional data structures needed.

Pros:

Simple and easy to understand
No additional memory overhead
Works directly with the board representation
Easy to verify correctness

Cons:

Repeated scanning is wasteful
For hard puzzles with millions of validity checks, the 27× overhead adds up
No utilization of previously computed information

When Naïve Is Acceptable

Auxiliary Data Structures for O(1) Checking

The approach:

Maintain three sets of tracking structures:

row_used[r]: set of values already used in row r
col_used[c]: set of values already used in column c
box_used[b]: set of values already used in box b

A placement is valid if and only if the value is not in any of the three relevant sets.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
class SudokuSolverWithSets:
    """
    Sudoku solver using set-based validity checking.
    Achieves O(1) validity checks through auxiliary data structures.
    """
    
    def __init__(self, puzzle):
        self.board = [row[:] for row in puzzle]
        
        # Tracking structures: which values are used in each unit
        self.row_used = [set() for _ in range(9)]
        self.col_used = [set() for _ in range(9)]
        self.box_used = [set() for _ in range(9)]
        
        # Initialize from initial puzzle state
        for r in range(9):
            for c in range(9):
                if self.board[r][c] != 0:
                    value = self.board[r][c]
                    self.row_used[r].add(value)
                    self.col_used[c].add(value)
                    self.box_used[self._box_index(r, c)].add(value)
    
    def _box_index(self, row, col):
        """Convert (row, col) to box index 0-8."""
        return (row // 3) * 3 + (col // 3)
    
    def is_valid(self, row, col, value):
        """
        O(1) validity check using precomputed sets.
        
        Simply check membership in three sets.
        """
        box = self._box_index(row, col)
        return (value not in self.row_used[row] and
                value not in self.col_used[col] and
                value not in self.box_used[box])
    
    def place(self, row, col, value):
        """Place a value and update tracking structures."""
        self.board[row][col] = value
        box = self._box_index(row, col)
        self.row_used[row].add(value)
        self.col_used[col].add(value)
        self.box_used[box].add(value)
    
    def remove(self, row, col, value):
        """Remove a value (backtrack) and update tracking structures."""
        self.board[row][col] = 0
        box = self._box_index(row, col)
        self.row_used[row].discard(value)
        self.col_used[col].discard(value)
        self.box_used[box].discard(value)

Analysis:

Time complexity:

is_valid(): O(1) — three set membership tests
place(): O(1) — three set insertions
remove(): O(1) — three set deletions

Space complexity: O(27 × 9) = O(243) = O(1) for fixed Sudoku — 27 sets, each holding at most 9 values.

Correctness invariant:

The tracking structures must always accurately reflect the board state. This means:

Every place() must add the value to all three relevant sets
Every remove() must remove the value from all three relevant sets
Initialization must correctly populate sets from given clues

A common bug is forgetting to update one of the three structures, leading to incorrect validity results.

Synchronization Is Critical

Bitset Optimization for Maximum Performance

Faster: Bitwise operations are among the fastest CPU operations
More cache-friendly: Integers fit in registers; no memory allocations for set nodes
Copy-efficient: Copying an integer is O(1) with minimal overhead
Parallelism-friendly: Multiple checks can be combined with bitwise AND

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
class SudokuSolverBitset:
    """
    Ultra-fast Sudoku solver using bitset-based validity checking.
    
    Each row/column/box is represented as a 16-bit bitmask.
    Bit i is set if digit i is present (we use bits 1-9).
    """
    
    def __init__(self, puzzle):
        self.board = [row[:] for row in puzzle]
        
        # Bitmasks: bit i is set if digit i is used
        self.row_mask = [0] * 9
        self.col_mask = [0] * 9
        self.box_mask = [0] * 9
        
        # Initialize from puzzle
        for r in range(9):
            for c in range(9):
                if self.board[r][c] != 0:
                    self._set_bit(r, c, self.board[r][c])
    
    def _box_index(self, row, col):
        return (row // 3) * 3 + (col // 3)
    
    def _set_bit(self, row, col, value):
        """Set the bit for 'value' in row, column, and box masks."""
        bit = 1 << value  # Bit position for this value
        self.row_mask[row] |= bit
        self.col_mask[col] |= bit
        self.box_mask[self._box_index(row, col)] |= bit
    
    def _clear_bit(self, row, col, value):
        """Clear the bit for 'value' in row, column, and box masks."""
        bit = ~(1 << value)  # Inverted bit mask
        self.row_mask[row] &= bit
        self.col_mask[col] &= bit
        self.box_mask[self._box_index(row, col)] &= bit
    
    def is_valid(self, row, col, value):
        """
        O(1) validity check using bitwise operations.
        
        Check if the bit for 'value' is set in any of the three masks.
        If any bit is set, the value is already used (invalid).
        """
        bit = 1 << value
        box = self._box_index(row, col)
        
        # Combine masks with OR, check if value's bit is set
        combined = self.row_mask[row] | self.col_mask[col] | self.box_mask[box]
        return (combined & bit) == 0
    
    def get_valid_values(self, row, col):
        """
        Return all valid values for a cell as a bitmask.
        Extremely efficient: single bitwise computation.
        """
        box = self._box_index(row, col)
        
        # Used values in row, column, and box
        used = self.row_mask[row] | self.col_mask[col] | self.box_mask[box]
        
        # Valid values are those NOT used (complement, masked to bits 1-9)
        valid_mask = (~used) & 0b1111111110  # Bits 1-9
        
        return valid_mask
    
    def count_valid_values(self, row, col):
        """Count valid values using population count (popcount)."""
        valid_mask = self.get_valid_values(row, col)
        return bin(valid_mask).count('1')
    
    def place(self, row, col, value):
        self.board[row][col] = value
        self._set_bit(row, col, value)
    
    def remove(self, row, col, value):
        self.board[row][col] = 0
        self._clear_bit(row, col, value)

Performance Comparison of Validity Check Methods
Method	Time per Check	Memory	Implementation Complexity
Naïve Scan	~27 comparisons	0 extra	Very simple
Set-Based	3 hash lookups	~243 set entries	Simple
Bitset-Based	3 bitwise ops	27 integers	Moderate
Bitset Combined	1-2 bitwise ops	27 integers	Moderate

Population Count Optimization

Incremental Validity Checking

Incremental Checking Principles

•Domain-based filtering: Before attempting placement, filter choices to only valid values. Validity checking is baked into domain computation.
•Eager propagation: When a value is placed, immediately update all affected domains. Domains never contain invalid values.
•No explicit validity check: If a value is in a cell's domain, it's guaranteed valid. Placement proceeds without checking.
•Empty domain detection: If propagation empties any domain, we've detected invalidity implicitly—backtrack without explicit check.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
class SudokuSolverDomains:
    """
    Solver using domain-based incremental validity.
    
    Key insight: Instead of checking validity at placement time,
    we maintain domains such that every value in a domain is
    guaranteed valid. Validity is enforced through propagation.
    """
    
    def __init__(self, puzzle):
        self.board = [row[:] for row in puzzle]
        
        # Each cell's domain: set of valid values
        # Initialized to {1..9}, then constrained by given values
        self.domains = [[set(range(1, 10)) for _ in range(9)] for _ in range(9)]
        
        # Initialize: propagate all given clues
        for r in range(9):
            for c in range(9):
                if puzzle[r][c] != 0:
                    value = puzzle[r][c]
                    self.domains[r][c] = {value}  # Singleton domain
                    self._propagate_constraint(r, c, value)
    
    def _propagate_constraint(self, row, col, value):
        """
        Remove 'value' from domains of all peers.
        Called after placing 'value' at (row, col).
        """
        peers = self._get_peers(row, col)
        
        for peer_r, peer_c in peers:
            self.domains[peer_r][peer_c].discard(value)
    
    def _get_peers(self, row, col):
        """Get all 20 peers of (row, col)."""
        peers = []
        
        # Row peers
        for c in range(9):
            if c != col:
                peers.append((row, c))
        
        # Column peers
        for r in range(9):
            if r != row:
                peers.append((r, col))
        
        # Box peers (not already in row or column)
        box_r, box_c = (row // 3) * 3, (col // 3) * 3
        for r in range(box_r, box_r + 3):
            for c in range(box_c, box_c + 3):
                if r != row and c != col:
                    peers.append((r, c))
        
        return peers
    
    def solve(self):
        # Find cell with smallest non-singleton domain
        best_cell = None
        min_size = 10
        
        for r in range(9):
            for c in range(9):
                if self.board[r][c] == 0:
                    size = len(self.domains[r][c])
                    if size == 0:
                        return False  # Domain empty = contradiction
                    if size < min_size:
                        min_size = size
                        best_cell = (r, c)
        
        if best_cell is None:
            return True  # All cells filled = solved
        
        row, col = best_cell
        
        # Try each value in domain - ALL are guaranteed valid!
        for value in list(self.domains[row][col]):
            # Save state
            saved_board = [r[:] for r in self.board]
            saved_domains = [[d.copy() for d in row] for row in self.domains]
            
            # Place and propagate
            self.board[row][col] = value
            self.domains[row][col] = {value}
            self._propagate_constraint(row, col, value)
            
            # Recurse - note: no validity check needed!
            if self.solve():
                return True
            
            # Backtrack: restore state
            self.board = saved_board
            self.domains = saved_domains
        
        return False

Trade-offs of incremental checking:

Pros:

No per-placement validity checks during search
Domains provide built-in MRV information
Empty domain gives early contradiction detection
Natural integration with constraint propagation

Cons:

Higher overhead per placement (propagation to 20 peers)
More expensive state saving/restoration for backtracking
Domain bookkeeping complexity

When to use which:

Full Board Validation

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
def validate_sudoku_complete(board):
    """
    Validate a complete Sudoku board.
    Returns (is_valid, error_message) tuple.
    
    Checks:
    1. All cells contain values 1-9 (completeness)
    2. Each row contains 1-9 exactly once
    3. Each column contains 1-9 exactly once
    4. Each 3×3 box contains 1-9 exactly once
    """
    # Check cell values
    for r in range(9):
        for c in range(9):
            if board[r][c] not in range(1, 10):
                return False, f"Invalid value {board[r][c]} at ({r}, {c})"
    
    # Check rows
    for r in range(9):
        seen = set()
        for c in range(9):
            val = board[r][c]
            if val in seen:
                return False, f"Duplicate {val} in row {r}"
            seen.add(val)
    
    # Check columns
    for c in range(9):
        seen = set()
        for r in range(9):
            val = board[r][c]
            if val in seen:
                return False, f"Duplicate {val} in column {c}"
            seen.add(val)
    
    # Check boxes
    for box_r in range(3):
        for box_c in range(3):
            seen = set()
            for r in range(box_r * 3, box_r * 3 + 3):
                for c in range(box_c * 3, box_c * 3 + 3):
                    val = board[r][c]
                    if val in seen:
                        return False, f"Duplicate {val} in box ({box_r}, {box_c})"
                    seen.add(val)
    
    return True, "Valid complete Sudoku"
 
def validate_sudoku_partial(board):
    """
    Validate a partially filled Sudoku board.
    Empty cells (0) are allowed but filled cells must not violate constraints.
    """
    # Check rows (ignoring zeros)
    for r in range(9):
        seen = set()
        for c in range(9):
            val = board[r][c]
            if val == 0:
                continue
            if val not in range(1, 10):
                return False, f"Invalid value {val} at ({r}, {c})"
            if val in seen:
                return False, f"Duplicate {val} in row {r}"
            seen.add(val)
    
    # Check columns (ignoring zeros)
    for c in range(9):
        seen = set()
        for r in range(9):
            val = board[r][c]
            if val == 0:
                continue
            if val in seen:
                return False, f"Duplicate {val} in column {c}"
            seen.add(val)
    
    # Check boxes (ignoring zeros)
    for box_r in range(3):
        for box_c in range(3):
            seen = set()
            for r in range(box_r * 3, box_r * 3 + 3):
                for c in range(box_c * 3, box_c * 3 + 3):
                    val = board[r][c]
                    if val == 0:
                        continue
                    if val in seen:
                        return False, f"Duplicate {val} in box ({box_r}, {box_c})"
                    seen.add(val)
    
    return True, "Valid partial Sudoku"

Validation vs. Solving

Common Validity Checking Bugs

Validity checking seems simple, but subtle bugs can cause incorrect solver behavior. Here are the most common pitfalls and how to avoid them:

Common Bugs and Fixes

•Self-comparison bug: When checking if value v can be placed at (r,c), make sure not to compare against (r,c) itself if v is already there. Fix: Explicitly skip the target cell in loops.
•Box index calculation error: Common to miscompute (row // 3) * 3 as (row / 3) * 3 (floating-point division) or confuse box indexing. Fix: Test box_index() separately with edge cases like (2,5), (3,0), (8,8).
•Off-by-one in domains: Using sets {0..8} instead of {1..9}, or bitmasks with bit 0 instead of bits 1-9. Fix: Be consistent and document your convention.
•State synchronization failure: Updating board but forgetting to update row_mask/col_mask/box_mask (or vice versa). Fix: Always use place()/remove() methods, never raw assignment.
•Backtrack restoration bug: Failing to fully restore tracking structures when backtracking. Fix: Test that after place()+remove(), state equals original state.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
def test_validity_checker():
    """
    Comprehensive tests for validity checking implementation.
    Run these before trusting your solver!
    """
    # Empty board: everything should be valid
    empty = [[0] * 9 for _ in range(9)]
    for r in range(9):
        for c in range(9):
            for v in range(1, 10):
                assert is_valid(empty, r, c, v), f"Empty board: ({r},{c})={v} should be valid"
    
    # Single value: only same row/col/box should be affected
    board = [[0] * 9 for _ in range(9)]
    board[4][4] = 5  # Center cell
    
    # Same value in same row should be invalid
    assert not is_valid(board, 4, 0, 5), "Same row should be invalid"
    assert not is_valid(board, 4, 8, 5), "Same row should be invalid"
    
    # Same value in same column should be invalid
    assert not is_valid(board, 0, 4, 5), "Same column should be invalid"
    assert not is_valid(board, 8, 4, 5), "Same column should be invalid"
    
    # Same value in same box should be invalid
    assert not is_valid(board, 3, 3, 5), "Same box should be invalid"
    assert not is_valid(board, 5, 5, 5), "Same box should be invalid"
    
    # Different value should be valid (no conflict)
    assert is_valid(board, 4, 0, 7), "Different value should be valid"
    
    # Outside row/col/box with same value should be valid
    assert is_valid(board, 0, 0, 5), "Outside scope should be valid"
    assert is_valid(board, 8, 8, 5), "Outside scope should be valid"
    
    # Box boundary tests
    board2 = [[0] * 9 for _ in range(9)]
    board2[2][2] = 9  # Bottom-right of top-left box
    
    assert not is_valid(board2, 0, 0, 9), "Top-left of same box should be invalid"
    assert is_valid(board2, 3, 0, 9), "Different box (below) should be valid"
    assert is_valid(board2, 0, 3, 9), "Different box (right) should be valid"
    
    print("All validity tests passed!")

Test Before You Trust

Summary: Validity Checking Mastered

We've explored validity checking from basic concepts to highly optimized implementations. Let's consolidate the key insights:

Key Takeaways

•Three constraints must be satisfied: row uniqueness, column uniqueness, and box uniqueness. Each cell has 20 peers across these constraints.
•Naïve checking scans 27 cells per check. Simple and correct, but the constant factor matters at scale.
•Auxiliary data structures (sets or bitmasks) enable O(1) validity checks by precomputing which values are used in each unit.
•Bitset representation is optimal: uses minimal memory, executes in a few CPU cycles, and enables efficient operations like counting valid values.
•Incremental checking integrates validity into domain maintenance—if a value is in a domain, it's guaranteed valid. More complex but natural for advanced solvers.
•Full board validation is needed for verifying solutions and initial puzzles. Handle both complete and partial boards appropriately.
•Testing is essential—subtle bugs in validity checking corrupt solver correctness. Test edge cases thoroughly.

Looking Ahead

The role of validity checking: