Everything you've ever computed—every search query, every video frame, every neural network inference, every cryptocurrency transaction—ultimately reduces to operations on bits. The bit, short for 'binary digit,' is the atomic unit of digital information: a single entity that can exist in exactly one of two states.
We typically represent these states as 0 and 1. But at the hardware level, they manifest as voltage levels (low/high), magnetic orientations (north/south), optical states (dark/bright), or transistor conditions (off/on). The abstraction of 0 and 1 is how we reason about them—the physical reality is an engineering marvel.
This page explores why bits form the foundation of all computation, how hardware implements bit-level operations, and why this understanding matters for writing efficient algorithms.
By the end of this page, you will understand why binary (specifically) underlies all digital computation, how processors implement bitwise operations at the hardware level, and how this knowledge informs your approach to algorithm optimization.
Why do computers use binary rather than decimal or some other base? The answer lies in physics and engineering—binary isn't a mathematical preference; it's a physical necessity for reliable computation.
Historical context:
Early computers explored various bases. ENIAC (1945) used decimal internally. But the transition to binary proved universal—by the 1950s, nearly all digital computers adopted binary representation. The advantages in reliability, speed, and cost were overwhelming.
The modern result:
Today's processors contain billions of transistors, each operating as a binary switch. The reliability of binary representation allows these switches to operate at frequencies exceeding 5 billion cycles per second (5 GHz) while maintaining correct computation. This wouldn't be possible with multi-level (e.g., decimal) representations.
Quantum computers use qubits that can represent superpositions of 0 and 1. However, when we measure or extract information, the result collapses to classical bits. For the foreseeable future, binary remains the foundation of practical computation.
Understanding how hardware implements bits illuminates why certain operations are fast and others slow. At the lowest level, transistors act as electrically controlled switches: a voltage applied to the gate terminal determines whether current flows (on, read as 1) or not (off, read as 0).
By combining transistors, we build logic gates—the fundamental circuits that implement Boolean operations:
| Gate | Symbol | Transistors Needed | Operation | Truth Table Summary |
|---|---|---|---|---|
| NOT | ¬ | 1-2 | Inverts input | 0→1, 1→0 |
| AND | ∧ | 2-4 | Both inputs must be 1 | 1∧1=1, else 0 |
| OR | ∨ | 2-4 | At least one input is 1 | 0∨0=0, else 1 |
| NAND | ↑ | 2 | NOT(AND) | 1↑1=0, else 1 |
| NOR | ↓ | 2 | NOT(OR) | 0↓0=1, else 0 |
| XOR | ⊕ | 4-6 | Inputs differ | Same→0, Different→1 |
From gates to operations:
By connecting gates in specific configurations, we build more complex circuits:
The ALU (arithmetic logic unit) performs operations like ADD, SUBTRACT, AND, OR, XOR, and SHIFT in response to control signals. Each of these operations typically completes in a single clock cycle—the fundamental unit of processor time.
Bitwise AND, OR, XOR, and NOT operate on all bits of a word in parallel. A 64-bit AND operation doesn't take 64 times longer than a 1-bit operation—it happens simultaneously across all 64 bits in a single cycle. This parallelism is why bitwise operations are among the fastest operations a CPU can perform.
The key insight for programmers:
Bitwise operations are not abstractions that get compiled down to something else—they directly correspond to hardware operations. When you write a & b in code, the processor literally performs an AND operation on the corresponding bits using AND gates. There's no interpretation, no library call, no overhead. This directness is why bit manipulation can be so efficient.
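To make this concrete, here is a minimal sketch in JavaScript (whose bitwise operators coerce their operands to 32-bit integers) showing the three basic gate-level operations applied across all bits at once:

```javascript
// Bitwise operators act on every bit position simultaneously.
const a = 0b1100; // 12
const b = 0b1010; // 10

const and = a & b; // 0b1000 = 8  (bit set only where both inputs are set)
const or  = a | b; // 0b1110 = 14 (bit set where either input is set)
const xor = a ^ b; // 0b0110 = 6  (bit set where the inputs differ)
```

Each line compiles to a single machine instruction; there is no loop over bit positions anywhere.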
In binary representation, each bit position carries a specific weight or significance. Understanding this positional structure is essential for effective bit manipulation.
For an unsigned n-bit integer, the bits are numbered from 0 (rightmost, least significant) to n-1 (leftmost, most significant). The value of the integer is the sum of each bit multiplied by its positional weight:
For an 8-bit number with bits b₇b₆b₅b₄b₃b₂b₁b₀:

Value = b₇×2⁷ + b₆×2⁶ + b₅×2⁵ + b₄×2⁴ + b₃×2³ + b₂×2² + b₁×2¹ + b₀×2⁰
      = b₇×128 + b₆×64 + b₅×32 + b₄×16 + b₃×8 + b₂×4 + b₁×2 + b₀×1

Example: 01011010
= 0×128 + 1×64 + 0×32 + 1×16 + 1×8 + 0×4 + 1×2 + 0×1
= 64 + 16 + 8 + 2
= 90

Key terminology:
This positional structure has profound implications:
- Shifting left by one position doubles the value: n << 1 equals n × 2.
- Shifting right by one position halves it: n >> 1 equals floor(n / 2) for unsigned integers.
- The least significant bit determines whether a number is odd or even: n & 1 extracts this bit.

Visualizing bit positions:
Think of an integer's bits as a row of light switches, each controlling a different power of 2. Flipping switch 3 adds or removes 8 from the value. The switches are independent—flipping one never accidentally affects another.
This independence is what makes bit manipulation so powerful. We can encode multiple pieces of information in a single integer, manipulate them separately, and combine them freely.
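A short sketch of the switch analogy in JavaScript, using bit 3 (weight 2³ = 8) of the value 90 from the example above:

```javascript
const v = 90;                        // 01011010 in binary
const cleared  = v & ~(1 << 3);      // flip switch 3 off: 90 - 8 = 82
const restored = cleared | (1 << 3); // flip it back on: 90 again
const bit3     = (v >> 3) & 1;       // read switch 3: 1

const doubled = 45 << 1;             // 90 (left shift multiplies by 2)
const halved  = 90 >> 1;             // 45 (right shift floors division by 2)
```

Note that clearing and restoring bit 3 never disturbed any other bit—the switches really are independent.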
Bits don't exist in isolation—they're grouped into larger units for practical computation: a nibble is 4 bits (one hex digit), a byte is 8 bits (the smallest addressable unit on most architectures), and a word is the processor's natural operand size (commonly 32 or 64 bits).
Most programming works with fixed-width integer types:
| Type (C/C++) | Bits | Unsigned Range | Signed Range |
|---|---|---|---|
| uint8_t / int8_t | 8 | 0 to 255 | -128 to 127 |
| uint16_t / int16_t | 16 | 0 to 65,535 | -32,768 to 32,767 |
| uint32_t / int32_t | 32 | 0 to ~4.29 billion | ~±2.15 billion |
| uint64_t / int64_t | 64 | 0 to ~18.45 quintillion | ~±9.22 quintillion |
Why fixed-width matters for bit manipulation:
Bit manipulation techniques assume you know exactly how many bits you're working with. Consider left-shifting:
- In an 8-bit integer: 10000000 << 1 results in 00000000 (the 1 bit shifts out)
- In a 16-bit integer: 0000000010000000 << 1 results in 0000000100000000 (the 1 bit shifts into a higher position)

The same operation yields different results based on the bit width. Always be conscious of your integer type's width when performing bit manipulation.
The type int is commonly 32 bits but isn't guaranteed. For bit manipulation, prefer explicit-width types like int32_t or uint64_t. These guarantee exact bit widths across platforms, making your bit operations predictable and portable.
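JavaScript's bitwise operators always work on 32 bits, so masking can emulate a narrower type and make the width-dependent behavior visible. A small sketch:

```javascript
// 0b10000000 is 128. Shifting left gives 256; what survives depends
// on the width we mask down to.
const eightBit   = (0b10000000 << 1) & 0xFF;   // 0: the 1 shifted out of 8 bits
const sixteenBit = (0b10000000 << 1) & 0xFFFF; // 256: still inside 16 bits
```

The mask plays the role that the fixed-width type plays in C: it decides which bits exist at all.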
Endianness—byte order:
When multi-byte integers are stored in memory, the order of bytes can vary: little-endian machines (such as x86) store the least significant byte first, while big-endian machines store the most significant byte first (this is also network byte order).
For most bit manipulation, endianness doesn't matter—we work with the integer's value, not its memory layout. However, when interpreting raw bytes or writing low-level code, endianness becomes relevant.
Endianness affects byte order, not bit order within a byte. We'll revisit this when it matters; for now, focus on the logical bit structure rather than physical layout.
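If you want to see byte order directly, typed arrays offer one way in JavaScript: a `DataView` writes a 32-bit value with an explicit byte order, and a `Uint8Array` view reads the raw bytes back. A sketch:

```javascript
const buf = new ArrayBuffer(4);
// The final argument selects the byte order: true = little-endian.
new DataView(buf).setUint32(0, 0x11223344, true);
const bytes = Array.from(new Uint8Array(buf)); // [0x44, 0x33, 0x22, 0x11]
```

Writing the same value with `false` (big-endian) would store the bytes in the opposite order—the integer's value is identical either way, which is exactly why most bit manipulation can ignore endianness.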
Modern CPUs are designed around bit-parallel operations. Understanding how the CPU handles bits explains why certain operations are fast and informs optimization strategies.
Approximate CPU cycle costs (modern x86-64):

| Operation | Cycles |
|---|---|
| AND, OR, XOR, NOT | 1 |
| Shift (SHL, SHR) | 1 |
| Addition | 1 |
| Subtraction | 1 |
| Popcount | 1 |
| Multiplication | 3-4 |
| Division | 20-80 |
| Branch (predicted) | 1 |
| Branch (mispredicted) | 15-20 |

Note: division is 20-80x slower than bitwise operations!

What this means for you:
When you can express an operation using bitwise logic rather than arithmetic, you're often choosing the fastest possible implementation. Replacing n % 2 with n & 1 isn't just stylistic—it's potentially 80x faster (though modern compilers often make this optimization automatically).
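As a sketch, here is the parity check expressed through the least significant bit; for non-negative integers this matches n % 2 (for negative numbers the two expressions differ, since % can return -1):

```javascript
// The least significant bit is 1 exactly when the number is odd.
function isOdd(n) {
  return (n & 1) === 1;
}
```

A single AND instruction replaces a remainder computation.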
More importantly, understanding CPU bit capabilities helps you recognize when hardware can help. Need to count 1 bits in an integer? Don't write a loop—use the popcount intrinsic. Need the position of the lowest set bit? That's one instruction on most processors.
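Plain JavaScript exposes no popcount intrinsic, but both operations can be sketched portably: Kernighan's loop (each n & (n - 1) clears the lowest set bit, so it iterates once per 1 bit) and the two's-complement idiom n & -n for isolating the lowest set bit:

```javascript
// Count the 1 bits by repeatedly clearing the lowest one.
function popcount(n) {
  let count = 0;
  while (n !== 0) {
    n &= n - 1; // drops exactly one set bit per iteration
    count++;
  }
  return count;
}

// Isolate the lowest set bit: in two's complement, -n flips all bits
// above the lowest 1 and keeps that 1 in place.
function lowestSetBit(n) {
  return n & -n;
}
```

In compiled languages, intrinsics like GCC/Clang's `__builtin_popcount` map these to the single hardware instructions the text describes.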
Modern compilers optimize many arithmetic operations to bitwise equivalents. However, they can't optimize what they can't recognize. Using explicit bit manipulation communicates your intent clearly and guarantees the efficient implementation.
One of the most powerful aspects of working at the bit level is information density. Because each bit can represent two states, n bits can represent 2ⁿ distinct values. This exponential relationship has profound implications:
| Bits | Distinct Values | Example Application |
|---|---|---|
| 1 | 2 | Boolean: true/false |
| 2 | 4 | Direction: N/S/E/W |
| 3 | 8 | Days of week (one value unused) |
| 4 | 16 | Hex digit (0-9, A-F) |
| 8 | 256 | ASCII character, pixel channel |
| 16 | 65,536 | Unicode BMP, audio sample |
| 32 | ~4.3 billion | IPv4 address, standard int |
| 64 | ~18 quintillion | Unique IDs, pointers, timestamps |
Practical power: subset representation
Consider a set of n elements. How many subsets are there? Exactly 2ⁿ—each element is either included or not. This maps perfectly to n bits:
With 32 bits, we can represent any subset of a 32-element set. With 64 bits, any subset of 64 elements. Set operations become bitwise operations:
- Union: A | B (OR)
- Intersection: A & B (AND)
- Complement: ~A (NOT)
- Difference: A & ~B (AND NOT)
- Symmetric difference: A ^ B (XOR)

These set operations execute in O(1) time, regardless of how many elements are in the sets. This is the basis of bitmask techniques we'll explore extensively later.
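A sketch of the subset-as-bitmask idea in JavaScript, encoding subsets of {0, 1, …, 31} in single 32-bit integers (element i is in the set when bit i is 1):

```javascript
const A = (1 << 0) | (1 << 2) | (1 << 3); // {0, 2, 3} -> 0b01101
const B = (1 << 2) | (1 << 4);            // {2, 4}    -> 0b10100

const union        = A | B;  // {0, 2, 3, 4}
const intersection = A & B;  // {2}
const difference   = A & ~B; // {0, 3}
const symmetric    = A ^ B;  // {0, 3, 4}
const hasElem3     = (A & (1 << 3)) !== 0; // membership test: true
```

Every one of these "set operations" is a single instruction, no matter how many elements the sets contain.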
Bit-level density is powerful but comes with constraints. You need fixed, known sizes. You need operations that map to bitwise logic. When these conditions hold, bit manipulation offers unmatched efficiency. When they don't—when you need arbitrary precision or complex relationships—other approaches are needed.
Understanding bits at the hardware level directly informs algorithm design. Here's how the connections manifest:
Example: turning hardware knowledge into code
Suppose you need to check if a number is a power of 2. The naive approach:
```javascript
function isPowerOfTwo(n) {
  if (n <= 0) return false;
  while (n > 1) {
    if (n % 2 != 0) return false; // any odd factor disqualifies
    n = n / 2;
  }
  return true;
}
```
This loops O(log n) times with division in each iteration. Now, knowing that a power of 2 has exactly one bit set, and that n - 1 flips that bit to 0 and sets every lower bit to 1, the expression n & (n - 1) is zero exactly when n has a single set bit.
We can write:
```javascript
function isPowerOfTwo(n) {
  return n > 0 && (n & (n - 1)) == 0;
}
```
O(1) time, single-cycle operation. Hardware knowledge → algorithmic advantage.
Once you internalize how bits work at the hardware level, you'll start recognizing when problems have bit-level solutions. This recognition—seeing that 'power of 2 check' has a one-bit-set property—is the key skill. The specific trick follows naturally.
Let's consolidate what we've learned about bits as the foundation of computation:

- Binary is a physical necessity, not a mathematical preference: two well-separated states make switching reliable, fast, and cheap.
- Transistors implement logic gates, and gates combine into circuits like the ALU.
- Bitwise operations map directly to hardware and complete in a single cycle across every bit of a word.
- Each bit position carries a power-of-2 weight; shifts multiply and divide, and individual bits can be manipulated independently.
- Fixed-width types (like uint32_t and uint64_t) make bit manipulation predictable and portable.
- n bits represent 2ⁿ distinct values, which makes bitmasks a compact O(1) encoding for sets and states.
What's next:
With the foundational understanding of what bits are and how hardware handles them, we're ready to tackle a crucial question: how do we represent negative numbers? The next page explores positive and negative number representation—unsigned versus signed integers, the challenge of representing negatives, and the various schemes computers have employed to solve this fundamental problem.
You now understand bits at both the conceptual and hardware levels. This foundation enables everything that follows—from understanding two's complement to mastering advanced bit manipulation techniques. Next, we explore how to represent both positive and negative numbers.