Common Bit Manipulation Techniques

labuladongOriginalAbout 3044 words

This article will resolve

LeetCode	Difficulty
136. Single Number	🟢
191. Number of 1 Bits	🟢
231. Power of Two	🟢
268. Missing Number	🟢

Bit Manipulation can involve numerous tricks. There's a website called Bit Twiddling Hacks that has collected nearly all the advanced techniques in bit manipulation. The website is:

http://graphics.stanford.edu/~seander/bithacks.html

However, most of these techniques are quite obscure. I believe they can be used as a reference dictionary, and there is no need to delve into each one. Yet, I think mastering interesting and useful bit manipulation techniques is essential for everyone.

Therefore, this article starts with some interesting (though not very useful) bit manipulation techniques, and then summarizes some commonly used techniques in algorithm problems and engineering development.

1. Some Interesting Bit Manipulations

// Use the OR operation `|` and space to convert English characters to lowercase
('a' | ' ') = 'a'
('A' | ' ') = 'a'

// Use the AND operation `&` and underscore to convert English characters to uppercase
('b' & '_') = 'B'
('B' & '_') = 'B'

// Use the XOR operation `^` and space to toggle the case of English characters
('d' ^ ' ') = 'D'
('D' ^ ' ') = 'd'

// The reason the above operations produce peculiar effects is due to ASCII encoding
// ASCII characters are actually numbers, and the numbers corresponding
// to spaces and underscores can change the case through bit operations
// Readers interested can check the ASCII table to
// calculate it themselves, I won't elaborate here


// Swap two numbers without using a temporary variable
int a = 1, b = 2;
a ^= b;
b ^= a;
a ^= b;
// Now a = 2, b = 1


// Increment by one
int n = 1;
n = -~n;
// Now n = 2


// Decrement by one
int n = 2;
n = ~-n;
// Now n = 1


// Determine if two numbers have opposite signs
int x = -1, y = 2;
boolean f = ((x ^ y) < 0); // true

int x = 3, y = 2;
boolean f = ((x ^ y) < 0); // false

If the first six techniques are not very useful, the seventh technique is quite practical. It utilizes the sign bit of two's complement encoding. The highest bit in integer encoding is the sign bit: negative numbers have a sign bit of 1, non-negative numbers have a sign bit of 0. Using the properties of XOR, you can determine if two numbers have opposite signs.

Of course, if you don't use bitwise operations to check for opposite signs, you need to use if-else branches, which can be quite cumbersome. You might consider using the product to check if two numbers have opposite signs, but this approach can easily lead to integer overflow, resulting in errors.

Usage of `index & (arr.length - 1)`

In Monotonic Stack Problem Solving Pattern, I've introduced circular arrays, which essentially use modulo (remainder) to make the array appear as a continuous loop, never-ending:

java

int[] arr = {1,2,3,4};
int index = 0;
while (true) {
    // looping in a circular array
    print(arr[index % arr.length]);
    index++;
}
// output: 1,2,3,4,1,2,3,4,1,2,3,4...

cpp

int arr[] = {1, 2, 3, 4};
int index = 0;
while (true) {
    // loop through the circular array
    cout << arr[index % (sizeof(arr) / sizeof(int))] << endl;
    index++;
}
// output: 1,2,3,4,1,2,3,4,1,2,3,4...

python

arr = [1,2,3,4]
index = 0
while True:
    # Loop through the circular array
    print(arr[index % len(arr)])
    index += 1
# Output: 1,2,3,4,1,2,3,4,1,2,3,4...

arr := []int{1, 2, 3, 4}
index := 0
for {
    // Looping in a circular array
    fmt.Print(arr[index%len(arr)])
    index++
}
// Output: 1,2,3,4,1,2,3,4,1,2,3,4...

javascript

var arr = [1, 2, 3, 4];
var index = 0;
while (true) {
    // looping in a circular array
    console.log(arr[index % arr.length]);
    index++;
}
// output: 1,2,3,4,1,2,3,4,1,2,3,4...

However, modulo operation % is actually a relatively expensive operation for computers, so we can use the & operation to find the remainder:

java

int[] arr = {1,2,3,4};
int index = 0;
while (true) {
    // loop in the circular array
    print(arr[index & (arr.length - 1)]);
    index++;
}
// output: 1,2,3,4,1,2,3,4,1,2,3,4...

cpp

vector<int> arr = {1, 2, 3, 4};
int index = 0;
while (true) {
    // Looping in a circular array
    cout << arr[index & (arr.size() - 1)] << " ";
    index++;
}
// Output: 1,2,3,4,1,2,3,4,1,2,3,4...

python

arr = [1, 2, 3, 4]
index = 0
while True:
    # Loop in a circular array
    print(arr[index & (len(arr) - 1)], end=", ")
    index += 1
# Output: 1, 2, 3, 4, 1, 2, 3, 4, 1, 2, 3, 4, ...

arr := []int{1, 2, 3, 4}
index := 0
for {
    // loop in a circular array
    fmt.Print(arr[index&(len(arr)-1)])
    index++
}
// output: 1,2,3,4,1,2,3,4,1,2,3,4...

javascript

var arr = [1,2,3,4];
var index = 0;
while (true) {
    // loop through the circular array
    console.log(arr[index & (arr.length - 1)]);
    index++;
}
// output: 1,2,3,4,1,2,3,4,1,2,3,4...

Important

Note that this technique only works when the array length is a power of 2, such as 2, 4, 8, 16, 32, and so on. As for how to expand the array length to a power of 2, there are some clever bit manipulation algorithms, which can be referenced at https://graphics.stanford.edu/~seander/bithacks.html#RoundUpPowerOf2

Simply put, the bitwise operation & (arr.length - 1) can replace the modulo operation % arr.length, providing better performance.

Now the question arises, if you are continuously index++, you achieve circular iteration. But can you still achieve the effect of a circular array if you continuously index--?

The answer is, if you use the % modulo method, when index becomes less than 0, the modulo result can also be negative, requiring special handling. However, with the & bitwise operation, index won't become negative and can still work normally:

java

int[] arr = {1,2,3,4};
int index = 0;
while (true) {
    // looping in a circular array
    print(arr[index & (arr.length - 1)]);
    index--;
}
// output: 1,4,3,2,1,4,3,2,1,4,3,2,1...

cpp

int arr[] = {1,2,3,4};
int index = 0;
while (true) {
    // Loop in a circular array
    cout << arr[index & (sizeof(arr) / sizeof(*arr) - 1)] << " ";
    index--;
}
// Output: 1,4,3,2,1,4,3,2,1,4,3,2,1...

python

arr = [1, 2, 3, 4]
index = 0

while True:
    # Looping in a circular array
    print(arr[index & (len(arr) - 1)])
    index -= 1

arr := []int{1, 2, 3, 4}
index := 0
for {
    // looping in a circular array
    fmt.Print(arr[index&(len(arr)-1)])
    index--
}
// output: 1,4,3,2,1,4,3,2,1,4,3,2,1...

javascript

var arr = [1,2,3,4];
var index = 0;
while (true) {
    console.log(arr[index & (arr.length - 1)]);
    index--;
}
// Output: 1,4,3,2,1,4,3,2,1,4,3,2,1...

We generally don't use this technique in our own code, but you might often see it when studying other code libraries. Keep this in mind so you won't be confused later.

Usage of `n & (n-1)`

The operation n & (n-1) is common in algorithms and serves to eliminate the last 1 in the binary representation of the number n.

It is easy to understand with a diagram:

The core logic is that n - 1 will always remove the last 1 and turn all trailing 0s into 1s. Then, performing the & operation with n will turn only the last 1 into 0.

Calculating Hamming Weight

This is LeetCode problem 191: "Number of 1 Bits":

191. Number of 1 Bits | LeetCode | 🟢

Write a function that takes the binary representation of a positive integer and returns the number of set bits it has (also known as the Hamming weight).

Example 1:

Input: n = 11

Output: 3

Explanation:

The input binary string 1011 has a total of three set bits.

Example 2:

Input: n = 128

Output: 1

Explanation:

The input binary string 10000000 has a total of one set bit.

Example 3:

Input: n = 2147483645

Output: 30

Explanation:

The input binary string 1111111111111111111111111111101 has a total of thirty set bits.

Constraints:

1 <= n <= 2³¹ - 1

Follow up: If this function is called many times, how would you optimize it?

The problem is from LeetCode 191. Number of 1 Bits.

It requires you to return the number of 1s in the binary representation of n. Since n & (n - 1) can eliminate the last 1, you can use a loop to continuously remove 1s and count them until n becomes 0.

java

public class Solution {
    // you need to treat n as an unsigned value
    public int hammingWeight(int n) {
        int res = 0;
        while (n != 0) {
            n = n & (n - 1);
            res++;
        }
        return res;
    }
}

cpp

#include <vector>
#include <string>

class Solution {
public:
    int hammingWeight(uint32_t n) {
        int res = 0;
        while (n != 0) {
            n = n & (n - 1);
            res++;
        }
        return res;
    }
};

python

class Solution:
    def hammingWeight(self, n: int) -> int:
        res = 0
        while n != 0:
            n = n & (n - 1)
            res += 1
        return res

func hammingWeight(n int) int {
    res := 0
    for n != 0 {
        n = n & (n - 1)
        res++
    }
    return res
}

javascript

var hammingWeight = function(n) {
    let res = 0;
    while (n != 0) {
        n = n & (n - 1);
        res++;
    }
    return res;
};

Determine if a Number is a Power of Two

LeetCode Problem 231, "Power of Two", addresses this question.

A number is a power of two if its binary representation contains only one '1':

2^0 = 1 = 0b0001
2^1 = 2 = 0b0010
2^2 = 4 = 0b0100

Using the technique n & (n-1) simplifies this (note the operator precedence, parentheses cannot be omitted):

java

class Solution {
    public boolean isPowerOfTwo(int n) {
        if (n <= 0) return false;
        return (n & (n - 1)) == 0;
    }
}

cpp

class Solution {
public:
    bool isPowerOfTwo(int n) {
        if (n <= 0) return false;
        return (n & (n - 1)) == 0;
    }
};

python

class Solution:
    def isPowerOfTwo(self, n: int) -> bool:
        if n <= 0:
            return False
        return (n & (n - 1)) == 0

func isPowerOfTwo(n int) bool {
    if n <= 0 {
        return false
    }
    return (n & (n - 1)) == 0
}

javascript

var isPowerOfTwo = function(n) {
    if (n <= 0) return false;
    return (n & (n - 1)) === 0;
};

Application of `a ^ a = 0`

The properties of the XOR operation are essential to remember:

A number XORed with itself results in 0, i.e., a ^ a = 0; a number XORed with 0 results in itself, i.e., a ^ 0 = a.

Finding the Element that Appears Once

This is LeetCode Problem 136 "Single Number":

136. Single Number | LeetCode | 🟢

Given a non-empty array of integers nums, every element appears twice except for one. Find that single one.

You must implement a solution with a linear runtime complexity and use only constant extra space.

Example 1:

Input: nums = [2,2,1]
Output: 1

Example 2:

Input: nums = [4,1,2,1,2]
Output: 4

Example 3:

Input: nums = [1]
Output: 1

Constraints:

1 <= nums.length <= 3 * 10⁴
-3 * 10⁴ <= nums[i] <= 3 * 10⁴
Each element in the array appears twice except for one element which appears only once.

The problem is from LeetCode 136. Single Number.

For this problem, XOR all the numbers. Pairs of numbers will result in 0, and the single number XORed with 0 will still be itself. Therefore, the final XOR result is the element that appears only once:

java

class Solution {
    public int singleNumber(int[] nums) {
        int res = 0;
        for (int n : nums) {
            res ^= n;
        }
        return res;
    }
}

cpp

#include <vector>

class Solution {
public:
    int singleNumber(std::vector<int>& nums) {
        int res = 0;
        for (int n : nums) {
            res ^= n;
        }
        return res;
    }
};

python

class Solution:
    def singleNumber(self, nums: List[int]) -> int:
        res = 0
        for n in nums:
            res ^= n
        return res

func singleNumber(nums []int) int {
    res := 0
    for _, n := range nums {
        res ^= n
    }
    return res
}

javascript

var singleNumber = function(nums) {
    let res = 0;
    for (let n of nums) {
        res ^= n;
    }
    return res;
};

Finding the Missing Element

This is LeetCode Problem 268 "Missing Number":

268. Missing Number | LeetCode | 🟢

Given an array nums containing n distinct numbers in the range [0, n], return the only number in the range that is missing from the array.

Example 1:

Input: nums = [3,0,1]
Output: 2
Explanation: n = 3 since there are 3 numbers, so all numbers are in the range [0,3]. 2 is the missing number in the range since it does not appear in nums.

Example 2:

Input: nums = [0,1]
Output: 2
Explanation: n = 2 since there are 2 numbers, so all numbers are in the range [0,2]. 2 is the missing number in the range since it does not appear in nums.

Example 3:

Input: nums = [9,6,4,2,3,5,7,0,1]
Output: 8
Explanation: n = 9 since there are 9 numbers, so all numbers are in the range [0,9]. 8 is the missing number in the range since it does not appear in nums.

Constraints:

n == nums.length
1 <= n <= 10⁴
0 <= nums[i] <= n
All the numbers of nums are unique.

Follow up: Could you implement a solution using only O(1) extra space complexity and O(n) runtime complexity?

The problem is from LeetCode 268. Missing Number.

Given an array of length n, its indices should be [0,n), but you need to accommodate n + 1 elements [0,n]. Hence, one element will be missing. Please find this missing element.

This problem is not difficult. One might easily think of sorting the array and then traversing it to find the missing element.

Alternatively, leveraging data structures, use a HashSet to store the numbers present in the array, then traverse the numbers in [0,n] and check the HashSet to find the missing element.

The sorting solution has a time complexity of O(NlogN), while the HashSet solution has a time complexity of O(N) but requires O(N) space complexity to store the HashSet.

There is actually a particularly simple solution to this problem: the arithmetic sequence sum formula.

The problem can be understood as: there is an arithmetic sequence 0, 1, 2,..., n, with one number missing. Find this number, which is simply sum(0,1,..n) - sum(nums).

java

int missingNumber(int[] nums) {
    int n = nums.length;
    // Although the data range given by the problem is not large, we should
    // use long type to prevent integer overflow for the sake of rigor
    // Sum formula: (first term + last term) * number of terms / 2
    long expect = (0 + n) * (n + 1) / 2;
    long sum = 0;
    for (int x : nums) {
        sum += x;
    }
    return (int)(expect - sum);
}

cpp

int missingNumber(vector<int>& nums) {
    int n = nums.size();
    // Although the data range given by the problem is not large, we should
    // use long type to prevent integer overflow for the sake of rigor
    // Sum formula: (first term + last term) * number of terms / 2
    long expect = (0 + n) * (n + 1) / 2;
    long sum = 0;
    for (int x : nums) {
        sum += x;
    }
    return (int)(expect - sum);
}

python

from typing import List

def missingNumber(nums: List[int]) -> int:
    n = len(nums)
    # Although the data range given by the problem is not large,
    # it's rigorous to use long type to prevent integer overflow
    # Sum formula: (first term + last term) * number of terms / 2
    expect = (0 + n) * (n + 1) / 2
    sum_ = 0
    for x in nums:
        sum_ += x
    return int(expect - sum_)

func missingNumber(nums []int) int {
    n := len(nums)
    // Although the data range given by the problem is not large, to
    // be rigorous, use long type to prevent integer overflow
    // Sum formula: (first term + last term) * number of terms / 2
    expect := (0 + n) * (n + 1) / 2
    var sum int64
    for _, x := range nums {
        sum += int64(x)
    }
    return int(expect - sum)
}

javascript

var missingNumber = function(nums) {
    var n = nums.length;
    // Although the data range given by the problem is not large, we should
    // use long type to prevent integer overflow for the sake of rigor
    // Sum formula: (first term + last term) * number of terms / 2
    var expect = (0 + n) * (n + 1) / 2;
    var sum = 0;
    for (var i = 0; i < n; i++) {
        sum += nums[i];
    }
    return (expect - sum);
};

However, the main topic of this article is bit manipulation. Let's discuss how to use bit manipulation techniques to solve this problem.

Review the properties of the XOR operation: a number XORed with itself results in 0, and a number XORed with 0 remains unchanged.

Moreover, XOR operation is commutative and associative, meaning:

2 ^ 3 ^ 2 = 3 ^ (2 ^ 2) = 3 ^ 0 = 3

This problem can be solved cleverly using these properties. For instance, consider nums = [0,3,1,4]:

To make it easier to understand, let's assume we add an index and let each element correspond to its equal index:

After doing this, you can see that all indices and elements form pairs, except for the missing element. Now, if we find the single index 2, we find the missing element.

How do we find this lone number? By XORing all elements and indices, paired numbers will cancel out to 0, leaving the single element, achieving our goal:

java

class Solution {
    public int missingNumber(int[] nums) {
        int n = nums.length;
        int res = 0;
        // first XOR with the new added index
        res ^= n;
        // XOR with other elements and indices
        for (int i = 0; i < n; i++)
            res ^= i ^ nums[i];
        return res;
    }
}

cpp

#include <vector>
using namespace std;

class Solution {
public:
    int missingNumber(vector<int>& nums) {
        int n = nums.size();
        int res = 0;
        // first XOR with the new added index
        res ^= n;
        // XOR with other elements and indices
        for (int i = 0; i < n; i++)
            res ^= i ^ nums[i];
        return res;
    }
};

python

class Solution:
    def missingNumber(self, nums: List[int]) -> int:
        n = len(nums)
        res = 0
        # first XOR with the new added index
        res ^= n
        # XOR with other elements and indices
        for i in range(n):
            res ^= i ^ nums[i]
        return res

func missingNumber(nums []int) int {
    n := len(nums)
    res := 0
    // first XOR with the new added index
    res ^= n
    // XOR with other elements and indices
    for i := 0; i < n; i++ {
        res ^= i ^ nums[i]
    }
    return res
}

javascript

var missingNumber = function(nums) {
    let n = nums.length;
    let res = 0;
    // first XOR with the new added index
    res ^= n;
    // XOR with other elements and indices
    for (let i = 0; i < n; i++)
        res ^= i ^ nums[i];
    return res;
};

Since XOR is commutative and associative, paired numbers will always cancel out, leaving the missing element.

At this point, most common bit manipulation techniques have been covered. These techniques are easy for those who know and difficult for those who don't. There's no need to memorize them; just having an impression is sufficient.

Related Problems

You can install my Chrome extension then open the link.

LeetCode	Difficulty
1457. Pseudo-Palindromic Paths in a Binary Tree	🟠
389. Find the Difference	🟢

Common Bit Manipulation Techniques

136. Single Number https://leetcode.com/problems/single-number

191. Number of 1 Bits https://leetcode.com/problems/number-of-1-bits

231. Power of Two https://leetcode.com/problems/power-of-two

268. Missing Number https://leetcode.com/problems/missing-number

1. Some Interesting Bit Manipulations

Usage of `index & (arr.length - 1)`

Usage of `n & (n-1)`

Calculating Hamming Weight

Determine if a Number is a Power of Two

Application of `a ^ a = 0`

Finding the Element that Appears Once

Finding the Missing Element