I’ve recently come across a very interesting article on BitHacks – the low level magics for bit level operations. Some of the tricks introduced here are really excellently clever, some of them may even make you exclaim for their genius!
I had a lot of fun reading through some of the BitHacks. It’s also worth noticing these BitHacks are not only for intellectual pleasures, they provide actual boosts to algorithm performance as well. When an operations is used often enough, the overall performance benefits to the whole program might be significant.
I couldn’t help but keep wondering how on earth did these clever CS guys ever come up with such algorithms. I tried very hard to find some answers and the following are some patterns I noticed in this attempt. Still, honestly, I highly doubt if I can come up with same solutions myself if I ever run into these problems again. Some of them are just to clever.
The beauty part of bit hackings is that most of the times they avoid branching in CPU, which could be expensive for modern pipelining CPUs, as a misprediction in branching means flushing operations, causing waste in time and power.
For example, to compute the sign of an integer, instead of using branching, we can use the following code:
int v; // the integer
// One interesting feature of an integer to the power of 2:
The effect shall be the same as
max = x > y ? x : y;
Somehow the above approach also use branch to determine value. A BitHack way is to:
result = x ^ ((x ^ y) & -(x < y));
Amazing! Isn’t it? Here when x < y, -(x < y) evaluates to -1, which is all 1s in binary representation. The result will then evaluates to:
x ^ (x ^ y)
Which is y. While when x > y, -(x < y) evaluates to 0, and the result will be assigned x.
This is a very interesting feature of the XOR operation. Remember how XOR could be used to exchange the value of two numbers without extra memory:
a = a ^ b;
XOR has many interesting features, and is a very important operations in BitHacks. Here’s another example.
mask = v >> (sizeof(int) * CHAR_BIT - 1);
When v is positive, mask is 0, result will be assigned v. And when v is negative, mask will evaluate to -1 (all 1s in binary), and the result will be (v - 1) ^ (-1). As XORing all 1s gives the NOT of an integer (v ^ -1 == ~v), the result becomes ~(v - 1). Not surprisingly, this is the negative value of v.
An alternative but similar approach is:
result = (v ^ mask) - mask;
I believe some general guidelines could be useful for inventing our own BitHacks could be useful, but the post did not mention any special techniques, and some of the hacks seems really ad-hoc (sure, that’s why they’re called “hacks”, right?). Nevertheless, the following is some observations I had when reading. Keeping these in mind might help the next time when inventing BitHacks.
Use AND to clear the bit field, OR to set the bit field, and use shift to move important bit to the right position. Use a carefully designed mask to clear or set the fields.
An excellent example could be found in here: counting bit set, in parallel.
In the example, instead of iterating all the bit fields in the given integer, we could use a mask to mask the fields, and merge the all the counts, a bit like reduce in the map-reduce. In this way, no loops are required to count the bit sets.
The operations could be explained as follows:
// first, I randomly pick a 32-bit integer to count bits set
The XOR operation has many interesting features which could be used cleverly in BitHacks.
int value, other;
The previous, and some following examples all uses these features to efficiently “hack the bits”. For example: exchanging values without extra memory, finding absolute value, finding the min or max of two values.
In the previous example: Absolute value of an integer, a mask was used to conditionally find the original and the negative value, by setting the mask to either 0 or all 1s, which is -1 in integer representation. The negative value of v could be found by very simple operations:
v = ~v + 1
Taking advantage of this property, we can also come up with a way to conditionally negate a value:
result = (v ^ -flag) + flag;
A lot of complicated operations could be accomplished by using lookup tables, one example could be the counting bits set as above. Some other examples includes:
- Compute parity by lookup table
- Reverse bits lookup table
- Find the log base 2 of an integer with a lookup table
Although it might take up more space for memory, it’s often worthwhile to trade some amount of memory for speed. Memory operations may be more expensive, but considering prefetching is now prevalent in modern CPUs, fetching data from the cache is way faster than a misprediction in branch operations.
There are many other interesting and mind-opening techniques, tricks and hacks to speed up your program in this post. I have a hunch that I might actually use some of them in the future, or come up with my own BitHacks with similar mindset. Before this, I didn’t even realize I could play with bit operations in C/C++ this way. Also, this post could serve as a great reference when a performance critical part needs optimization with BitHacks.