Constrained counting sets on binary codes

Question

Problem formulation: Given a binary code of length l, split it into consecutive sets of t bits (l % t = 0). For every set that contains at least one bit of value 1, we add 1 to the result.

My question: how can the final result be computed efficiently?

For example, take the binary code 010 110 000 and t=3. The final result is 2. The set 000 has no bit of value 1, so it adds nothing. The set 110 has at least one bit of value 1, so we add 1 to the result. The set 010 also has one bit of value 1, so we add 1 again. Thus, the final result is 2.

Specifically, I want to solve this without scanning through every set of t bits, which takes time linear in the length of the binary code.

For the counting bits set problem (counting how many 1s are in a binary code), there are algorithms that run in constant time for a fixed word size, using a limited number of mask and shift operations, such as the MIT HAKMEM Count algorithm.

However, existing algorithms for the traditional counting set problem cannot be used to solve my problem.

Does anyone know a trick for this problem? You can assume a maximum length for the input binary code if that makes the problem easier to solve.

Andreas · Accepted Answer

From comment:

Then you can assume that the input is a binary code of 64-bit integer for this problem.

Here are two different ways to do it.

The first runs in O(l/t) and works by testing whether each set is all 0-bits.

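public static int countSets(int setSize, long bits) {
    // Java uses only the low 6 bits of a long shift count, so a shift by 64
    // is a shift by 0. Treat a 64-bit set as one set to avoid that.
    if (setSize >= 64)
        return bits != 0 ? 1 : 0;
    long mask = (1L << setSize) - 1;  // low setSize bits, e.g. 111 for setSize = 3
    int count = 0;
    // Test the lowest set, then shift the next set into place.
    for (long b = bits; b != 0; b >>>= setSize)
        if ((b & mask) != 0)
            count++;
    return count;
}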
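As a quick check, the question's example can be fed through the first method. This is only a sketch: the class name `CountSetsDemo` and the helper `parseBits` are made up for illustration, and it assumes `setSize` is below 64 and that the input fits in 63 bits (`Long.parseLong` rejects values with the top bit set; `Long.parseUnsignedLong` would be needed for those).

```java
final class CountSetsDemo {
    // Same loop as the first method, with a primitive long mask.
    // Assumes 1 <= setSize < 64.
    static int countSets(int setSize, long bits) {
        long mask = (1L << setSize) - 1;
        int count = 0;
        for (long b = bits; b != 0; b >>>= setSize)
            if ((b & mask) != 0)
                count++;
        return count;
    }

    // Hypothetical helper: drops the spaces so "010 110 000" parses as base 2.
    static long parseBits(String text) {
        return Long.parseLong(text.replace(" ", ""), 2);
    }

    public static void main(String[] args) {
        long bits = parseBits("010 110 000");
        System.out.println(countSets(3, bits)); // prints 2 (sets 010 and 110)
    }
}
```

Because the loop stops as soon as the remaining value is 0, leading all-zero sets cost nothing.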

The second runs in O(t + log(l/t)) and works by collapsing the bits of each set into the right-most bit of the set, then counting the bits set.

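public static int countSets(int setSize, int bitCount, long bits) {
    long b = bits, mask = 1;
    // After this loop, bit p of b is the OR of bits p .. p+setSize-1.
    for (int i = 1; i < setSize; i++)
        b |= bits >>> i;
    // Doubling builds a mask with a 1 at every multiple of setSize.
    for (int i = setSize; i < bitCount; i <<= 1)
        mask |= mask << i;
    return Long.bitCount(b & mask);
}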
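To gain some confidence in the second method, it can be compared against a plain per-set check. The sketch below is illustrative only: `CollapseDemo` and `countSetsNaive` are hypothetical names, and it assumes the input has no 1-bits at or above position `bitCount`, since the doubled mask can extend past `bitCount`.

```java
final class CollapseDemo {
    // The second method, unchanged apart from comments.
    static int countSets(int setSize, int bitCount, long bits) {
        long b = bits, mask = 1;
        for (int i = 1; i < setSize; i++)
            b |= bits >>> i;              // bit p ORs bits p .. p+setSize-1
        for (int i = setSize; i < bitCount; i <<= 1)
            mask |= mask << i;            // 1 at every multiple of setSize
        return Long.bitCount(b & mask);
    }

    // Hypothetical reference: checks one set at a time, for comparison only.
    // Assumes 1 <= setSize < 64.
    static int countSetsNaive(int setSize, int bitCount, long bits) {
        int count = 0;
        for (int pos = 0; pos < bitCount; pos += setSize) {
            long set = (bits >>> pos) & ((1L << setSize) - 1);
            if (set != 0)
                count++;
        }
        return count;
    }

    public static void main(String[] args) {
        System.out.println(countSets(3, 9, 0b010_110_000L)); // prints 2

        // Compare both versions on pseudo-random 64-bit inputs with 4-bit sets.
        java.util.Random rnd = new java.util.Random(42);
        for (int n = 0; n < 1000; n++) {
            long bits = rnd.nextLong();
            if (countSets(4, 64, bits) != countSetsNaive(4, 64, bits))
                throw new AssertionError(Long.toBinaryString(bits));
        }
    }
}
```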

Further explanation

The first method builds a mask for the set, e.g. with t = 3, the mask is 111.

It then shifts the value to the right, t bits at a time, e.g. with input = 010 110 000 and t = 3, you get:

mask = 111
b = 010 110 000  ->  b & mask = 000  ->  don't count
b =     010 110  ->  b & mask = 110  ->  count
b =         010  ->  b & mask = 010  ->  count
result: 2

The second method first merges the bits of each set into the right-most bit of that set, e.g. with input = 010 110 000 and t = 3, you get:

bits       = 010 110 000
bits >>> 1 = 001 011 000
bits >>> 2 = 000 101 100
b (OR'd)   = 011 111 100

It then builds a mask that keeps only the right-most bit of each set, applies the mask, and counts the bits set:

mask     = 001 001 001
b & mask = 001 001 000
result: 2
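The merge step in the second method uses one shift per bit position in a set, i.e. t-1 shifts. If that matters, the merge can also be done with doubling shifts, in roughly log2(t) steps. The following is a sketch of that variant, not part of the original answer: the names `LogCollapse` and `countSetsLog` are hypothetical, and it assumes a 64-bit input with sets aligned from bit 0.

```java
final class LogCollapse {
    // Hypothetical variant: collapses each set with doubling shifts instead
    // of one shift per bit position.
    static int countSetsLog(int setSize, long bits) {
        if (setSize < 1 || setSize > 64)
            throw new IllegalArgumentException("setSize must be in 1..64");
        if (setSize == 64)
            return bits != 0 ? 1 : 0;    // one set; a shift by 64 would wrap in Java

        long b = bits;
        int w = 1;                        // invariant: bit p of b ORs bits p .. p+w-1
        while (2 * w <= setSize) {
            b |= b >>> w;                 // window width doubles
            w *= 2;
        }
        if (w < setSize)
            b |= b >>> (setSize - w);     // overlapping shift widens window to setSize

        long mask = 1;                    // 1 at every multiple of setSize
        for (int i = setSize; i < 64; i <<= 1)
            mask |= mask << i;
        return Long.bitCount(b & mask);
    }

    public static void main(String[] args) {
        System.out.println(countSetsLog(3, 0b010_110_000L)); // prints 2
    }
}
```

The final overlapping shift is what keeps the window at exactly t bits when t is not a power of two. Doubling past t would pull bits from the neighbouring set into the result.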
