Breaking down an integer into other unique integers

Question

I'm using a (numpy) array of integers to log potential problems with an array of data. The concept is that each error type has its own integer value, and that these are set so that

err1 = 1
err2 = 2 ** 1
err3 = 2 ** 2
...
errx = 2 ** x

This way, I figure, I can add these error types to the integer logging array, and still know what combination of errors made up that value; so if the end array has a value of 7, I know it must have be made of up 1 + 2 + 4 - ie, err1, err2, and err3.

This all seemed very clever at the time, but I now need to produce a boolean array telling me which cells have logged a given error; so, for example, if I have an error array of

test_arr = np.array(
    [[1, 5, 19],
     [3, 4, 12]]
)

I'd like to get the result

test_contains_err3 = np.array(
    [[False, True, False],
     [False, True, True]]
)

Because the value 4 has gone into making up the values 5 and 4, but not any of the others. I've developed an iterative solution for single values, but that then doesn't work well for a vectorized calculation (the actual array is quite large). Can any one please suggest something? I have a feeling that there's something simpler here that I'm not seeing.

Thanks in advance!

Simas Joneliunas · Accepted Answer

You should look into bitwise operations. That would allow you to encode multiple different numbers in a single joined value, for example the output of the following snippet

a = (3 << 24) + (8 << 16) + 5 
print (a)

print(a>>24 & 0xf)
print(a>>16 & 0xf)
print(a & 0xf)

would look like this:

Now if you play around with it, you can encode as many variables as you want as long as you make sure to give each variable enough bits to cover the maximum possible value for that variable - an overflow of a single variable would corrupt your data.

Now when you need to compare which errors have been fired, you have to run a check against bitmask (location) of a particular error and you will easily know whether that particular error has been registered.

It seems to me that for your problem you would only need to know which errors have occurred and don't need to save the error codes. You can then employ a simplified scenario where you would reserve 1 bit per error and a bit->error map in code.

Finally, when you want to display which errors were triggered, you simply need to take the binary value of the encoded number and convert 1's to True and 0's to False.

Breaking down an integer into other unique integers

Answers (2)

Related Questions