Is there anyway to get around this compiler optimization in C?

Question

I want to note that, as pointed out by Olaf, the compiler is not at fault.

Disclaimer: I'm not entirely sure this behavior is due to compiler optimization.

Anyways, in C I'm trying to determine whether the n-th bit (n should be between 0 and 7, inclusive) of an 8-bit byte is 1 or 0. I initially came up with this solution:

#include 
#include 

bool one_or_zero( uint8_t t, uint8_t n ) // t is some byte, n signifies which bit
{
    return (t << (n - (n % 8) - 1)) >> 7;
}

Which, from my previous understanding, would do the following to a byte:

Suppose t = 5 and n = 2. Then the byte t can be represented as 0000 0101. I assumed that (t << (n - (n % 8) - 1)) would shift the bits of t so that t is 1010 0000. This assumption is only somewhat correct. I also assumed the next bit shift (>> 7) would shift the bits of t so that t is 0000 0001. This assumption is also only somewhat correct.

TL;DR: I thought the line return (t << (n - (n % 8) - 1)) >> 7; did this:

t is 0000 0101
The first bit shift occurs; t is now 1010 0000
The second bit shift occurs; t is now 0000 0001
t is returned as 0000 0001

Although I intend for that to happen, it does not. Instead, I have to write the following, to get my intended results:

bool one_or_zero( uint8_t t, uint8_t n ) // t is some byte, n signifies which bit
{
    uint8_t val = (t << (n - (n % 8) - 1));
    return val >> 7;
}

I know that adding uint8_t val isn't a massive performance drain. Still, I'd like to know two things:

Do I have to initialize another variable to do what I intend?
Why doesn't the one-liner do the same thing as the two-liner?

I'm under the impression that when the compiler optimizes my code, it smashes the two bit shifts together so only one occurs. This seems like a nice thing, but it doesn't "clear" the other bits as intended.

too honest for this site · Accepted Answer

That code is very complicated just to check a bit in an integer. Try the standard method:

return (t & (1U << n)) != 0;

If you have to check n is valid, add an assertion. else masking (n & 7) or modulus (n % 8) (this will be optimized by the compiler to the mask-operation) will force the shift count in a valid range. As that pattern will be recognized by many compilers, they might transform this to a single bit-test CPU instruction if available.

To avoid magic numbers, you should replace the modulus 8 by: (sizeof(t) * CHAR_BIT). That will follow any type t might have. The mask is always one less than the modulus.

Your code:

(n - (n % 8) - 1))

If n < 8 it yields a negative value (-1 precisely). Negative shifts present undefined behaviour, so anything can happen (watch out for nasal demons).

Is there anyway to get around this compiler optimization in C?

Answers (2)

Related Questions