little endian to uint with undetermined numbers of bytes

Question

I am trying to write a function that takes N bytes of little-endian data and converts them into an unsigned int.

unsigned int endian_to_uint(char* buf, int num_bytes)
{
    if (num_bytes == 0)
        return (unsigned int) buf[0];

    return (((unsigned int) buf[num_bytes -1]) << num_bytes * 8) | endian_to_uint(buf, num_bytes - 1);
}

However, the value returned is roughly 256 times larger than the expected value. Why is that?

If I needed to use it for a 4-byte buffer, I would normally write:

unsigned int endian_to_uint32(char* buf)
{
    return (((unsigned int) buf[3]) << 24)
         | (((unsigned int) buf[2]) << 16)
         | (((unsigned int) buf[1]) <<  8)
         | (((unsigned int) buf[0]));
}

The recursive function I wrote should reproduce this. Is there some arithmetic error that I haven't caught?

Santosh A · Accepted Answer

The following code snippet works.

unsigned int endian_to_uint(unsigned char* buf, int num_bytes)
{
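    /* Base case: buf[0] is ORed in once more here, which is harmless. */
    if (num_bytes == 0)
        return (unsigned int) buf[0];

    /* The byte at index i belongs at bit position i * 8. */
    return (((unsigned int) buf[num_bytes - 1]) << (num_bytes - 1) * 8) | endian_to_uint(buf, num_bytes - 1);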
}

Change 1:
The function argument type was changed from char* to unsigned char*.
Reason:
Take buf[] = {0x12, 0x34, 0xab, 0xcd} as an example.
If char is signed on your platform, reading buf[3] (here buf[num_bytes - 1]) and converting it to unsigned int gives 0xffffffcd instead of just 0xcd because of sign extension. For more information, look up sign extension.
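
As a minimal sketch of the sign extension described above, the two helpers below (hypothetical names, not part of the original code) widen the same bit pattern through a signed and an unsigned byte type. The sketch assumes a 32-bit unsigned int and uses signed char explicitly, since whether plain char is signed is implementation-defined.

```c
/* Hypothetical helpers, named here only for illustration. */

/* Widens a byte held in a signed char: negative values are sign-extended. */
unsigned int widen_signed(signed char c)
{
    return (unsigned int) c;
}

/* Widens a byte held in an unsigned char: the value is zero-extended. */
unsigned int widen_unsigned(unsigned char c)
{
    return (unsigned int) c;
}
```

With these, widen_signed(-51) gives 0xffffffcd (the signed char -51 has the bit pattern 0xcd), while widen_unsigned(0xcd) gives 0xcd.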

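Change 2:
Use num_bytes - 1 when calculating the shift amount. The original code had a logic error in the number of bits to shift: every byte was shifted 8 bits too far, which is why the result was about 256 times too large. With a 4-byte buffer and a 32-bit unsigned int, the top byte was also shifted by 32 bits, which is undefined behavior in C.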
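
The same shift rule can also be written as a loop. This is only a sketch, not the method from the answer: le_bytes_to_uint is a hypothetical name, and the code assumes 8-bit bytes and ignores any bytes that would not fit in an unsigned int.

```c
#include <stddef.h>

/* Hypothetical iterative variant; the name is chosen for this sketch. */
unsigned int le_bytes_to_uint(const unsigned char *buf, size_t num_bytes)
{
    unsigned int value = 0;

    /* Bytes beyond sizeof(unsigned int) would not fit, so ignore them. */
    if (num_bytes > sizeof value)
        num_bytes = sizeof value;

    /* Byte i is the i-th least significant byte, so it moves up by 8 * i bits. */
    for (size_t i = 0; i < num_bytes; i++)
        value |= (unsigned int) buf[i] << (8 * i);

    return value;
}
```

For buf[] = {0x12, 0x34, 0xab, 0xcd} and a 32-bit unsigned int, le_bytes_to_uint(buf, 4) returns 0xcdab3412, and a length of 0 returns 0 without reading the buffer.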
