Reputation: 443
I noticed that the hash codes I got from other objects were different depending on whether I built for x86 or x64. Up until now I have implemented most of my own hashing functions like this:
int someIntValueA;
int someIntValueB;
const int SHORT_MASK = 0xFFFF;

public override int GetHashCode()
{
    // Pack the low 16 bits of A into the low half and the low 16 bits of B into the high half.
    return (someIntValueA & SHORT_MASK) + ((someIntValueB & SHORT_MASK) << 16);
}
Will storing the values in a long and getting the hashcode from that give me a wider range as well on 64-bit systems, or is this a bad idea?
public override int GetHashCode()
{
    // Cast to long before shifting; shifting an int by 32 would be a no-op (shift counts wrap at 31).
    long maybeBiggerSpectrumPossible = someIntValueA + ((long)someIntValueB << 32);
    return maybeBiggerSpectrumPossible.GetHashCode();
}
Upvotes: 1
Views: 347
Reputation: 113352
The approach you suggest won't make anything any better (quite the opposite).
However…
SpookyHash, for example, is designed to work particularly quickly on 64-bit systems, because when working out the math the author was thinking about what would be fast on a 64-bit system. xxHash has 32-bit and 64-bit variants that are designed to give comparable hash quality at better speed for 32-bit and 64-bit computation respectively.
The general idea of making use of the different performance of different arithmetic operations on different machines is a valid one.
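For illustration, picking a variant by bitness could look something like this (just a sketch; it assumes the System.IO.Hashing NuGet package for the xxHash implementations, and HashBlob is a made-up helper name, not anything from the libraries mentioned above):

static int HashBlob(byte[] data)
{
    // Pick the xxHash variant tuned for the platform's native word size.
    byte[] hash = IntPtr.Size == 8
        ? System.IO.Hashing.XxHash64.Hash(data)  // 64-bit variant, faster on x64
        : System.IO.Hashing.XxHash32.Hash(data); // 32-bit variant, a better fit for x86
    return BitConverter.ToInt32(hash, 0);        // keep 32 bits for GetHashCode-style use
}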
And your general idea of making use of larger intermediate storage in the hash calculation is also a valid one, as long as those extra bits make their way into subsequent operations.
So at a very general level, the answer is yes, even if your particular implementation fails to come through with that.
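As a sketch of what "wider intermediate storage whose extra bits feed back into the result" could look like for the two fields in your question (the FNV-style constants are just an illustrative choice of mine, not something your implementation has to use):

public override int GetHashCode()
{
    unchecked
    {
        // Mix both ints into a 64-bit accumulator (FNV-1a-style, a word at a time)...
        ulong h = 14695981039346656037UL;                // 64-bit FNV offset basis
        h = (h ^ (uint)someIntValueA) * 1099511628211UL; // 64-bit FNV prime
        h = (h ^ (uint)someIntValueB) * 1099511628211UL;
        // ...then fold the upper 32 bits back in so they actually affect the result.
        return (int)(h ^ (h >> 32));
    }
}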
Now, in practice, when you're sitting down to write a hashcode implementation should you worry about this?
Well, it depends. For a while I was very bullish about using algorithms like SpookyHash, and it does very well (even on 32-bit systems) when the hash is based on a large amount of source data. But on the other hand it can be better, especially when used with smaller hash-based sets and dictionaries, to be crappy really fast than fantastic slowly. So there isn't a one-size-fits-all answer. With just two input integers, your initial solution is likely to beat a super-avalanchey algorithm like xxHash or SpookyHash for many uses. You could perhaps do better if you also had a >> 16 so that the shift becomes a rotate (fun fact, some jitters are optimised for that), but we're not touching on 64- vs 32-bit versions in that at all.
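For what it's worth, a rotate-based combiner could look roughly like this (a sketch only; System.Numerics.BitOperations needs .NET Core 3.0 or later, and on older frameworks you'd write the rotate by hand as (x << 16) | (x >> 16)):

public override int GetHashCode()
{
    unchecked
    {
        // Rotating keeps all 32 bits of B in play instead of discarding half of them,
        // and some jitters compile RotateLeft down to a single rotate instruction.
        uint a = (uint)someIntValueA;
        uint b = System.Numerics.BitOperations.RotateLeft((uint)someIntValueB, 16);
        return (int)(a ^ b);
    }
}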
The cases where you do find a big possible improvement from taking a different approach on 64-bit and 32-bit are where there's a large amount of data to mix in, especially if it's in a blittable form (like string or byte[]) that you can access via a long* or int* depending on framework.
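A rough sketch of that shape for a byte[], reading eight bytes at a time through a pointer (unsafe code; HashBytes and the FNV-style constants are illustrative choices of mine, and a real version might switch between long* and int* reads based on bitness, which is the point above):

public static unsafe int HashBytes(byte[] data)
{
    unchecked
    {
        ulong h = 14695981039346656037UL; // arbitrary 64-bit seed
        fixed (byte* p = data)
        {
            int i = 0;
            // Mix eight bytes at a time while the data allows it
            // (assumes unaligned reads are acceptable on the target platform)...
            for (; i + sizeof(ulong) <= data.Length; i += sizeof(ulong))
                h = (h ^ *(ulong*)(p + i)) * 1099511628211UL;
            // ...then finish the trailing bytes one at a time.
            for (; i < data.Length; i++)
                h = (h ^ p[i]) * 1099511628211UL;
        }
        // Fold the upper half into the final 32-bit hash.
        return (int)(h ^ (h >> 32));
    }
}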
So, generally you can ignore the question of bitness, but if you find yourself thinking "this hashcode has to go through so much stuff to get an answer; can I make it better?" then maybe it's time to consider such matters.
Upvotes: 1
Reputation: 660455
No, that will be far worse.
Suppose your int values are typically in the range of a short: between -30000 and +30000. And suppose further that most of them are near the middle, say, between 0 and 1000. That's pretty typical. With your first hash code you get all the bits of both ints into the hash code and they don't interfere with each other; the number of collisions is zero under typical conditions.
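For instance, a quick spot check of your mask-and-shift combiner with a couple of arbitrary small values:

// A occupies the low 16 bits, B the high 16 bits, so small distinct pairs can't collide:
Console.WriteLine((5 & 0xFFFF) + ((7 & 0xFFFF) << 16)); // 458757 (0x00070005)
Console.WriteLine((7 & 0xFFFF) + ((5 & 0xFFFF) << 16)); // 327687 (0x00050007)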
But when you do your trick with a long, then you rely on what the long implementation of GetHashCode does, which is to xor the upper 32 bits with the lower 32 bits. So your new implementation is just a slow way of writing int1 ^ int2, which in the typical scenario has almost all zero bits, and hence collisions all over the place.
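A couple of hypothetical pairs make the collision problem obvious:

// With typical small values, int1 ^ int2 collapses many distinct pairs onto one code:
Console.WriteLine(3 ^ 3);     // 0 -- every (n, n) pair hashes to 0
Console.WriteLine(250 ^ 250); // 0
Console.WriteLine(5 ^ 7);     // 2
Console.WriteLine(7 ^ 5);     // 2 -- swapping the operands collides as well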
Upvotes: 7