seedhead
seedhead

Reputation: 3815

UUID shortening

I need to calculate the daily number of unique users of an app.

The only way I can uniquely identify a user is via their UUID (this is externally supplied so I am forced to use it).

I know that my daily user counts are a couple of million users.

I'd like to use a bitset in Redis to do a population count, but in order for this to work, I'd need a way of narrowing my UUID so that it could comfortably fit into a long. I am aware of the potential for collisions but I am not concerned about precise numbers.

Has anyone done this in Java before? What I am after is how I could convert my UUID into something that could fit into a long.

Upvotes: 4

Views: 2739

Answers (3)

Jonas Adler
Jonas Adler

Reputation: 913

you could generate a hash of your uuids that generates ints or longs and use those for your population count.

have a look a `redis.clients.util.MurmurHash' in the jedis redis library. you can find it at https://github.com/xetorthio/jedis

*edit: sample

        UUID uuid = UUID.randomUUID();
        ByteBuffer buf = ByteBuffer.allocate(16).putLong(uuid.getMostSignificantBits()).putLong(uuid.getLeastSignificantBits());
        buf.flip();
        int useMe= MurmurHash.hash(buf, 123);

Upvotes: 3

Joshua Martell
Joshua Martell

Reputation: 7212

This is probably small enough to fit directly using the full UUID as a hash key. Approximations can also be made using less memory if that suites your needs.

Upvotes: 2

Starkey
Starkey

Reputation: 9781

There are two methods on the UUID object which might benefit you.

getLeastSignificantBits() and getMostSignificateBits(). Both return a long. Take one of these longs as your answer (or some kind of combination if you care.)

Upvotes: 3

Related Questions