Reputation: 1281
I want to attach a UUID to a field in my protobuf User message example.
message User {
// field containing id as UUID type
required string email;
optional string name;
}
I know that protobuf messages do not yet support the UUID type. I've read that the best approach is to have a UUID message type.
So I'm guessing my User message would import my UUID message proto definition and use it as a field type like so:
import "myproject/UUID.proto";
message User {
required UUID id;
required string email;
optional string name;
}
My question is, what will the UUID message look like, and how will I encode/decode it? I'm aiming for Java/Scala and C# compatibility.
Upvotes: 89
Views: 89203
Reputation: 999
Use a string, not a byte array unlike what some other commenters are saying. According to MS (https://learn.microsoft.com/en-us/dotnet/architecture/grpc-for-wcf-developers/protobuf-data-types), "Don't use a bytes field for Guid values. Problems with endianness (Wikipedia definition) can result in erratic behavior when Protobuf is interacting with other platforms, such as Java."
Upvotes: 16
Reputation: 5169
I would suggest to use string encoding not byte encoding if you want to ensure straight forward interoperability:
message UUID {
required string value = 1;
}
The problem with the bytes encoding is: Different UUID libraries use different encoding/decoding schemes for bytes while they agree how to encode/decode strings.
For example see the C#'s System.guid.toBytesArray
returns a mixed-endian format: the first three components are little-endian encoded while the last two are big-endian encoded.
In Java, the Apache Commons Library Uuid.toRawBytes
returns the uuid in big-endian encoding:
"String": 35918bc9-196d-40ea-9779-889d79b753f0
"C#" : C9 8B 91 35 6D 19 EA 40 97 79 88 9D 79 B7 53 F0
"Java" : 35 91 8B C9 19 6D 40 EA 97 79 88 9D 79 B7 53 F0
As a side note: Python 3's Uuid
provides both encodings: bytes
for the big-endian encoding and bytes_le
for the mixed-endian encoding.
Upvotes: 12
Reputation: 193
If anything, you want to use string
to avoid problems with endianness. Note that a UUID and a MS GUID that have the same string representation (and therefore are the same "id") have, however, different byte-stream order (big-endian vs little-endian). If you use bytes
in the protocol to communicate between Java using UUID and C# using System.Guid, you could end up with flipped IDs.
Upvotes: 12
Reputation: 45296
You should probably use string
or bytes
to represent a UUID. Use string
if it is most convenient to keep the UUID in human-readable format (e.g. "de305d54-75b4-431b-adb2-eb6b9e546014"
) or use bytes
if you are storing the 128-bit value raw. (If you aren't sure, you probably want string
.)
Wrapping the value in a message type called UUID
can be helpful to make the code more self-documenting but will have some performance overhead and isn't strictly required. If you want to do this, define the type like:
message UUID {
required string value = 1;
}
or:
message UUID {
required bytes value = 1;
}
Upvotes: 85