Can I store UTF8 in C-style char array

Question

Can I safely store UTF8 string in zero terminated char * ?

I understand strlen() will not return correct information, put "storing", printing and "transferring" the char array, seems to be safe.

unwind · Accepted Answer

Yes.

Just like with ASCII and similiar 8-bit encodings before Unicode, you can't store the NUL character in such a string (the value \u+0000 is the Unicode code point NUL, very much like in ASCII).

As long as you know your strings don't need to contain that (and regular text doesn't), it's fine.

Can I store UTF8 in C-style char array

Answers (2)

Related Questions