How do I create the padded message for use in the SHA256-2 algorithm?

Question

I am trying to understand the SHA-2 algorithm. And it seems that its a bit vague on how people are encoding the message 'L' (see wikipedia's SHA256-2 pseudo code). Is the message encoded in ASCII, UTF-8, or UTF-16? I understand that technically message L could be anything that we decide before encrypting but I want to check my little test program with other sites like https://www.dcode.fr/sha256-hash and I realize I can't even check anything (except the empty "") without knowing if we are padding the '1' and subsequent '0's to the 9 bit representations for the message or 16 bit representations for the message. If I use the ASCII (which in this case is the same as UTF-8) for the word 'dcode' I am expecting the message to start with the following binary sequence: d:01100100:UTF-8:100 c:01100011:UTF-8:99 o:01101111:UTF-8:111 d:01100100:UTF-8:100 e:01100101:UTF-8:101 0110010001100011011011110110010001100101

can someone verify that I'm thinking of this correctly? And as a side benefit if you know where the standard that says the pre-hashed message should be UTF-8 or UTF-16 (presumably for specific applications) it would be much appreciated.

This answer is close but lacks specificity in its answer

How can i pad the message in sha family

How do I create the padded message for use in the SHA256-2 algorithm?

Answers (1)

Related Questions