Ruby, pack encoding (ASCII-8BIT that cannot be converted to UTF-8)

Question

puts "C3A9".lines.to_a.pack('H*').encoding

results in

ASCII-8BIT

but I prefer this text in UTF-8. But

"C3A9".lines.to_a.pack('H*').encode("UTF-8")

results in

`encode': "\xC3" from ASCII-8BIT to UTF-8 (Encoding::UndefinedConversionError)

why? How can I convert it to UTF-8?

mu is too short · Accepted Answer

You're going about this the wrong way. If you have URI encoded data like this:

%C5%BBaba

Then you should use URI.unescape to decode it:

1.9.2-head :004 > URI.unescape('%C5%BBaba')
 => "Żaba"

If that doesn't work then force the encoding to UTF-8:

1.9.2-head :004 > URI.unescape('%C5%BBaba').force_encoding('utf-8')
 => "Żaba"

Answers (2)