Converting latin-1 encoded UTF-8 string in Python

Question

I'm using a Python 2.x-library email to iterate over some .eml-files, but I have Python 3.x installed.

I extract the filename in the header of each payload (attachment) using .get_filename(). Encoding is not set in the header and thus I believe Python 3.x interprets the returned string as utf-8. The string however looks like this, when it contains special characters, e.g. like "ø":

=?ISO-8859-1?Q?Sp=F8rgeskema=2Edoc?=

I have failed in numerous ways to convert this string into utf-8 making it into bytes or not and de- and encoding using latin-1, ISO-8859-1 (should be the same though) and utf-8.

I've also tried using:

ast.literal_eval(r"b'=?ISO-8859-1?Q?Sp=F8rgeskema=2Edoc?='")

and decoding that, but it still returns the original string containing the encoded characters.

How do one go about this?

Converting latin-1 encoded UTF-8 string in Python

Answers (1)

Related Questions