Reputation: 415
I'm trying to clean up some strange Unicode characters left over after my HTML parsing, but my conversion still isn't producing the right text.
Original text:
raw = 'If further information is needed, don´t hesitate to contact us. Kind regards, José Ramirez.'
After encoding & decoding:
text = str(raw.encode().decode('unicode_escape'))
Current output:
'If further information is needed, donÃ\x82´t hesitate to contact us. Kind regards, JosÃ\x83© Ramirez'
Desired output:
'If further information is needed, don´t hesitate to contact us. Kind regards, José Ramirez'
Upvotes: 1
Views: 2908
Reputation: 27323
You're doing it the wrong way around. The effect of your raw.encode().decode('unicode_escape')
is the same as raw.encode('utf-8').decode('latin-1')
. What you really want is the reverse:
>>> raw.encode('latin-1').decode('utf-8')
'If further information is needed, don´t hesitate to contact us. Kind regards, José Ramirez.'
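The equivalence claimed above is easy to check: for input with no backslash escapes, decoding bytes with 'unicode_escape' maps each byte to the Unicode code point of the same number, which is exactly what Latin-1 decoding does. A quick sketch using only the standard library:

```python
raw = 'José'  # any text containing non-ASCII characters, no backslashes

# Both pipelines produce the same (wrong) result:
a = raw.encode().decode('unicode_escape')        # encode() defaults to UTF-8
b = raw.encode('utf-8').decode('latin-1')

print(a)          # 'JosÃ©' — the mojibake form
print(a == b)     # True
```

This is why 'unicode_escape' silently turns correct UTF-8 text into mojibake here rather than fixing anything.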
Your string came from someone taking UTF-8-encoded text and decoding it as if it were Latin-1.
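You can reproduce that corruption step and its inverse in a couple of lines, which makes it clear why encoding back to Latin-1 and decoding as UTF-8 restores the original:

```python
good = 'José'

# Simulate the upstream bug: UTF-8 bytes wrongly decoded as Latin-1.
bad = good.encode('utf-8').decode('latin-1')
print(bad)   # 'JosÃ©'

# Undo it: re-encode as Latin-1 to recover the original UTF-8 bytes,
# then decode those bytes correctly.
fixed = bad.encode('latin-1').decode('utf-8')
print(fixed == good)   # True
```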
If you have to deal with many different variants of mojibake (incorrectly decoded text that results in gibberish), the ftfy
package can help:
>>> import ftfy
>>> ftfy.fix_text('If further information is needed, don´t hesitate to contact us. Kind regards, José Ramirez.')
'If further information is needed, don´t hesitate to contact us. Kind regards, José Ramirez.'
Upvotes: 1