Python: UnicodeEncodeError: 'latin-1' codec can't encode characters in position 3-4: ordinal not in range(256)

Question

I found a site which fixes my mojibake, here that uses the python package ftfy. I tried reproducing the steps given, although it seems to pre-convert the string before running the steps it gives me.

The string I am trying to fix is EvðŸ’ðŸ‘¸ðŸ», although the site seems to pre-convert it to EvÃ°Å¸â\x80\x99Â\x9dÃ°Å¸â\x80\x98Â¸Ã°Å¸Â\x8fÂ» before attempting to fix it with the same steps as I am below.

My question is, how can I get my string in the same state as the site, before running the fix_broken_unicode function, to hopfully avoid the error I am facing?

When running my script, (probably due to me not pre-converting) I receive:

UnicodeEncodeError: 'latin-1' codec can't encode characters in position 3-4: ordinal not in range(256)

The source code for mentioned website can be found at: https://github.com/simonw/ftfy-web/blob/master/ftfy_app.py, although because I am primarily a C++ developer I can't understand it.

My script:

import ftfy.bad_codecs 

def fix_broken_unicode(string):
    string = string.encode('latin-1')
    string = string.decode('utf-8')
    string = string.encode('sloppy-windows-1252')
    string = string.decode('utf-8')
    return string
    
print(fix_broken_unicode("EvðŸ’ðŸ‘¸ðŸ»"))

Updates since answer:

My input: "EvðŸ’ðŸ‘¸ðŸ»", expected outcome: Ev💝👸🏻

Python: UnicodeEncodeError: 'latin-1' codec can't encode characters in position 3-4: ordinal not in range(256)

Answers (1)

Related Questions

Python: UnicodeEncodeError: &#39;latin-1&#39; codec can&#39;t encode characters in position 3-4: ordinal not in range(256)

Answers (1)

Related Questions

Python: UnicodeEncodeError: 'latin-1' codec can't encode characters in position 3-4: ordinal not in range(256)