Add special characters in csv pandas python

Question

While writing strings containing certain special characters, such as

    Töölönlahdenkatu

using to_csv from pandas, the result in the csv looks like

    T%C3%B6%C3%B6l%C3%B6nlahdenkatu

How do we get to write the text of string as it is? This is my to_csv command

    df.to_csv(csv_path,index=False,encoding='utf8')

I have even tried

    df.to_csv(csv_path,index=False,encoding='utf-8')
    df.to_csv(csv_path,index=False,encoding='utf-8-sig')

and still no success.There are other characters replaced with random symbols

    '-' to â€“

Is there a workaround?

LazyCoder · Accepted Answer

What you're trying to do is remove German umlauts and Spanish tildes. There is an easy solution for that.

import unicodedata

data = u'Töölönlahdenkatu Adiós Pequeño'
english = unicodedata.normalize('NFKD', data).encode('ASCII', 'ignore')
print(english)

output : b'Toolonlahdenkatu Adios Pequeno'

Let me know if it works or if there are any edge cases.

Answers (2)