How to check the UTF-8 equivalent value of a character in Python?

Question

I want to know how to find the UTF-8 equivalent of a Tamil character. Is there a function for it? Can you give the syntax?

for line in f:
    words = line.strip().split()
    for word1, word2 in zip(words, words[1:]):
        if word1 == '1' and word2 == "கோடி":
            ff.write("onru\n")
            ff.write(word2 + '\n')
        else:
            ff.write(word1 + '\n')
            ff.write(word2 + '\n')

But it gives: SyntaxError: Non-ASCII character '\xe0' in file replacement.py on line 5, but no encoding declared. So how do I read non-ASCII characters, that is, the Tamil words, and mainly how do I compare and check them? Thanks in advance.

charvi · Accepted Answer

I don't know whether it technically makes any difference, but I just removed the double quotes and replaced them with single quotes, and now my program works. It is doing the comparison correctly. I now give 'கோடி' instead of "கோடி". I tried u'கோடி, u'/கோடி, u"கோடி; all of them gave errors.
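
For anyone landing here from the title: the SyntaxError in the question is the one Python 2 raises when a source file contains non-ASCII bytes without an encoding declaration (PEP 263), and the '\xe0' it mentions is the first UTF-8 byte of a Tamil letter. Below is a minimal sketch, not the asker's actual script, of declaring the encoding, printing the UTF-8 bytes of a character, and comparing Tamil words. The names utf8_hex and replace_one are hypothetical helpers made up for illustration, and replace_one is a simplified stand-in for the loop in the question (it emits each word once rather than in pairs).

```python
# -*- coding: utf-8 -*-
# The line above declares the source encoding (PEP 263). Python 2 needs it
# before any non-ASCII literal; Python 3 already assumes UTF-8.
from __future__ import unicode_literals  # makes plain literals unicode on Python 2


def utf8_hex(text):
    """Hypothetical helper: UTF-8 bytes of text as space-separated hex pairs."""
    return " ".join("{:02x}".format(b) for b in bytearray(text.encode("utf-8")))


def replace_one(words, target="கோடி"):
    """Hypothetical helper: swap '1' for 'onru' when the next word is target."""
    out = []
    for i, word in enumerate(words):
        nxt = words[i + 1] if i + 1 < len(words) else None
        out.append("onru" if word == "1" and nxt == target else word)
    return out


print(utf8_hex("க"))                      # e0 ae 95
print(len("கோடி".encode("utf-8")))        # 12 bytes for 4 code points
print(replace_one(["1", "கோடி"])[0])      # onru
```

When the words come from a file, opening it with io.open(path, encoding="utf-8") should give decoded text on both Python 2 and 3, so the comparison is text against text rather than bytes against text.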
