corpus
corpus

Reputation: 27

Customizing Python unidecode module

Unidecode is a wonderful tool, but I'd like to change processing of several cyrillic letters. For example, I prefer to have a letter 'я' replaced with 'ya', not 'ia'. What file should I modify? There are lots of files named like 'x00a.py', unfortunately I can't find the one that contains needed symbols (small 'я' is 'x44F', for instance).

Upvotes: 1

Views: 105

Answers (1)

Ry-
Ry-

Reputation: 225005

Each filename is a 3-digit prefix of a 5-digit value, so you’re looking for x004.py. Specifically line 81:

'ia', # 0x4f

Upvotes: 1

Related Questions