Reputation: 5274
I have just migrated a database containing Latin American place names from MS Access to MySQL. In the process, every instance of á has been changed to ‡. Here is my question:
Is there a reference for looking up which character encoding has been misread as which other? For example, a place where I can enter a character and see how it would be misrepresented after a variety of erroneous encoding translations (e.g. ASCII to ISO 8859-1, ISO 8859-1 to UTF-8, etc.)?
Upvotes: 1
Views: 297
Reputation: 90995
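Not that I'm aware of, but if you have a list of possible encodings, you can write a simple Python program like:
    # ENCODINGS is your list of candidate codec names (strings).
    for x in ENCODINGS:
        for y in ENCODINGS:
            try:
                # Same bytes: 'á' written as x is displayed as '‡' when read as y.
                if 'á'.encode(x) == '‡'.encode(y):
                    print(x, '→', y)
            except UnicodeError:
                # One of the characters does not exist in this encoding; skip the pair.
                pass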
Doing that, it appears in your case that the original encoding is one of:
and the misinterpreted encoding is one of:
If you live in a "Western" locale, then mac_roman → cp1252 is the most likely possibility: the byte 0x87 is á in Mac Roman and ‡ in Windows-1252.
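For anyone who wants to try this end to end, here is a minimal, self-contained sketch of the same search, plus a way to undo the damage once the pair is known. The CANDIDATES list and the names find_mojibake_pairs and repair are my own illustrative choices, not a standard API, and the list is deliberately short; extend it with whatever codecs you suspect.

```python
# Hypothetical candidate list (an assumption); add any codecs you suspect.
CANDIDATES = ["ascii", "latin_1", "cp1252", "cp437", "cp850", "mac_roman", "utf_8"]


def find_mojibake_pairs(original, garbled, encodings=CANDIDATES):
    """Return (written_as, read_as) pairs for which `original` encoded as
    written_as yields the same bytes as `garbled` encoded as read_as."""
    pairs = []
    for written_as in encodings:
        for read_as in encodings:
            try:
                if original.encode(written_as) == garbled.encode(read_as):
                    pairs.append((written_as, read_as))
            except UnicodeError:
                # A character is not representable in this codec; skip the pair.
                continue
    return pairs


def repair(text, written_as, read_as):
    """Undo the misreading: recover the raw bytes, then decode them correctly."""
    return text.encode(read_as).decode(written_as)


pairs = find_mojibake_pairs("á", "‡")
print(pairs)

# Once a pair is chosen, the same round trip repairs whole strings.
print(repair("Bogot‡", "mac_roman", "cp1252"))
```

With this candidate list the result should include ("mac_roman", "cp1252"). The repair step only works if every garbled character can be encoded back in the misread encoding, so test it on a copy of the data before touching the real table.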
Upvotes: 1