Unicode to Latin in Teradata Conversion

Question

I have been trying to convert Unicode strings to Latin in Teradata version 16.20.32.23. I have seen many online forums but I was not able to formulate a solution. Following are some of the strings that I was unable to convert:

hyÃƒÂ¶dyt
lÃƒÂ¶ydÃƒÂ¤t

I have tried following solution but function translate_chk does not seems to work.

SELECT CASE WHEN Translate_Chk ( 'hyÃƒÂ¶dyt' using UNICODE_TO_LATIN) <> 0 
THEN
''
WHEN Translate_Chk ( 'hyÃƒÂ¶dyt' using UNICODE_TO_LATIN ) = 0 
THEN
Translate ( 'hyÃƒÂ¶dyt' using UNICODE_TO_LATIN WITH ERROR)
END AS transalated

The error I receive is SELECT FAILED. 6706: The string contains untranslatable character.

I think I have reached a dead end, could anyone help me here?

Mark Tolonen · Accepted Answer

I'm not familiar with Teradata, but the strings you have are double-mis-decoded as Windows-1252, which is a variation of ISO-8859-1 a.k.a latin1. Example to fix in Python:

>>> s='hyÃƒÂ¶dyt'
>>> s.encode('cp1252').decode('utf8').encode('cp1252').decode('utf8')
'hyödyt'
>>> s='lÃƒÂ¶ydÃƒÂ¤t'
>>> s.encode('cp1252').decode('utf8').encode('cp1252').decode('utf8')
'löydät'

So not a Teradata solution, but should help you figure it out.

Unicode to Latin in Teradata Conversion

Answers (2)

Related Questions