Huey
Huey

Reputation: 2834

Converting accents to ASCII in R

I'm trying to convert special characters to ASCII in R. I tried using Hadley's advice in this question:

stringi::stri_trans_general('Jos\xe9', 'latin-ascii')

But I get "Jos�". I'm using stringi v1.1.1.

I'm running a Mac. My friends who are running Windows machines seem to get the desired result of "Jose".

Any idea what is going on?

Upvotes: 4

Views: 3450

Answers (1)

Ista
Ista

Reputation: 10437

The default encoding on Windows is different from the typical default encoding on other operating systems (UTF-8). x ='Jos\xe9' means something in Latin1, but not in UTF-8. So, on Linux or OS X you need to tell R what the encoding is:

x ='Jos\xe9'
Encoding(x) <- 'latin1'
stri_trans_general(x, 'Latin-ASCII')

Upvotes: 10

Related Questions