ElinaJ
ElinaJ

Reputation: 801

How to remove special characters while loading a csv in R?

I have this similar problem: read.csv warning 'EOF within quoted string' prevents complete reading of file

That is, when I load a csv R says:

Warning message:
In scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings,  :
EOF within quoted string

I can get rid of this error by applying: quotes="" to read.csv

But the main problem still exists, only 22111 rows of 689233 in total are read into R. I would like to try removing all special characters from the csv to see if this clears the problem.

Related I found this: How to remove specific special characters in R

But is there a way to do it in read.csv, that is in the phase when I'm reading in the file?

Upvotes: 1

Views: 3758

Answers (3)

Tauqeer Afzal
Tauqeer Afzal

Reputation: 1

Solution of datatable expord csv with special chahracters Find charset from https://cdn.datatables.net/buttons/1.1.2/js/buttons.html5.js or https://cdn.datatables.net/buttons/1.1.2/js/buttons.html5.min.js

and change it to 'UTF-8-BOM'from 'UTF-8'

Upvotes: 0

ElinaJ
ElinaJ

Reputation: 801

It was indeed a special charcter. There was a → (arrow, hexadecimal value 0x1A) on line 22,112. After deleting the arrow I get the data to load normally!

Upvotes: 0

daniel
daniel

Reputation: 1246

Did you try fread from data.table? It can optimize the task and likely deal with some common issues. As you haven't provide any piece of data, I'm giving a silly example:

> fread('col1,col2\n5,"4\n3"')
   col1 col2
1:    5 4\n3

Upvotes: 1

Related Questions