Reputation:

Rows being dropped in R with read.table?

I am loading a table in which the first column is a URL and reading it into R using read.table().

It seems that R is dropping about 1/3 of the columns and does not return any errors.

The URLs do not contain any # characters or tabs (my separator field), which I understand could be an issue. If I convert the URLs to integer IDs first, the problem goes away.

Is there something about the field that might be causing R to drop the rows?

Upvotes: 1

Answers (2)

kari

Reputation: 272

Without a sample of the data, it's hard to say. But one small "gotcha" is that # is the default comment.char in read.table(). Try to set comment.char = "" and see if that fixes it.

Upvotes: 2

Eytan

Reputation:

Thanks for all your help,

Yes, so initially there were some hashes and I was able to handle them using comment.char = ''. The problem turned out to be that some of my URLs contained ' and " characters. The strangest thing about the situation is that it didn't return any errors. After I removed these characters using tr, I had no issues with loading the data.

Upvotes: 1

Rows being dropped in R with read.table?

Answers (2)

Related Questions