6:[["$","$Le",null,{}],["$","div",null,{"className":"min-h-screen bg-gray-100 p-6","children":[["$","$Lf",null,{}],["$","script",null,{"type":"application/ld+json","dangerouslySetInnerHTML":{"__html":"{\"@context\":\"https://schema.org\",\"@type\":\"QAPage\",\"mainEntity\":{\"@type\":\"Question\",\"name\":\"SparkR df read as one column\",\"text\":\"

txt with 4 column divided by \\\\t.

\\n\\n

When I read it in this way:

\\n\\n

A<-read.df(sqlContext,\\\"/home/daniele/Tnt3.txt\\\", \\\"com.databricks.spark.csv\\\")\\n

\\n\\n

SparkR read it all as one column

\\n\\n

 a\\\\tb\\\\tc\\\\td\\n

\\n\\n

How can I change the \\\\t to , in sparkR?

\\n\\n

(I know that I can change it manually like this sed -i 's/\\\\t/,/g' file but is a little bit slowly)

\\n\",\"author\":{\"@type\":\"Person\",\"name\":\"DanieleO\"},\"upvoteCount\":1,\"answerCount\":2,\"acceptedAnswer\":{\"@type\":\"Answer\",\"text\":\"

a <- read.df(sqlContext, \\\"/home/daniele/Tnt3.txt\\\", \\\"com.databricks.spark.csv\\\", delimiter=\\\"\\\\t\\\")

\\n\",\"author\":{\"@type\":\"Person\",\"name\":\"xyzzy\"},\"upvoteCount\":3}}}"}}],["$","div",null,{"className":"bg-white shadow-md rounded-lg p-6 mb-6 relative","children":[["$","div",null,{"className":"absolute top-4 right-4 flex flex-wrap space-x-2","children":[["$","span","csv",{"className":"bg-blue-600 text-white text-sm px-3 py-1 rounded-full","children":["$","$L10",null,{"href":"/discussion/tag/csv/1","children":"csv"}]}],["$","span","apache-spark",{"className":"bg-blue-600 text-white text-sm px-3 py-1 rounded-full","children":["$","$L10",null,{"href":"/discussion/tag/apache-spark/1","children":"apache-spark"}]}],["$","span","sparkr",{"className":"bg-blue-600 text-white text-sm px-3 py-1 rounded-full","children":["$","$L10",null,{"href":"/discussion/tag/sparkr/1","children":"sparkr"}]}]]}],["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://www.gravatar.com/avatar/878d6755201071025a27c1ea9407dd66?s=256&d=identicon&r=PG&f=y&so-version=2","alt":"DanieleO","className":"w-16 h-16 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/5772936/danieleo","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"DanieleO"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",472]}]]}]]}],["$","h1",null,{"className":"text-2xl font-bold text-gray-800 mb-4","children":"SparkR df read as one column"}],["$","p",null,{"className":"text-gray-700 mt-4","dangerouslySetInnerHTML":{"__html":"

txt with 4 column divided by \\t.

\n\n

When I read it in this way:

\n\n

A<-read.df(sqlContext,\"/home/daniele/Tnt3.txt\", \"com.databricks.spark.csv\")\n

\n\n

SparkR read it all as one column

\n\n

 a\\tb\\tc\\td\n

\n\n

How can I change the \\t to , in sparkR?

\n\n

(I know that I can change it manually like this sed -i 's/\\t/,/g' file but is a little bit slowly)

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm mt-4","children":[["$","p",null,{"children":["Upvotes: ",1]}],["$","p",null,{"children":["Views: ",582]}]]}]]}],["$","div",null,{"className":"container mx-auto","children":[["$","h2",null,{"className":"text-2xl font-semibold text-gray-800 mb-6","children":["Answers (",2,")"]}],[["$","div","35371337",{"className":"bg-white shadow-md rounded-lg p-6 mb-6","children":[["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://www.gravatar.com/avatar/0b6432fe80734f7afb5df1c5d6c4c6ed?s=256&d=identicon&r=PG&f=y&so-version=2","alt":"xyzzy","className":"w-12 h-12 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/2238220/xyzzy","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"xyzzy"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",329]}]]}]]}],["$","p",null,{"className":"text-gray-700 mb-4","dangerouslySetInnerHTML":{"__html":"

a <- read.df(sqlContext, \"/home/daniele/Tnt3.txt\", \"com.databricks.spark.csv\", delimiter=\"\\t\")

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm","children":["$","p",null,{"children":["Upvotes: ",3]}]}]]}],["$","div","35370889",{"className":"bg-white shadow-md rounded-lg p-6 mb-6","children":[["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://www.gravatar.com/avatar/7b13c2b98a0bbce211c473e1d8dc017a?s=256&d=identicon&r=PG&f=y&so-version=2","alt":"DanielVL","className":"w-12 h-12 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/5519050/danielvl","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"DanielVL"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",249]}]]}]]}],["$","p",null,{"className":"text-gray-700 mb-4","dangerouslySetInnerHTML":{"__html":"

You should specify delimiter.

\n\n

Im newer in R, but i think is something like this

\n\n

A<-read.df(sqlContext,\"/home/daniele/Tnt3.txt\", \"com.databricks.spark.csv\").options(\"delimiter\", \"\\t\")

\n\n

for more info, visit page of spark-csv:

\n\n

https://github.com/databricks/spark-csv

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm","children":["$","p",null,{"children":["Upvotes: ",0]}]}]]}]]]}],["$","div",null,{"className":"bg-white shadow-md rounded-lg p-6 mt-6","children":[["$","h2",null,{"className":"text-2xl font-semibold text-gray-800 mb-4","children":"Related Questions"}],["$","ul",null,{"className":"list-disc list-inside","children":[["$","li","6002256",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/6002256","className":"text-blue-600 hover:underline","children":"Is it possible to force Excel recognize UTF-8 CSV files automatically?"}]}],["$","li","14964035",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/14964035","className":"text-blue-600 hover:underline","children":"How to export JavaScript array info to csv (on client side)?"}]}],["$","li","3635166",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/3635166","className":"text-blue-600 hover:underline","children":"How do I import CSV file into a MySQL table?"}]}],["$","li","16176996",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/16176996","className":"text-blue-600 hover:underline","children":"Keep only date part when using pandas.to_datetime"}]}],["$","li","36981392",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/36981392","className":"text-blue-600 hover:underline","children":"save data into Hadoop using sparkR - crash"}]}],["$","li","41181969",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/41181969","className":"text-blue-600 hover:underline","children":"Read a csv file in sparkR where columns have spaces"}]}],["$","li","36779542",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/36779542","className":"text-blue-600 hover:underline","children":"Load spark-csv from Rstudio under Windows environment"}]}],["$","li","36444362",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/36444362","className":"text-blue-600 hover:underline","children":"Rounding off values in a column - SparkR"}]}],["$","li","34718730",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/34718730","className":"text-blue-600 hover:underline","children":"SparkR dubt and Broken pipe exception"}]}],["$","li","31674017",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/31674017","className":"text-blue-600 hover:underline","children":"To use sparkR columns"}]}]]}]]}]]}],["$","$L11",null,{}],["$","$L12",null,{}],["$","$L13",null,{}],["$","$L14",null,{}],["$","$L15",null,{}]]

SparkR df read as one column

Answers (2)

Related Questions