Reputation: 1

PySpark on a standalone Hadoop cluster: loading a CSV file fails with "Path does not exist" (hdfs://home/usr/filepath.csv)

df = spark.read.csv('/home/hadoop/observations_temp.csv', header=True)

When I run the script, it raises the following error message:

pyspark.sql.utils.AnalysisException: u'Path does not exist: hdfs://home/anmol/SnapShot.cvs

Upvotes: 0

Answers (1)

Reputation: 6082

I believe there is a typo in your path: the extension is cvs instead of csv. This should work:

df = spark.read.csv('hdfs://home/anmol/SnapShot.csv')
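Apart from the extension, the form of the URI may also be worth checking. Here is a minimal sketch that uses only Python's standard urllib.parse (not Spark itself) to show how a URI of this shape is split. With two slashes after hdfs:, the first segment, home, is read as the host (in an HDFS URI, the NameNode address) rather than as a directory. The helper name split_uri is hypothetical and only for illustration.

```python
from urllib.parse import urlparse


def split_uri(uri):
    """Return (scheme, host, path) for a filesystem URI."""
    parts = urlparse(uri)
    return parts.scheme, parts.netloc, parts.path


# Two slashes: "home" is taken as the host, not as a directory.
print(split_uri("hdfs://home/anmol/SnapShot.csv"))
# ('hdfs', 'home', '/anmol/SnapShot.csv')

# Three slashes: the host is empty and the path starts at /home.
print(split_uri("hdfs:///home/anmol/SnapShot.csv"))
# ('hdfs', '', '/home/anmol/SnapShot.csv')

# Local filesystem form of the same path.
print(split_uri("file:///home/anmol/SnapShot.csv"))
# ('file', '', '/home/anmol/SnapShot.csv')
```

If the file is on the local disk rather than in HDFS, a file:///home/... URI is the usual way to say so. With three slashes, hdfs:///home/... leaves the host empty so that the configured default filesystem is used. Which of these applies depends on where the file actually lives, which the question does not state.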

Upvotes: 1