vishal raj

Reputation: 21

Sequence file reading issue using spark Java

I am trying to read a sequence file generated by Hive using Spark. When I try to access the file, I get: org.apache.spark.SparkException: Job aborted due to stage failure: Task not serializable: java.io.NotSerializableException:

I have tried the usual workarounds for this issue, such as making the class serializable, but I still face it. I am writing the code snippet here; please let me know what I am missing.

Is it the BytesWritable data type, or something else, that is causing the issue?

// Read the Hive-generated sequence file as (BytesWritable, Text) pairs
JavaPairRDD<BytesWritable, Text> fileRDD = javaCtx.sequenceFile("hdfs://path_to_the_file", BytesWritable.class, Text.class);
// Extract the Text value from each record and collect the results to the driver
List<String> result = fileRDD.map(new Function<Tuple2<BytesWritable, Text>, String>() {
    public String call(Tuple2<BytesWritable, Text> row) {
        return row._2.toString() + "\n";
    }
}).collect();

Upvotes: 2

Views: 725

Answers (1)

Sudarshan kumar

Reputation: 1585

Here is what was needed to make it work:

Because we use HBase to store our data and this reducer outputs its result to an HBase table, Hadoop is telling us that it doesn't know how to serialize our data. That is why we need to help it. Inside setUp, set the io.serializations variable. You can do the same in Spark:

conf.setStrings("io.serializations",
        hbaseConf.get("io.serializations"),
        MutationSerialization.class.getName(),
        ResultSerialization.class.getName());
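For context, here is a minimal sketch of how that line might be wired into a Spark driver. The class name SequenceFileReader and the use of the context's Hadoop configuration are my assumptions, not from the original post; MutationSerialization and ResultSerialization come from the org.apache.hadoop.hbase.mapreduce package:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.mapreduce.MutationSerialization;
import org.apache.hadoop.hbase.mapreduce.ResultSerialization;
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;

// Sketch only: class and variable names are illustrative assumptions
public class SequenceFileReader {
    public static void main(String[] args) {
        SparkConf sparkConf = new SparkConf().setAppName("SequenceFileReader");
        JavaSparkContext javaCtx = new JavaSparkContext(sparkConf);

        // The Hadoop Configuration that Spark passes to its input formats
        Configuration conf = javaCtx.hadoopConfiguration();
        // HBase's own configuration, which carries its default serializations
        Configuration hbaseConf = HBaseConfiguration.create();

        // Register the HBase serializations in addition to whatever is already configured
        conf.setStrings("io.serializations",
                hbaseConf.get("io.serializations"),
                MutationSerialization.class.getName(),
                ResultSerialization.class.getName());

        // ... read the sequence file with javaCtx.sequenceFile(...) as in the question ...
    }
}

Note that this should be set before the sequence file RDD is created, so the reader picks up the extra serializations.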

Upvotes: 1
