Reputation: 159
I have a Mapper class (CustomMapper.class) and a Reducer class (CustomReducer.class) that I want to use in Spark. In Hadoop I could use them by creating a Job object and setting the required Mapper and Reducer classes as follows:
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

Configuration conf = new Configuration();
Job j = new Job(conf, "Adjacency Generator Job");
j.setMapperClass(CustomMapper.class);
j.setReducerClass(CustomReducer.class);
How can I achieve the same in Spark using Java? I have created a JavaRDD object as follows:
SparkConf conf = new SparkConf().setAppName("startingSpark").setMaster("local[*]");
JavaSparkContext sc = new JavaSparkContext(conf);
JavaRDD<String> myFile = sc.textFile(args[0]);
I am not sure how to bind the Mapper and Reducer classes in Spark using Java. Any help is appreciated.
Upvotes: 0
Views: 63
Reputation: 4048
Why do you want to do that? Spark internally builds a DAG of execution made up of transformations (like map, filter, etc.) and actions (like collect, count, etc.) that trigger the DAG. This is a fundamentally different model of computation from MapReduce. Roughly, your Mapper corresponds to a map (or flatMap) transformation on the RDD, and your Reducer to one of the aggregation functions such as reduceByKey. Please read the docs to understand how Spark works. A rough sketch of the translation is shown below.
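For example, assuming (hypothetically, since you haven't shown their code) that your CustomMapper emits a (token, 1) pair per token and your CustomReducer sums the counts per key, and that you are on the Spark 2.x Java API, the equivalent pipeline could look roughly like this:

import java.util.Arrays;

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaPairRDD;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;

import scala.Tuple2;

public class SparkMapReduceSketch {
    public static void main(String[] args) {
        SparkConf conf = new SparkConf().setAppName("startingSpark").setMaster("local[*]");
        JavaSparkContext sc = new JavaSparkContext(conf);
        JavaRDD<String> myFile = sc.textFile(args[0]);

        // "Mapper" step: split each line and emit (token, 1) pairs,
        // much like a Mapper writing key/value pairs to the context.
        JavaPairRDD<String, Integer> mapped = myFile
                .flatMap(line -> Arrays.asList(line.split("\\s+")).iterator())
                .mapToPair(token -> new Tuple2<>(token, 1));

        // "Reducer" step: all values sharing a key are combined,
        // much like a Reducer iterating over the values for one key.
        JavaPairRDD<String, Integer> reduced = mapped.reduceByKey((a, b) -> a + b);

        reduced.saveAsTextFile(args[1]);
        sc.stop();
    }
}

The lambdas passed to flatMap/mapToPair play the role of your Mapper's map() method, and the function passed to reduceByKey plays the role of your Reducer's reduce() method; Spark handles the shuffle between them for you.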
Upvotes: 1