MapReduce to calculate sum of tab separated input values

Question

I am trying to use MapReduce to find sum of tab separated input separated by its labels. The data looks like this

1     5.0    4.0   6.0
2     2.0    1.0   3.0
1     3.0    4.0   8.0

The first column is the class label so I am expecting an output categorized by class label. For this instance the output would be

label 1: 30.0
label 2: 6.0

Here is the code that I tried but I am getting wrong output and

unexpected class labels are displayed.

public class Total {

 public static class Map extends Mapper {
    private final static DoubleWritable one = new DoubleWritable();
    private Text word = new Text();

    public void map(LongWritable key, Text value, Context context) throws IOException, InterruptedException {
        String line = value.toString();
        StringTokenizer tokenizer = new StringTokenizer(line);
        word.set(tokenizer.nextToken());
        while (tokenizer.hasMoreTokens()) {
            one.set(Double.valueOf(tokenizer.nextToken()));
            context.write(word, one);                                           
        }
    }
 }

 public static class Reduce extends Reducer {
    private Text Msg = new Text();


    public void reduce(Text key, Iterable values, Context context) 
      throws IOException, InterruptedException {
       firstMsg.set("label " + key+": Total");

       Double sum = 0.0;

         for (DoubleWritable val : values) {

            sum += val.get();


        }

        context.write(Msg, new DoubleWritable(sum));

    }
 }
//void method implementation also exists
}

MapReduce to calculate sum of tab separated input values

Answers (1)

Related Questions