Efficient grouping by key and StatCounter

Question

I am aggregating values by parameter as below using apache-spark and scala. This keeps adding values to "List" Is there more efficient way to get list by key and StatCounter?

val predictorRawKey = predictorRaw.map { x =>
      val param = x._1
      val val: Double = x._2.toDouble
      (param, val)
    }.mapValues(num => List( num) )
     .reduceByKey((l1, l2) => l1 ::: l2)
     .map { x => x._1, StatCounter(x._2.iterator))

Efficient grouping by key and StatCounter

Answers (1)

Related Questions