How do we flatten a List of values with the same key in Spark?

Question

I have an RDD like the one below:

 Array[(String, List[Int])] = Array((2008,List(40, 20)), (2000,List(30, 10)), (2001,List(9)))

I am looking to flatten the values for each key, so that every list element is paired with its key.

Expected output:

 Array[(String, Int)]

 Array((2008,40), (2008,20), (2000,30), (2000,10), (2001,9))

Can someone help me get this result?

freedev · Accepted Answer

I would try something like this:

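// Sample data with the same shape as in the question (String keys)
val l = Array(("2008", List(40, 20)), ("2000", List(30, 10)), ("2001", List(9)))

// Pair each list element with its key; the same flatMap call also works on an RDD
l.flatMap { case (key, values) => values.map(value => (key, value)) }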
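
Since the question is about an RDD rather than a local Array, here is a minimal sketch of the same idea on a pair RDD using `flatMapValues`, which keeps the key and emits one pair per element of the value. The object name `FlattenByKey`, the app name, and the `local[*]` master are assumptions made only for illustration; in a spark-shell you would use the existing `sc` instead of creating a context.

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical wrapper object, used only to make the sketch self-contained.
object FlattenByKey {
  // Builds the sample RDD from the question and flattens it per key.
  def flatten(sc: SparkContext): Array[(String, Int)] = {
    val rdd = sc.parallelize(Seq(
      ("2008", List(40, 20)),
      ("2000", List(30, 10)),
      ("2001", List(9))
    ))
    // flatMapValues leaves each key untouched and emits one (key, element)
    // pair for every element of that key's list.
    rdd.flatMapValues(values => values).collect()
  }

  def main(args: Array[String]): Unit = {
    // Assumed local setup; adjust the master for a real cluster.
    val conf = new SparkConf().setAppName("flatten-by-key").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try flatten(sc).foreach(println)
    finally sc.stop()
  }
}
```

Because `flatMapValues` does not touch the keys, Spark can preserve the RDD's existing partitioner, which a plain `flatMap` over the pairs does not do.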
