Reputation: 533
I have the following RDD:
| Key | Value | Date |
|-----|-------|------------|
| 1 | A | 10/30/2016 |
| 1 | B | 10/31/2016 |
| 1 | C | 11/1/2016 |
| 1 | D | 11/2/2016 |
| 2 | A | 11/2/2016 |
| 2 | B | 11/2/2016 |
| 2 | C | 11/2/2016 |
| 3 | A | 10/30/2016 |
| 3 | B | 10/31/2016 |
| 3 | C | 11/1/2016 |
| 3 | D | 11/2/2016 |
And I would like to transform it to the following RDD:
| Key | List |
|-----|--------------|
| 1 | (A, B, C, D) |
| 2 | (A, B, C) |
| 3 | (A, B, C, D) |
That is, (Key, List(Value)), where the list of values is ordered by the corresponding dates. All keys, obviously, will be unique, but not all values will necessarily be unique. I would still like to list all values. How can I accomplish this?
Upvotes: 0
Views: 180
Reputation: 14825
Create a model to represent the data. (You could use tuples as well, but code built on tuples becomes ugly very quickly; it's always good to have names for the fields.)
case class DataItem(key: Int, value: String, timeInMillis: Long)
Then parse the data (you can use Joda's DateTimeFormat to parse the dates into milliseconds) and create your RDD:
val rdd = sc.parallelize(List(DataItem(1, "A", 123), DataItem(2, "B", 1234), DataItem(2, "C", 12345)))
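The literal millis above are placeholders. As a sketch of the parsing step, the question's M/d/yyyy date strings can also be converted with the JDK's own java.time (an alternative to Joda); toMillis here is a hypothetical helper name:

```scala
import java.time.LocalDate
import java.time.format.DateTimeFormatter

// Parses dates like "10/30/2016" or "11/1/2016" to epoch millis (UTC midnight).
val fmt = DateTimeFormatter.ofPattern("M/d/yyyy")

def toMillis(s: String): Long =
  LocalDate.parse(s, fmt).toEpochDay * 86400000L
```

Epoch millis compare the same way the dates do, which is all the later sortBy needs.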
Then the final step: groupBy key and sortBy time:
rdd.groupBy(_.key).map { case (k, v) => k -> v.toList.sortBy(_.timeInMillis)}
Scala REPL
scala> case class DataItem(key: Int, value: String, timeInMillis: Long)
defined class DataItem
scala> val rdd = sc.parallelize(List(DataItem(1, "A", 123), DataItem(2, "B", 1234), DataItem(2, "C", 12345)))
rdd: org.apache.spark.rdd.RDD[DataItem] = ParallelCollectionRDD[13] at parallelize at <console>:35
scala> rdd.groupBy(_.key).map { case (k, v) => k -> v.toList.sortBy(_.timeInMillis)}
res11: org.apache.spark.rdd.RDD[(Int, List[DataItem])] = MapPartitionsRDD[16] at map at <console>:38
scala> rdd.groupBy(_.key).map { case (k, v) => k -> v.toList.sortBy(_.timeInMillis)}.foreach(println)
(1,List(DataItem(1,A,123)))
(2,List(DataItem(2,B,1234), DataItem(2,C,12345)))
scala> rdd.groupBy(_.key).map { case (k, v) => k -> v.toList.sortBy(_.timeInMillis)}.map { case (k, v) => (k, v.map(_.value)) }.foreach(println)
(1,List(A))
(2,List(B, C))
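The same groupBy / sortBy / map chain can be sanity-checked locally, since plain Scala collections share those combinators with RDDs. A sketch using the question's full data set (with dates already reduced to ordered millis); note that for rows sharing a date, as under key 2, Spark gives no guaranteed order within the group after the shuffle:

```scala
case class DataItem(key: Int, value: String, timeInMillis: Long)

// The question's rows, with dates stood in by ordered millis.
val data = List(
  DataItem(1, "A", 1), DataItem(1, "B", 2), DataItem(1, "C", 3), DataItem(1, "D", 4),
  DataItem(2, "A", 4), DataItem(2, "B", 4), DataItem(2, "C", 4),
  DataItem(3, "A", 1), DataItem(3, "B", 2), DataItem(3, "C", 3), DataItem(3, "D", 4)
)

// Group by key, sort each group by time, keep only the values.
val result: Map[Int, List[String]] =
  data.groupBy(_.key).map { case (k, v) =>
    k -> v.sortBy(_.timeInMillis).map(_.value)
  }
```

On a List the sort is stable, so key 2's ties keep their input order; on an RDD they may come back in any order.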
Upvotes: 1