Akshay Parmar

Reputation: 96

How to generate cumulative concatenation in Spark SQL

My input for Spark is below:

Col_1  Col_2  Amount
1      0      35/310320
1      1      35/5
1      1      180/-310350
17     1      0/1000
17     17     0/-1000
17     17     74/314322
17     17     74/5
17     17     185/-3142

I want to generate the output below using Spark SQL:

Output
35/310320
35/310320/35/5
35/310320/35/5/180/-310350
0/1000
0/1000/0/-1000
0/1000/0/-1000/74/314322
0/1000/0/-1000/74/314322/74/5
0/1000/0/-1000/74/314322/74/5/185/-3142

Conditions & procedure: if the col_1 and col_2 values are not the same, the new Output column is just the current Amount value; if they are the same, concatenate all previous Amount values together with the current one, separated by /.

For example, on the first row with col_1 = 17, col_1 and col_2 differ, so the Output is the current Amount, 0/1000. On the next row both column values are the same, so the Output becomes 0/1000/0/-1000, and so on. I need to implement this logic for dynamic data in Spark SQL or Spark Scala.
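For reference, the input above can be rebuilt as a DataFrame like this (a sketch assuming an active SparkSession named spark; the column name amount is lower-cased, which Spark resolves case-insensitively by default):

// Reproduce the sample input from the question
import spark.implicits._

val df = Seq(
  (1, 0, "35/310320"),
  (1, 1, "35/5"),
  (1, 1, "180/-310350"),
  (17, 1, "0/1000"),
  (17, 17, "0/-1000"),
  (17, 17, "74/314322"),
  (17, 17, "74/5"),
  (17, 17, "185/-3142")
).toDF("col_1", "col_2", "amount")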

Upvotes: 0

Views: 224

Answers (1)

mck

Reputation: 42422

You can use concat_ws on the list of amounts obtained from collect_list over an appropriate window:

import org.apache.spark.sql.expressions.Window
import org.apache.spark.sql.functions.{collect_list, concat_ws}

val df2 = df.withColumn(
    "output",
    concat_ws(
        "/",
        // running list of amounts within each col_1 group, up to the current row
        collect_list("amount").over(
            Window.partitionBy("col_1")
                  .orderBy("col_2")
                  .rowsBetween(Window.unboundedPreceding, Window.currentRow)
        )
    )
)
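Since the question also asks for plain Spark SQL, the same running concatenation can be written with collect_list as a window function (a sketch assuming the DataFrame is registered as a temp view named df):

df.createOrReplaceTempView("df")

val df2 = spark.sql("""
    SELECT col_1, col_2, amount,
           concat_ws(
               '/',
               collect_list(amount) OVER (
                   PARTITION BY col_1
                   ORDER BY col_2
                   ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
               )
           ) AS output
    FROM df
""")

One caveat: rows that share the same col_2 value within a col_1 group (e.g. the 17/17 rows) have no deterministic order under this window, so if the original file order matters you may want to add an explicit ordering column first (for example with monotonically_increasing_id, applied before any shuffle).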

Upvotes: 1
