Pyspark data transformation logic to assign one column values to another

Question

I am using spark 2.1.0. I have dataframe as mentioned below. I am very new to pyspark I am stuck up with this issue

Now the problem statement is : Taking b column into consideration I should populate the C column from reference to column a i,e For every 4 values from column a , column c has to be populated with referring to values from column b. For example as shown in below dataframe from row no :4 equivalent c value is 30 . This 30 has been obtained from column b having its equivalent a to be 1

Below is my original dataframe

Resulting dataframe should be as shown below:

Please help me in resolving this Thanks in advance

Pyspark data transformation logic to assign one column values to another

Answers (1)

Related Questions