Pivot a spark dataframe without a groupBy column

Question

Let's say I've a dataframe something like this,

+---+----+------+
|id |name|salary|
+---+----+------+
|10 |abc |100   |
+---+----+------+

And I would like to pivot/transpose the data so that the output looks like,

+--------+----+
|col_name|data|
+--------+----+
|id      |10  |
|name    |abc |
|salary  |100 |
+--------+----+

How would I do this using pyspark.

Shubham Jain · Accepted Answer

You can use stack as

s = ','.join([f"'{i}', `{i}`" for i in df.columns])
df = df.select([col(i).cast('string') for i in df.columns])
df.select(expr(f'''stack({len(df.columns)},{s})''')).show()

+------+----+
|  col0|col1|
+------+----+
|    id|  10|
|  name| abc|
|salary| 100|
+------+----+

Pivot a spark dataframe without a groupBy column

Answers (2)

Related Questions