Expand JSON from pySpark DataFrame into name / value pairs

Question

I have a pySpark dataframe looking like this:

|id|json                                  |
+--+--------------------------------------+
|1 |{"attr1": "value1"}                   |
|2 |{"attr2": "value2", "attr3": "value3"}|

root
 |-- id: string (nullable = true)
 |-- json: string (nullable = true)

How do I convert it into a new dataframe which will look like this:

|id|attr |value |
+--+-----+------+
|1 |attr1|value1|
|2 |attr2|value2|
|2 |attr3|value3|

(tried to google for the solution with no success, apologies if it's a duplicate) Thanks!

Expand JSON from pySpark DataFrame into name / value pairs

Answers (1)

Related Questions