Need help Parsing strange JSON with scala

Question

I am working on parsing json to spark dataframe in scala. I have a nested json file of 50 different records of different household items. On JSON I am trying to parse the equipment tag is as below:

"equipment":[{"tv":[""]}]

Due to this item name (ex: tv in this case) is becoming column name than values.

Ideally this tag should be like,

"equipment":["tv"]

Is there a way parse this type of JSON tags/ contents?

Due to this the dataframe schema is being shown as:

 |-- equipment: array (nullable = true)
 |    |-- element: struct (containsNull = true)
 |    |    |-- ac: array (nullable = true)
 |    |    |    |-- element: string (containsNull = true)
 |    |    |-- tv: array (nullable = true)
 |    |    |    |-- element: string (containsNull = true)

Where you can see that (above) ac & tv are becoming column headers. Instead of that i need them to shown as values. The dataframe should look like:

+----------+
|equipment |
+----------+
|tv        |
|ac        |
+----------+

Need help Parsing strange JSON with scala

Answers (1)

Related Questions