Reputation: 406
I have multiple JSON files in the file structure:
And so on.
All the JSON files have the same schema.
However, I'm having issues ingesting this data into Databricks. I've set the recursiveFileLookup option to true and I'm able to ingest it, but I end up with one row per document (i.e. each file on its own row).
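Roughly, the read I'm doing looks like this (the path below is just a placeholder for the real folder structure):
# Placeholder path; recursiveFileLookup makes Spark pick up JSON files in all nested folders
df = (
    spark.read
    .option("recursiveFileLookup", "true")
    .json("/mnt/data/Commodity1/")
)
df.show()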
Is there any way to merge the data into one row in Databricks?
Also, if anyone has a solution to ingest this data into a data flow in Azure Data Factory, please share!
Thanks!
Upvotes: 0
Views: 76
Reputation: 3250
I have tried the below approach in Azure Databricks: read the nested files using recursiveFileLookup and merge the data into a single row.
# Read every JSON file under the nested directory structure
df_files = spark.read.option("recursiveFileLookup", "true").json("/FileStore/tables/Commodity1/Interval/")
df_files.show(truncate=False)
+----------+----------+-----+
|commodity |interval |value|
+----------+----------+-----+
|Commodity1|2024-01-01|100 |
|Commodity1|2024-01-01|200 |
|Commodity1|2024-01-01|300 |
|Commodity1|2024-01-02|150 |
|Commodity1|2024-01-02|250 |
+----------+----------+-----+
from pyspark.sql.functions import collect_list
# Aggregate every value into a single list so the result is one row
merged_df = df_files.agg(collect_list("value").alias("merged_values"))
merged_df.show(truncate=False)
+-------------------------+
|merged_values |
+-------------------------+
|[100, 200, 300, 150, 250]|
+-------------------------+
# Group by interval and collect the values for each date
grouped_df = df_files.groupBy("interval").agg(collect_list("value").alias("merged_values"))
grouped_df.show(truncate=False)
Results:
+----------+---------------+
|interval |merged_values |
+----------+---------------+
|2024-01-01|[100, 200, 300]|
|2024-01-02|[150, 250] |
+----------+---------------+
In the above code, all JSON files in the nested directory structure are read, all values are aggregated into a single list within one row, and the values are then grouped by interval so that the values for each date are collected together.
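If the goal is literally one row holding all of the ingested records rather than just the value column, the same collect_list pattern can be combined with struct. This is a minimal sketch assuming the same sample columns (commodity, interval, value):
from pyspark.sql.functions import collect_list, struct

# Collect every record as a struct into one array column,
# producing a single row that contains all of the ingested data
single_row_df = df_files.agg(
    collect_list(struct("commodity", "interval", "value")).alias("records")
)
single_row_df.show(truncate=False)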
Upvotes: 0