How can I write a query to insert array values from a python dictionary in BigQuery?

Question

I have a Python dictionary that looks like this:

{
'id': 123, 
'categories': [
    {'category': 'fruit', 'values': ['apple', 'banana']}, 
    {'category': 'animal', 'values': ['cat']},
    {'category': 'plant', 'values': []}
  ]
}

I am trying to insert these values into a BigQuery table via the API using Python. I just need to format the dictionary above into an "INSERT table VALUES" query. The table needs to have the fields id, categories.category, and categories.values.

I need categories to be an array holding each category together with its corresponding values. In the end the table should look roughly like the query below, except that I need just one row per id, with the corresponding category fields nested under it and properly named:

SELECT 123 as id, (["fruit"], ["apple", "banana"]) as category
UNION ALL (SELECT 123 as id, (["animal"], ["cat"]) as category)
UNION ALL (SELECT 123 as id, (["plant"], ["tree", "bush", "rose"]) as category)

I'm not really sure how to format the "INSERT" query to get the desired result. Can anyone help?

Mikhail Berlyant · Accepted Answer

You can use the query below, with your dictionary text embedded in it:

#standardSQL
WITH data AS (
SELECT '''
  {
  'id': 123, 
  'categories': [
      {'category': 'fruit', 'values': ['apple', 'banana']}, 
      {'category': 'animal', 'values': ['cat']},
      {'category': 'plant', 'values': ['tree', 'bush', 'rose']}
    ]
  } 
  ''' dict
)
SELECT 
  JSON_EXTRACT_SCALAR(dict, '$.id') AS id,
  ARRAY(
    SELECT AS STRUCT 
      JSON_EXTRACT_SCALAR(cat, '$.category') AS category,
      ARRAY(
        SELECT TRIM(val, '"')
        FROM UNNEST(JSON_EXTRACT_ARRAY(cat, '$.values')) val
      ) AS `values`
    FROM UNNEST(JSON_EXTRACT_ARRAY(dict, '$.categories')) cat
  ) AS categories 
FROM data

which produces the result below:

Row id  categories.category categories.values    
1   123 fruit               apple    
                            banana   
        animal              cat  
        plant               tree     
                            bush     
                            rose
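
The query above only selects the nested row. To go from the Python dictionary to an actual insert, one option is to serialize the dictionary with json.dumps and wrap the same extraction logic in an INSERT ... SELECT. The following is a minimal sketch under stated assumptions, not a definitive implementation: the table name my_dataset.my_table, the helper build_insert, the @payload parameter name, and the INT64 type of the id column are all hypothetical and do not come from the question.

```python
import json

# Hypothetical target table (assumption); replace with your own dataset.table.
TABLE = "my_dataset.my_table"

# Same extraction logic as the query above, wrapped in INSERT ... SELECT.
# The dictionary arrives as a JSON string through the named parameter @payload
# instead of being pasted into the SQL text.
INSERT_TEMPLATE = """
INSERT INTO `{table}` (id, categories)
SELECT
  -- Assumes the id column is INT64; drop the CAST if it is a STRING column.
  CAST(JSON_EXTRACT_SCALAR(@payload, '$.id') AS INT64) AS id,
  ARRAY(
    SELECT AS STRUCT
      JSON_EXTRACT_SCALAR(cat, '$.category') AS category,
      ARRAY(
        SELECT TRIM(val, '"')
        FROM UNNEST(JSON_EXTRACT_ARRAY(cat, '$.values')) AS val
      ) AS `values`
    FROM UNNEST(JSON_EXTRACT_ARRAY(@payload, '$.categories')) AS cat
  ) AS categories
"""


def build_insert(record, table=TABLE):
    """Return (sql, payload): the INSERT statement and the JSON-encoded record."""
    sql = INSERT_TEMPLATE.format(table=table)
    payload = json.dumps(record)  # double-quoted, valid JSON text
    return sql, payload


record = {
    "id": 123,
    "categories": [
        {"category": "fruit", "values": ["apple", "banana"]},
        {"category": "animal", "values": ["cat"]},
        {"category": "plant", "values": []},
    ],
}

sql, payload = build_insert(record)
print(sql)
print(payload)
```

To run the statement with the google-cloud-bigquery client, the payload would typically be bound as a STRING query parameter, for example bigquery.ScalarQueryParameter("payload", "STRING", payload) inside a bigquery.QueryJobConfig(query_parameters=[...]) that is passed to client.query(sql, job_config=...). Binding the JSON as a parameter avoids quoting problems when category names or values contain quote characters.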
