Matts

Reputation: 1341

Convert pandas series of json objects to DataFrame

I'm trying to figure out how to convert a pandas series of JSON objects, taken from a slice of a DataFrame, into a new DataFrame with the keys as columns. I've tried all the similar solutions I could find, but nothing seems to work: it either returns the same series or raises errors. Some of the rows have different keys and the number of elements varies, but there's a limited amount of variation, so each key can have its own column. Is there a way to convert this type of series into a DataFrame?

1       {'length': '27 mm', 'style': 'Short', 'width': '9 mm'}
2       {'length': '51 mm', 'style': 'Long', 'width': '9 mm'}
5       {'length': '29 mm', 'style': 'Medium', 'width': '9 mm','id': '16684'}
6       {'width': '7 mm', 'length': '29 mm', 'style': 'Medium','mfg': '45-163'}
8       {'width': '8 mm', 'length': '33 mm', 'style': 'Medium'}

Upvotes: 1

Views: 416

Answers (2)

AdagioMolto

Reputation: 192

Assuming you have valid JSON (e.g. double quotes, valid syntax, etc.; see the discussion in Viacheslav Zhukov's answer) in your series, you can also use

import json
pd.DataFrame(
    my_json_series.apply(json.loads).to_list()
    , index=my_json_series.index  # optional if you want to keep the index
)

This avoids the for loop/list comprehension and is much faster. The steps are essentially:

  1. convert each JSON to dict
  2. concat all dicts in a list/make the series a list
  3. make a DataFrame from the list

Also, I don't get a proper MultiIndex with the proposed for loop.
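Applied end to end to data like the question's, the steps above look as follows (a minimal sketch; the series here is a stand-in I built with double-quoted, valid JSON strings, which the question's single-quoted data would first need):

```python
import json
import pandas as pd

# Stand-in series mirroring the question's data, but with valid double-quoted JSON
my_json_series = pd.Series({
    1: '{"length": "27 mm", "style": "Short", "width": "9 mm"}',
    2: '{"length": "51 mm", "style": "Long", "width": "9 mm"}',
    5: '{"length": "29 mm", "style": "Medium", "width": "9 mm", "id": "16684"}',
})

# 1. parse each JSON string to a dict, 2. collect them into a list,
# 3. build one DataFrame from the list in a single call
out = pd.DataFrame(
    my_json_series.apply(json.loads).to_list(),
    index=my_json_series.index,  # optional: keep the original index
)
print(out)
```

Rows missing a key (here, `id`) simply get NaN in that column.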

In [110]: %%timeit
     ...: my_out_df1 = pd.DataFrame(
     ...:     my_json_series.apply(json.loads).to_list()
     ...:     , index=my_json_series.index  # optional if you want to keep the index
     ...: )
362 ms ± 5.1 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

In [111]: %%timeit
     ...: my_out_df_2 = pd.concat(
     ...:     [pd.DataFrame(json.loads(s), index=[ind])
     ...:      for ind, s in my_json_series.items()],
     ...:     sort=False
     ...: )
14.9 s ± 310 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

I should add that I tested with a completely different series than yours: mine had 4 keys in each JSON and about 10,000 elements in the series.

EDIT 2: To solve the problem with single quotes you could also try eval(my_single_quote_json) instead of json.loads. This even works if your strings mix quote styles, since eval reads them as Python code. If your JSONs contain nulls, you first need to define a variable null (e.g. null = None). Of course, using eval is never ideal, as it executes arbitrary code, but it may help.
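A minimal sketch of this eval workaround on single-quoted strings (here null is supplied to eval via an explicit globals dict, which is more reliable inside a comprehension than a module-level variable; only do this with strings you trust, since eval runs them as code):

```python
import pandas as pd

# Single-quoted strings, as in the question; one row contains a bare null
s = pd.Series({
    1: "{'length': '27 mm', 'style': 'Short', 'width': '9 mm'}",
    2: "{'length': '51 mm', 'style': null}",
})

# eval each string as a Python dict literal; the globals dict maps null -> None
parsed = pd.DataFrame(
    [eval(x, {"null": None}) for x in s],
    index=s.index,
)
print(parsed)
```

For dict/list literals without nulls, the stricter stdlib ast.literal_eval would be a safer drop-in for eval here.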

Upvotes: 0

Viacheslav Zhukov

Reputation: 1350

Well, I would suggest:

  1. convert every JSON to a dict
  2. convert every dict to a DataFrame
  3. concat all the DataFrames, preserving indices

The data is in series in this example. Note that the str.replace call also turns ' into " so json.loads can parse the strings:

import json
pd.concat(
    [pd.DataFrame(json.loads(s), index=[ind])
     for ind, s in series.str.replace('\'', '"').items()],
    sort=False
)
    length  style   width   id      mfg
1   27 mm   Short   9 mm    NaN     NaN
2   51 mm   Long    9 mm    NaN     NaN
5   29 mm   Medium  9 mm    16684   NaN
6   29 mm   Medium  7 mm    NaN     45-163
8   33 mm   Medium  8 mm    NaN     NaN

Upvotes: 2
