Aggregate time series from list of dictionaries (Python)

Question

I have a list of dictionaries generated as such:

all_series = []
    # loop
    ...
    all_series.append({"name": a.name, "sector": a.sector, "ts":a.ts})
    ...

name and sector are strings, ts is a pandas time series indexed by date.

Summing all time series together irrespective of name/sector is easy:

reduce(lambda x, y: x.add(y, fill_value=0), [a["ts"] for a in all_series])

Now I want to do this summing, but grouped by sector - i.e. I'd like to get one summed time series by sector and stored in some sensible way. I'm able to easily do this for one hard-coded sector of choice, but can you think of a good way of doing this in a more flexible way?

I guess ideally I get one data frame back, with one column per summed sector?

Oli · Accepted Answer

The suggested answer didn't work after all, because it didn't account for different lengths and start/end dates of the individual time series.

This is how I solved it in the end:

pd.concat({(a.sector, a.name): a.ts for i, a in all_series.iterrows()}, axis=1).groupby(axis=1, level=0).sum()

Thanks for the inspiration!

Aggregate time series from list of dictionaries (Python)

Answers (2)

Related Questions