Merging a groupby object with another data frame

Question

I have a data frame representing the sales of an item:

import pandas as pd

data = {'id': [1,1,1,1,2,2], 'week': [1,2,2,3,1,3], 'quantity': [1,2,4,3,2,2]}
df_sales = pd.DataFrame(data)
>>> df_sales
   id  week  quantity
0   1     1         1
1   1     2         2
2   1     2         4
3   1     3         3
4   2     1         2
5   2     3         2

I have another data frame that represents the available weeks:

data = {'week': [1,2,3]}
df_week = pd.DataFrame(data)
>>> df_week
   week
0     1
1     2
2     3

I want to group by id and week and compute the mean quantity, which I do as follows:

df = df_sales.groupby(by=['id', 'week'], as_index=False).mean()
>>> df
   id  week  quantity
0   1     1       1.0
1   1     2       3.0
2   1     3       3.0
3   2     1       2.0
4   2     3       2.0

However, I want to fill the missing week values (present in df_week) with 0, such that the output is:

>>> df
   id  week  quantity
0   1     1         1
1   1     2         3
2   1     3         3
3   2     1         2
4   2     2         0
5   2     3         2

Is it possible to merge the groupby with the df_week data frame?

Shubham Sharma · Accepted Answer

We can reindex after the groupby, using a MultiIndex that contains every combination of id and week:

# group and aggregate
df  = df_sales.groupby(['id', 'week']).mean()

# define new MultiIndex
idx = pd.MultiIndex.from_product([df.index.levels[0], df_week['week']])

# reindex with fill_value=0
df  = df.reindex(idx, fill_value=0).reset_index()

print(df)

   id  week  quantity
0   1     1       1.0
1   1     2       3.0
2   1     3       3.0
3   2     1       2.0
4   2     2       0.0
5   2     3       2.0
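
Since the question asks specifically about merging, here is a minimal sketch of a merge-based alternative to the reindex approach. It is not part of the answer above. The names means, grid and result are made up for illustration, and how='cross' assumes pandas 1.2 or newer.

```python
import pandas as pd

df_sales = pd.DataFrame({
    'id': [1, 1, 1, 1, 2, 2],
    'week': [1, 2, 2, 3, 1, 3],
    'quantity': [1, 2, 4, 3, 2, 2],
})
df_week = pd.DataFrame({'week': [1, 2, 3]})

# mean quantity per (id, week), as in the question
means = df_sales.groupby(['id', 'week'], as_index=False)['quantity'].mean()

# every (id, week) combination: cross join of the distinct ids with df_week
# (how='cross' is an assumption about the pandas version: 1.2 or newer)
grid = df_sales[['id']].drop_duplicates().merge(df_week, how='cross')

# left-merge the means onto the full grid; pairs with no sales come back
# as NaN and are then filled with 0
result = grid.merge(means, on=['id', 'week'], how='left').fillna({'quantity': 0})

print(result)
```

This should give the same six rows as the reindex version. The left merge keeps every row of grid, so the (id=2, week=2) pair survives even though it has no sales.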
