groupby and sum to calculate into new column and organized hierarchy

Question

I'm trying to groupby my dataframe into a tree format so that it sorts down in a hierarchical way. DC being the first column that funnels down into Retailer, Store Count, Product descriptions, case volume and velocity in that order. Summing the retailer column into a new column "StoreCt" that is positioned after "Retailer"

The problem I'm running into is the store counts are being duplicated.

Here is the dataframe I have

Retailer	DC	Product	Cs	Volume	Velocity
joe	ABC	bars	Cs	Cost	Velocity
joe	DFC	drinks1	Cs	Cost	Velocity
joe	DFC	drinks2	Cs	Cost	Velocity
randy	ABC	bars	Cs	Cost	Velocity
peter	DFC	drinks2	Cs	Cost	Velocity
john	XYZ	drinks	Cs	Cost	Velocity
joe	XYZ	snacks	Cs	Cost	Velocity
joe	DFC	bars2	Cs	Cost	Velocity

This is the result I want. values in the cs, volume, and velocity columns need to be unchanged

DC	Retailer	StoreCt	Product	Cs	Volume	Velocity
ABC	joe	1	bars	Cs	Cost	Velocity
	randy	1	bars	Cs	Cost	Velocity
DFC	joe	3	drinks1	Cs	Cost	Velocity
			drinks2	Cs	Cost	Velocity
			bars2	Cs	Cost	Velocity
	peter	1	drinks2	Cs	Cost	Velocity
XYZ	joe	1	snacks	Cs	Cost	Velocity
	john	1	drinks	Cs	Cost	Velocity

this is my code to get the store count, but i can't figure out how to add it into the dataframe without duplicating the values

store_count = df.groupby("Retailer").size().to_frame("StoreCt")
store_count

groupby and sum to calculate into new column and organized hierarchy

Answers (1)

Related Questions