Reputation: 4768
i have a dataframe :
import pandas as pd
data = {'day_bucket': ['2011-01-21', '2011-01-22', '2011-01-23', '2011-01-24'], 'label': ['birds', 'birds', 'birds', 'birds'], 'numeric_value': [4, 0, 7, 3]}
df = pd.DataFrame(data)
day_bucket label numeric_value
0 2011-01-21 birds 4
1 2011-01-22 birds 0
2 2011-01-23 birds 7
3 2011-01-24 birds 3
I want to pivot this dataframe so that i have a column birds
with the values below it.
pd.pivot_table(df, values='numeric_value', index='day_bucket',columns='label')
gives:
label birds
day_bucket
2011-01-21 4
2011-01-22 0
2011-01-23 7
2011-01-24 3
what should i do the keep the index? The result will look like:
day_bucket birds
0 2011-01-21 4
1 2011-01-22 0
2 2011-01-23 7
3 2011-01-24 3
Upvotes: 8
Views: 19025
Reputation: 294218
set_index
with append
df.set_index(['day_bucket', 'label'], append=True) \
.rename_axis([None, None, None]).squeeze().unstack()
birds
0 2011-01-21 4
1 2011-01-22 0
2 2011-01-23 7
3 2011-01-24 3
Upvotes: 6
Reputation: 682
just for supplement:
pt2 = pt.rename_axis(None, axis=1).reset_index()
print(pt2)
day_bucket birds
0 2011-01-21 4
1 2011-01-22 0
2 2011-01-23 7
3 2011-01-24 3
the dataframe used for it is:
df = pd.DataFrame({'day_bucket': ['2011-01-21', '2011-01-22', '2011-01-23', '2011-01-24'],
'label': ['birds', 'birds', 'birds', 'birds'],
'num_value': [4, 0, 7, 3]})
pt = pd.pivot_table(df,
values='num_value',
index='day_bucket',
columns='label',
aggfunc=np.sum)
print(pt, '\n')
label birds
day_bucket
2011-01-21 4
2011-01-22 0
2011-01-23 7
2011-01-24 3
Upvotes: 4
Reputation: 4768
In the meantime, I also came up with a result
pd.pivot_table(df, values='numeric_value',
index=[df.index.values,'day_bucket'],
,columns='label').reset_index('day_bucket')
label day_bucket mortality_birds
0 2011-01-21 4
1 2011-01-22 0
2 2011-01-23 7
3 2011-01-24 3
Upvotes: 7