Reputation: 33
So I have a DataFrame in which I want to combine some rows via a groupby.
Sample DF:
col_a col_b col_c col_e col_f
0 1 0 1 -1.0 2
1 1 1 3 0.0 3
2 1 2 4 NaN 3
3 2 0 3 4.0 6
4 3 0 3 4.0 2
And what I want the output to look like is this...
df.groupby('col_a')
col_a, col_c ...col_f
1 {0: 1, 1: 3, 2:4} {0:2,1:3,2:3}
2 .... ....
3 .... ....
Basically: group by col_a, then aggregate the values of col_c through col_f into dictionaries, where col_b supplies the dictionary keys.
I'm not sure whether there's a way to do this with groupby and some kind of agg function, or whether I'm resigned to writing a Python function that iterates over every row, perhaps with .apply. Ideas?
Edit:
Original:
col_a col_b col_c col_e col_f
0 1 A 1 -1.0 2
1 1 B 3 0.0 3
2 1 C 4 NaN 3
3 2 A 3 4.0 6
4 3 A 3 4.0 2
Desired:
col_a, col_c ...col_f
1 {A: 1, B: 3, C:4} {A:2,B:3,C:3}
2 .... ....
3 {A:3} {A:2}
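For concreteness, the row-iteration fallback I mentioned would look something like this (a rough sketch using the edited sample data above; I'd rather avoid it if groupby can do the job):

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "col_a": [1, 1, 1, 2, 3],
    "col_b": ["A", "B", "C", "A", "A"],
    "col_c": [1, 3, 4, 3, 3],
    "col_e": [-1.0, 0.0, np.nan, 4.0, 4.0],
    "col_f": [2, 3, 3, 6, 2],
})

value_cols = ["col_c", "col_e", "col_f"]

# Build one dict per (col_a group, value column), keyed by col_b.
out = {}
for _, row in df.iterrows():
    group = out.setdefault(row["col_a"], {c: {} for c in value_cols})
    for c in value_cols:
        group[c][row["col_b"]] = row[c]

# Rows keyed by col_a, columns col_c..col_f, cells holding the dicts.
result = pd.DataFrame.from_dict(out, orient="index")
result.index.name = "col_a"
```

This works, but iterating rows with iterrows is slow on anything non-trivial, hence the question.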
Upvotes: 1
Views: 743
Reputation: 375415
I don't think you want to do this, rarely is there a need for a DataFrame of dicts. You can do all the same operations (and more) using a DataFrame with these as index/columns in a MultiIndex:
In [11]: res = df.set_index(["col_a", "col_b"])
In [12]: res
Out[12]:
col_c col_e col_f
col_a col_b
1 0 1 -1.0 2
1 3 0.0 3
2 4 NaN 3
2 0 3 4.0 6
3 0 3 4.0 2
Now you can index into the DataFrame by col_a, col_b, and any other column (as if it were a dict):
In [13]: res.loc[(1, 2), "col_c"]
Out[13]: 4
In [14]: res.loc[1, "col_c"]
Out[14]:
col_b
0 1
1 3
2 4
Name: col_c, dtype: int64
etc. This is going to be more efficient than using a dict inside a DataFrame...
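That said, if you genuinely need the dict-of-dicts layout from the question, one sketch is a groupby with a per-group Series of dicts (column names taken from the question; this keeps object-dtype columns, with all the downsides that implies):

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "col_a": [1, 1, 1, 2, 3],
    "col_b": ["A", "B", "C", "A", "A"],
    "col_c": [1, 3, 4, 3, 3],
    "col_e": [-1.0, 0.0, np.nan, 4.0, 4.0],
    "col_f": [2, 3, 3, 6, 2],
})

value_cols = ["col_c", "col_e", "col_f"]

# For each col_a group, build one dict per value column, keyed by col_b.
result = df.groupby("col_a").apply(
    lambda g: pd.Series({c: dict(zip(g["col_b"], g[c])) for c in value_cols})
)
```

result.loc[1, "col_c"] is then the plain dict {'A': 1, 'B': 3, 'C': 4}, but you lose vectorized operations on those columns, which is why I'd reach for the MultiIndex version first.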
Upvotes: 1