python pandas aggregating non numeric types

Question

I have the following problem, I managed to create a data frame with object dtypes on some columns. In particular these would be 2d numpy arrays but they could be any non-numeric type. Now I want to pivot my dataframe. Is there a way to pass an aggregating function of my choice which works on these objects? I don't seem to be able to do it and I get the error:

GroupByError: No numeric types to aggregate

For example, say I have this dummy data frame:

date foo  bar               mat
1     a   x      [[1, 2], [3, 4]]
1     b   x      [[1, 2], [3, 4]]
1     a   y      [[1, 2], [3, 4]]
1     b   y      [[1, 2], [3, 4]]
2     a   x      [[4, 5], [6, 7]]
2     b   x      [[4, 5], [6, 7]]
2     a   y      [[4, 5], [6, 7]]
2     b   y      [[4, 5], [6, 7]]

and I want to have a new data frame of the type:

dd.pivot_table(values=['mat'], rows=['date'], cols=['foo'], aggfunc= ??)

where my 2-d arrays will be an element-by-element sum of the arrays with same value in the 'foo' columns. How can I do that? If not possible, is it possible to pick the first occurrence of the 'mat' element in the list of arrays with same 'foo'? Thanks

added the desired output:

date    a               b           
1    [[2, 4], [6, 8]]  [[8, 10], [12, 14]]
2    [[2, 4], [6, 8]]  [[8, 10], [12, 14]]

python pandas aggregating non numeric types

Answers (1)

Related Questions