Pandas sum of multi-indexed columns

Question

If I have a data frame with nested headers like this:

              John          Joan
         Smith,   Jones,    Smith,
Index1     234      432      324
Index2     2987     234      4354

...how do I create a new column that sums the values of each row? I tried df['sum']=df['John']+df['Joan'] but that resulted in this error:

ValueError: Wrong number of items passed 3, placement implies 1

piRSquared · Accepted Answer

If I understand you correctly:

...how do I create a new column that sums the values of each row?

The sum of each row is just

df.sum(axis=1)

The trick is getting to be a new column. You need to ensure the column you add has 2 levels of column heading.

df.loc[:, ('sum', 'sum')] = df.sum(axis=1)

I'm not happy with it, but it works.

         Joan   John          sum
       Smith, Jones, Smith,   sum
Index1    324    432    234   990
Index2   4354    234   2987  7575

Answers (2)