Apply group specific function to groups in Pandas

Question

I'm trying to figure out the best way to apply a function to groups within a Pandas dataframe where the function depends on the group.

Say I have the following dataframe:

>>> import numpy as np
>>> import pandas as pd
>>> df=pd.DataFrame(np.random.randint(50,200,9), columns=['Value'])
>>> df['Year']=[2001,2002,2003]*3
>>> df['Location']=['A','A','A','B','B','B','C','C','C']
>>> df.set_index(['Location','Year'], inplace=True)
>>> df
               Value
Location Year       
A        2001    134
         2002    162
         2003    108
B        2001     59
         2002     52
         2003    124
C        2001    148
         2002    162
         2003     66
>>>

And that I have the following dictionary of values, specific to each year:

>>> YearDict={2001:1.3, 2002:1.2, 2003:1.1}
>>> YearDict
{2001: 1.3, 2002: 1.2, 2003: 1.1}

What would be the best way to multiply the 'Value' column in my dataframe by the year-specific value in my dictionary?

Currently I do something like this:

>>> df.reset_index(inplace=True)
>>> def f(row):
...     return row['Value']*YearDict[row['Year']]
... 
>>> 
>>> df.apply(f, axis=1)
0     84.5
1    210.0
2    201.3
3    248.3
4     94.8
5    177.1
6    140.4
7    218.4
8     68.2
dtype: float64
>>>

Is this the best approach? Is there a method that does not require resetting the dataframe index?

Marius · Accepted Answer

You can map a function over the index. Each row in the dataframe has a (Location, Year) tuple as its index, so the year is the second element of each tuple (t[1]) and you can do:

df.index.map(lambda t: YearDict[t[1]])
Out[11]: array([ 1.3,  1.2,  1.1,  1.3,  1.2,  1.1,  1.3,  1.2,  1.1])

So multiplying by these values looks like:

year_mults = df.index.map(lambda t: YearDict[t[1]])

df['Value'] * year_mults
Out[13]: 
Location  Year
A         2001    247.0
          2002    160.8
          2003    119.9
B         2001    102.7
          2002    182.4
          2003    202.4
C         2001     71.5
          2002    178.8
          2003    211.2
Name: Value, dtype: float64
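
As a variation on the same idea, here is a minimal sketch that looks up the 'Year' level by name instead of by tuple position. It assumes a reasonably recent pandas, where Index.map accepts a dictionary and to_numpy() is available. The fixed 'Value' numbers are made up for illustration, since the question's data is random.

```python
import pandas as pd

# Illustrative, deterministic stand-in for the question's random data.
df = pd.DataFrame(
    {
        "Value": [100, 110, 120, 130, 140, 150, 160, 170, 180],
        "Year": [2001, 2002, 2003] * 3,
        "Location": ["A", "A", "A", "B", "B", "B", "C", "C", "C"],
    }
).set_index(["Location", "Year"])

YearDict = {2001: 1.3, 2002: 1.2, 2003: 1.1}

# Extract the 'Year' level of the MultiIndex and look each year up in the dict.
year_mults = df.index.get_level_values("Year").map(YearDict)

# Convert to a NumPy array so the multiplication is positional,
# one multiplier per row, in row order.
scaled = df["Value"] * year_mults.to_numpy()
```

The result keeps the original (Location, Year) index, so no reset_index call is needed. Referring to the level by name also avoids depending on the order of the index levels.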
