pandas multiply values by scaling factor in another DataFrame

Question

I have some data with an id and val columns in DataFrame df and some scaling factors in DataFrame scaling such that it has a scaling factor for each value in id

df = pd.DataFrame(data=dict(id=['a', 'a', 'a', 'b', 'b', 'c'], val=[1, 2, 3, 10, 11, 100]))
scaling = pd.DataFrame(data=dict(id=['a', 'b', 'c'], scaling=[1, 0.1, 0.01]))

These look like this:

In[23]: df
Out[24]: 
  id  val
0  a    1
1  a    2
2  a    3
3  b   10
4  b   11
5  c  100
scaling
Out[25]: 
  id  scaling
0  a     1.00
1  b     0.10
2  c     0.01

I now want to multiply the data in df by the scaling factors. I can do it this way but this feels awkward and is possibly inefficient?

df['val'] = df['val'] * df.merge(scaling, left_on='id', right_on='id')['scaling']

Is there a better way of applying the factors?

Quang Hoang · Accepted Answer

You can use map with set_index on scaling:

df['val'] * df['id'].map(scaling.set_index('id').scaling)

Output:

0    1.0
1    2.0
2    3.0
3    1.0
4    1.1
5    1.0
dtype: float64

pandas multiply values by scaling factor in another DataFrame

Answers (2)

Related Questions