Using numpy where with applymap in pandas

Question

I am trying to use numpy where in conjunction with applymap in pandas.

Sample DF:

f = [[1,5],[20,40],[100,21],[15,19],[-46,101]]
test = pd.DataFrame(f,columns=["A","B"])
test

OP:

    A   B
0   1   5
1   20  40
2   100 21
3   15  19
4   -46 101

Condition is, if a column value is greater than 50 or less than 25 it should be changed to 0 or it should remain as it is.

Code:

test = test.applymap(lambda x:np.where((test[x]>50)| (test[x]<25), 0,test[x]) )
test

Error:

    KeyError                                  Traceback (most recent call last)
~\AppData\Local\Continuum\miniconda\lib\site-packages\pandas\core\indexes\base.py in get_loc(self, key, method, tolerance)
   2896             try:
-> 2897                 return self._engine.get_loc(key)
   2898             except KeyError:

pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()

pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()

pandas\_libs\hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item()

pandas\_libs\hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item()

KeyError: (1, 'occurred at index A')

Any suggestions will be helpful

ansev · Accepted Answer

Use DataFrame.mask:

test.mask(test.lt(25)|test.gt(50),0)

or DataFrame.where

test.where(test.ge(25) & test.le(50),0)

Output

Using DataFrame.applymap we could do:

test.applymap(lambda x: 0 if (x>50) or (x<25) else x)

but this could become slow for large data frames

Solution with np.where

import numpy as np
pd.DataFrame(np.where((test<25)|(test>50),0,test),index = test.index,columns = test.columns)

EDIT

mean_test = test.mean()
limit = 5
df_filtered = test.mask(test.gt(mean_test.add(limit))|
                        test.lt(mean_test.sub(limit)),0)
print(df_filtered)
    A   B
0   0   0
1  20  40
2   0   0
3  15   0
4   0   0

Using numpy where with applymap in pandas

Answers (2)

Related Questions