apply function to each word of every row in pandas dataframe column

Question

I have a sample dataframe as follows:

df = pd.DataFrame({
'notes': pd.Series(['speling', 'korrecter']), 
'name': pd.Series(['Walter White', 'Walter White']), 
})

  name                notes
0  Walter White     This speling is incorrect
1  Walter White     Corrector should correct korrecter

I want to adapt the spell checker by Peter Norvig available here. I would then like to apply this function to every row by going over every word in the row. I was wondering how can this be done in Python Pandas context?

I would like the output as:

    name                notes
0  Walter White     This spelling is incorrect
1  Walter White     Corrector should correct corrector

Appreciate any inputs. Thanks!

jezrael · Accepted Answer

You can try this solution with str.split, but I think performance in big df can be problematic:

import pandas as pd
import numpy as np

df = pd.DataFrame({
'notes': pd.Series(['This speling is incorrect', 'Corrector should correct korrecter one']), 
'name': pd.Series(['Walter White', 'Walter White']), 
})
print df
           name                                   notes
0  Walter White               This speling is incorrect
1  Walter White  Corrector should correct korrecter one    

#simulate function correct
def correct(x):
    return x + '888'

#split column notes and apply correct
df1 = df.notes.str.split(expand=True).apply(correct)
print df1
              0           1           2             3       4
0       This888  speling888       is888  incorrect888     NaN
1  Corrector888   should888  correct888  korrecter888  one888

#remove NaN and concanecate all words together
df['notes'] = df1.fillna('').apply(lambda row: ' '.join(row), axis=1)
print df
           name                                              notes
0  Walter White             This888 speling888 is888 incorrect888 
1  Walter White  Corrector888 should888 correct888 korrecter888...

apply function to each word of every row in pandas dataframe column

Answers (2)

Related Questions