How do I make spelling corrector using txt file

Question

Here's my txt file, called it replacer.txt

keyword_origin, keyword_destinantion
topu,topup
atmstrbca,atm bca

Here's what I want

id keyword
1  transfer atmstrbca
2  topu bank
3  topup bank

My expected output

id keyword
1  transfer atm bca
2  topup bank
3  topup bank

What I did is

df['keyword'].str.replace("atmstrbca","atm bca")
df['keyword'].str.replace("topu","topup")

The output is

id keyword
1  transfer atm bca
2  topup bank
3  topupp bank

My Idea is using text replacer.txt to do this since the list is more tahn 100 keyword

jezrael · Accepted Answer

Create dictionary from first file and split values by whitespace and use get for replace:

d = dict(zip(df1.keyword_origin, df1.keyword_destinantion))
#alternative
#d = df1.set_index('keyword_origin')['keyword_destinantion'].to_dict()
df2['keyword'] = df2['keyword'].apply(lambda x: ' '.join([d.get(y, y) for y in x.split()]))
print (df2)
   id           keyword
0   1  transfer atm bca
1   2        topup bank
2   3        topup bank

How do I make spelling corrector using txt file

Answers (2)

Related Questions