Pandas not replacing character in large dataframe

Question

When I create a small test dataframe this code works fine, but when attempting to use it after importing a large excel file it is not substituting the character.

import pandas as pd
df = pd.DataFrame({'A':[1,2,3],
                    'B':[4,5,6],
                    'C':['`f;','d:','sda`sd'],
                    'D':['s','d;','d;p`'],
                    'E':[5,3,6],
                    'F':[7,4,3]})
df.replace({'`':''}, regex=True)

The results are as expected:

    A   B   C   D   E   F
0   1   4   f;  s   5   7
1   2   5   d:  d;  3   4
2   3   6   sdasd   d;p 6   3

However when I load up a large excel file:

import pandas as pd
excel_file = f'C:	estfile.xlsx'
df = pd.read_excel(excel_file,sheet_name='Details', dtype=str)
df.iloc[20831].loc['Group Number']

Result:

'008513L-0005 `'

Then run the replace:

df.replace({'`':''}, regex=True)
df.iloc[20831].loc['Group Number']

Result:

'008513L-0005 `'

Pandas not replacing character in large dataframe

Answers (1)

Related Questions