Pandas - Replace substrings from a column if not numeric

Question

I have a list of suffixes I want to remove in a list, say suffixes = ['inc','co','ltd']. I want to remove these from a column in a Pandas dataframe, and I have been doing this: df['name'] = df['name'].str.replace('|'.join(suffixes), '').

This works, but I do NOT want to remove the suffice if what remains is numeric. For example, if the name is 123 inc, I don't want to strip the 'inc'. Is there a way to add this condition in the code?

Rakesh · Accepted Answer

Using Regex --> negative lookbehind.

Ex:

suffixes = ['inc','co','ltd']

df = pd.DataFrame({"Col": ["Abc inc", "123 inc", "Abc co", "123 co"]})
df['Col_2'] = df['Col'].str.replace(r"(?


Output:
       Col    Col_2
0  Abc inc      Abc
1  123 inc  123 inc
2   Abc co      Abc
3   123 co   123 co

Pandas - Replace substrings from a column if not numeric

Answers (2)

Related Questions