Alexander Borochkin
Alexander Borochkin

Reputation: 4611

How to delete rows in python pandas DataFrame using regular expressions?

I have a pattern:

patternDel = "( \\((MoM|QoQ)\\))";

And I want to delete all rows in pandas dataframe where column df['Event Name'] matches this pattern. Which is the best way to do it? There are more than 100k rows in dataframe.

Upvotes: 38

Views: 34350

Answers (1)

Bob Haffner
Bob Haffner

Reputation: 8483

str.contains() returns a Series of booleans that we can use to index our frame

patternDel = "( \\((MoM|QoQ)\\))"
filter = df['Event Name'].str.contains(patternDel)

I tend to keep the things we want as opposed to delete rows. Since filter represents things we want to delete we use ~ to get all the rows that don't match and keep them

df = df[~filter]

Upvotes: 66

Related Questions