search multiple keywords python

Question

How can I improve my code to search using a list of keywords in a specific column of a dataframe and return those rows that contains the value. the current code only accepts two keywords!

contain_values = df[df['tweet'].str.contains('free','news')]
contain_values.head()

Cimbali · Accepted Answer

Your code currently only returns tweets that contain 'free' and ignores 'news'. Let’s test it:

>>> df
          tweet
0    free stuff
1  newsnewsnews
2   hello world
3 another tweet
>>> df[df['tweet'].str.contains('free', 'news')]
        tweet
0  free stuff

See the documentation for .str.contains(): you can either pass a word, or a regular expression. This will work:

df[df['tweet'].str.contains('free|news|hello')]

Here I’ve added a 3rd keyword, and now the first 3 elements of my dataframe are returned:

          tweet
0    free stuff
1  newsnewsnews
2   hello world

search multiple keywords python

Answers (2)

Related Questions