Pandas: count occurrences that contain some words and do not contain other words

Question

I'm trying to count the entries that contain certain words but do not contain certain other words. To be clear, I want the number of matching entries for which an excluding condition is not met. Here's what I have:

 import pandas as pd
 import re

 data = pd.read_csv('rando-file')

 vague_series = pd.DataFrame([
     data['text'].str.contains('bla1|bla2', flags=re.IGNORECASE, regex=True)
     & ~data['text'].str.contains('blah3|bla4', flags=re.IGNORECASE, regex=True)
 ])

 vague_count = vague_series.columns[0].sum()

 print(vague_count)

Any attempt to count or sum has failed in this instance with an invalid syntax error. Removing the columns[0] part simply gave me 0/1 values in place of True and False.


Answers (1)
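
One likely culprit is that vague_series.columns[0] is a column label, not the column's data, so there is nothing meaningful to sum there. Wrapping the mask in pd.DataFrame is probably unnecessary: the combined condition is already a boolean Series, and summing it counts the True values. Below is a minimal sketch of that idea. The sample rows are made up to stand in for the 'text' column of the CSV, and the names has_wanted, has_unwanted and mask are only illustrative. The na=False argument is an assumption that missing text should count as "no match".

```python
import re

import pandas as pd

# Hypothetical sample data standing in for the 'text' column of the CSV.
data = pd.DataFrame({
    'text': [
        'Bla1 appears here',
        'bla2 together with bla4',
        'nothing relevant',
        'BLA2 on its own',
        None,
    ]
})

# na=False treats missing text as "no match", so both masks stay boolean.
has_wanted = data['text'].str.contains(
    'bla1|bla2', flags=re.IGNORECASE, regex=True, na=False)
has_unwanted = data['text'].str.contains(
    'blah3|bla4', flags=re.IGNORECASE, regex=True, na=False)

# Keep rows that match the wanted words and none of the excluded words.
mask = has_wanted & ~has_unwanted

# True counts as 1 when summed, so this is the number of matching rows.
vague_count = int(mask.sum())
print(vague_count)
```

If the matching rows themselves are needed rather than just the count, data[mask] should select them with the same mask.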
