Extracting sentence from a dataframe with description column based on a phrase

Question

I have a dataframe with a 'description' column with details about the product. Each of the description in the column has long paragraphs. Like

"This is a superb product. I so so loved this superb product that I wanna gift to all. This is like the quality and packaging. I like it very much"

How do I locate/extract the sentence which has the phrase "superb product", and place it in a new column?

So for this case the result will be expected output

I have used this,

searched_words=['superb product','SUPERB PRODUCT']


print(df['description'].apply(lambda text: [sent for sent in sent_tokenize(text)
                           if any(True for w in word_tokenize(sent) 
                                     if stemmer.stem(w.lower()) in searched_words)]))

The output for this is not suitable. Though it works if I put just one word in " Searched Word" List.

ChootsMagoots · Accepted Answer

Assuming the paragraphs are neatly formatted into sentences with ending periods, something like:

for index, paragraph in df['column_name'].iteritems(): for sentence in paragraph.split('.'): if 'superb prod' in sentence: print(sentence) df['extracted_sentence'][index] = sentence

This is going to be quite slow, but idk if there's a better way.

Extracting sentence from a dataframe with description column based on a phrase

Answers (2)

Related Questions