How to detect negative sentences with sentimentr or qdap

Question

I am trying to extract (and eventually categorise) sentences from medical reports that contain negatives. An example is something like:

samples<-c('There is no evidence of a lump','Neither a contusion nor a scar was seen','No inflammation was evident','We found generalised badness here')

I am trying to use the sentimentr package as it seems it is able to detect negators. Is there a way of just using the detection of negators so that negative sentences are extracted out (preferably into a new dataframe for further work)?

Using polarity from qdap just gives a summary statistic, and it takes amplifiers and de-amplifiers into account, which I don't want to include, e.g.:

polarity(samples,negators = qdapDictionaries::negation.words)

      all total.sentences total.words ave.polarity sd.polarity stan.mean.polarity
1 all               4          24        0.213       0.254              0.842

I tried the sentimentr package as follows:

extract_sentiment_terms(MyColonData$Endo_ResultText,polarity_dt = lexicon::hash_sentiment_jockers, hyphen = "")

and this gives me neutral, negative and positive words:

   element_id sentence_id     negative positive
1:          1           1                      
2:          2           1         scar         
3:          3           1 inflammation  evident
4:          4           1      badness    found

but I am really looking for sentences that contain negators only without interpretation of the sentiment so that the output is:

element_id sentence_id                          negative                    positive
1:          1           1     There is no evidence of a lump                 
2:          2           1     Neither a contusion nor a scar was seen       
3:          3           1     No inflammation was evident
4:          4           1                                               We found generalised badness here

amrrs · Accepted Answer

I think you want to classify each sentence as positive or negative based only on the presence of a negator, so extracting the negators from the lexicon should help. In lexicon::hash_valence_shifters, the negators are the rows with y == 1.

samples<-c('There is no evidence of a lump','Neither a contusion nor a scar was seen','No inflammation was evident','We found generalised badness here')


polarity <- data.frame(text = samples, pol = NA)

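# Negators from the lexicon; the \\b word boundaries stop "no" from matching inside words such as "normal"
negators <- lexicon::hash_valence_shifters[y == 1]$x

polarity$pol <- ifelse(grepl(paste0('\\b(', paste(negators, collapse = '|'), ')\\b'), tolower(samples), perl = TRUE), 'Negative', 'Positive')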

polarity

                                     text      pol
1          There is no evidence of a lump Negative
2 Neither a contusion nor a scar was seen Negative
3             No inflammation was evident Negative
4       We found generalised badness here Positive

Formatted output:

reshape2::dcast(polarity,text~pol) 



                                     text Negative Positive
1 Neither a contusion nor a scar was seen Negative     
2             No inflammation was evident Negative     
3          There is no evidence of a lump Negative     
4       We found generalised badness here      Positive
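
If the goal is a separate data frame holding only the negated sentences, something along these lines might work. This is a minimal base R sketch, not the sentimentr or qdap way of doing it: the `negators` vector is a short hand-picked list chosen for illustration (an assumption; you could swap in the negators from lexicon::hash_valence_shifters), and the fifth sentence is a made-up example added to show why whole-word matching matters.

```r
# Sentences from the question, plus one hypothetical sentence that
# contains "no" only inside another word ("normal")
samples <- c('There is no evidence of a lump',
             'Neither a contusion nor a scar was seen',
             'No inflammation was evident',
             'We found generalised badness here',
             'The mucosa looked normal')

# Hand-picked negators for illustration (assumption, not the full lexicon list)
negators <- c('no', 'not', 'neither', 'nor', 'never', 'none', 'without')

# \\b anchors every negator to whole words only
pattern <- paste0('\\b(', paste(negators, collapse = '|'), ')\\b')

# TRUE for sentences containing at least one negator
has_negator <- grepl(pattern, samples, ignore.case = TRUE, perl = TRUE)

# Which negators were found in each sentence
lower <- tolower(samples)
found <- regmatches(lower, gregexpr(pattern, lower, perl = TRUE))

# New data frame with the negated sentences only
negated <- data.frame(
  element_id = which(has_negator),
  text       = samples[has_negator],
  negators   = vapply(found[has_negator], paste, character(1), collapse = ', '),
  stringsAsFactors = FALSE
)

negated
```

Here `negated` keeps the first three sentences, with `element_id` pointing back to their position in `samples`. The 'normal' sentence is left out because of the word boundaries; with a plain substring match it would have been flagged as negative.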
