Identify missing words between two sentences using similarity algorithms python

Question

I am new when it comes to NLP. Overall what I am trying to do is: given two sentences A and B, I want to figure out which words from B are completely semantically different from words in A. Essentially I need to calculate the similarity between two sentences and figure out which words (from B) have low similarity and print them. I computed the cosine similarity and it doesn't give much info about the matrix similarity.

lets say A="Lung cancer is a malignant lung tumour " and B = "Lung cancer is a lung disease",

since disease and tumour are semantically similar, the word(s) with small similarity score in A would be 'malignant' as it doesn't match with any word in B

How can I do that? Maybe I am looking at this completely wrong. But I need to find the words in A that are not in B and take into consideration semantically similar words.

Identify missing words between two sentences using similarity algorithms python

Answers (1)

Related Questions