Best Data Structure For Text Processing

Question

Given a sentence like this, and i have the data structure (dictionary of lists, UNIQUE keys in dictionary):

{'cat': ['feline', 'kitten'], 'brave': ['courageous', 'fearless'], 'little': ['mini']}

A courageous feline was recently spotted in the neighborhood protecting her mini kitten

How would I efficiently process these set of text to convert the word synonyms of the word cat to the word CAT such that the output is like this:

A fearless cat was recently spotted in the neighborhood protecting her little cat

The algorithm I want is something that can process the initial text to convert the synonyms into its ROOT word (key inside dictionary), the keywords and synonyms would get longer as well. Hence, first, I want to inquire if the data structure I am using is able to perform efficiently and whether there are more efficient structure. For now, I am only able to think of looping through each list inside the dictionary, searching for the synonym's then mapping it back to its keyword

edit: Refined the question

Best Data Structure For Text Processing

Answers (1)

Related Questions