Find specific string objects in text

Question

Let's say I have free text containing information about specific cars, car brands and other automotive-related details. I want to extract this information from the text according to a fixed template:

Brand:
Model:
Color:

For example: "Mike drove away in a black Mercedes with four other people. Moreover, he also owns a BMW M3 in Europe."

Template 1: Brand: Mercedes, Model: -, Color: Black

Template 2: Brand: BMW, Model: M3, Color: -

What is the best way to tackle this in Python? Although I have some knowledge of NLTK, POS tagging and NP-chunking, I think it could be done in an easier way if I can recognize specific terms taken from, for example, a (nested) dictionary that contains lists. As such, it would behave like a controlled vocabulary.

Hopefully someone has a nice example or can point me in the right direction. Thanks.
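To make the controlled-vocabulary idea concrete, here is a minimal sketch using only the standard library. The vocabulary contents, the function name `extract_cars` and the `window` parameter are all hypothetical, chosen for illustration. It also assumes that a model directly follows its brand and that a color appears within a few words before the brand in the same sentence, which will not hold for all real text.

```python
import re

# Hypothetical controlled vocabulary: each brand maps to a list of known
# models, and colors are kept in a separate list.
VOCAB = {
    "brands": {
        "Mercedes": ["C200", "E350"],
        "BMW": ["M3", "X5"],
    },
    "colors": ["black", "white", "red", "blue"],
}


def extract_cars(text, vocab=VOCAB, window=3):
    """Return one dict per brand mention, using "-" for missing fields."""
    brands = {b.lower(): b for b in vocab["brands"]}
    colors = {c.lower() for c in vocab["colors"]}
    results = []
    # Process each sentence separately so a color in one sentence is never
    # attached to a brand in the next one.
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        tokens = re.findall(r"[A-Za-z0-9]+", sentence)
        for i, token in enumerate(tokens):
            brand = brands.get(token.lower())
            if brand is None:
                continue
            models = {m.lower(): m for m in vocab["brands"][brand]}
            # Assumption: the model, if present, directly follows the brand.
            nxt = tokens[i + 1].lower() if i + 1 < len(tokens) else ""
            model = models.get(nxt, "-")
            # Assumption: the color sits within `window` tokens before the brand.
            before = tokens[max(0, i - window):i]
            color = next(
                (t.capitalize() for t in reversed(before) if t.lower() in colors),
                "-",
            )
            results.append({"Brand": brand, "Model": model, "Color": color})
    return results


text = ("Mike drove away in a black Mercedes with four other people. "
        "Moreover he also owns a BMW M3 in Europe.")
for car in extract_cars(text):
    print(car)
```

For the example sentence this yields one record for Mercedes (no model, color Black) and one for BMW (model M3, no color). Multi-word models or colors, and brands mentioned without a nearby car, would need a tokenizer or parser such as the NLTK tools mentioned above.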
