Creating my own named entity using NLTK in Python

Question

I am studying NLTK using a book named Natural Language Processing with Python Cookbook.

Here is the code from the book, which comes with no explanation at all.

import nltk

grammar = r"NAMED-ENTITY: {<NNP>+}"
cp = nltk.RegexpParser(grammar)

samplestrings = [
    "Microsoft Azure is a cloud service",
    "Bill Gates announces Satya Nadella as new CEO of Microsoft"
]

def demo(samplestrings):
    for s in samplestrings:
        words = nltk.word_tokenize(s)
        tagged = nltk.pos_tag(words)
        # chunks = nltk.ne_chunk(tagged)
        chunks = cp.parse(tagged)
        print(nltk.tree2conllstr(chunks))
        print(chunks)

demo(samplestrings)

I am stuck on the first line.

What does the line grammar = r"NAMED-ENTITY: {<NNP>+}" do?

Does it mean that any run of one or more consecutive words tagged NNP is treated as a named entity?
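
To make that reading concrete, here is a minimal plain-Python sketch of what I think the rule {<NNP>+} matches: each maximal run of one or more consecutive NNP-tagged tokens becomes one chunk. This is not NLTK's implementation. The helper name chunk_nnp_runs is hypothetical, and the tags are written by hand as an assumption, so nltk.pos_tag may tag the sentence differently.

```python
from itertools import groupby


def chunk_nnp_runs(tagged):
    """Join each maximal run of consecutive NNP tokens into one chunk.

    `tagged` is a list of (word, tag) pairs, the same shape that
    nltk.pos_tag returns. Only the exact tag "NNP" counts.
    """
    chunks = []
    # groupby splits the sequence into runs where the key stays constant,
    # so every run with key True is a block of adjacent NNP tokens.
    for is_nnp, group in groupby(tagged, key=lambda pair: pair[1] == "NNP"):
        if is_nnp:
            chunks.append(" ".join(word for word, _ in group))
    return chunks


# Hand-written tags (an assumption, not real tagger output).
tagged = [
    ("Bill", "NNP"), ("Gates", "NNP"), ("announces", "VBZ"),
    ("Satya", "NNP"), ("Nadella", "NNP"), ("as", "IN"),
    ("new", "JJ"), ("CEO", "NNP"), ("of", "IN"), ("Microsoft", "NNP"),
]

print(chunk_nnp_runs(tagged))
```

If this reading is right, a single NNP such as "Microsoft" forms a chunk on its own, and adjacent NNPs such as "Bill Gates" are merged into one chunk.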

Thanks in advance for any answers.
