How to merge same consecutive entity types using spaCy

Question

This is a sample that uses the EntityRuler to create patterns. I want to merge consecutive entities of the same type into a single entity and a single token.

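import spacy
from spacy.pipeline import EntityRuler
from spacy.util import filter_spans

# Assumption: the question does not show how nlp was created; a blank English pipeline is used here
nlp = spacy.blank("en")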

ent_list_sample = ["brain", "ischemia", "heart failure", "parenchyma"]


print("Adding patterns to EntityRuler:\n-----------")
patterns = []
for concept in ent_list_sample:
    doc = nlp.make_doc(concept)
    if len(doc) > 1:
        patterns.append({"label": "SCI", "pattern":[{"LOWER":term.text.lower()} for term in doc]})
    else:
        patterns.append({"label": "SCI", "pattern":doc.text.lower()})
# spaCy v2 API; in spaCy v3 the equivalent is: ruler = nlp.add_pipe("entity_ruler")
ruler = EntityRuler(nlp)
ruler.add_patterns(patterns)
nlp.add_pipe(ruler)


doc = nlp("It has a brain and also might have brain parenchyma ")
print("Entities:")
print(doc.ents)

output: (brain, brain, parenchyma)
expected: (brain, brain parenchyma)

PS: How can we reach the expected output without adding an extra pattern for "brain parenchyma"?


Answers (1)
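
One possible approach, as a rough sketch rather than a definitive solution: post-process doc.ents, join entities that are directly adjacent and share a label, then merge each multi-token entity into a single token with the retokenizer. The helper name merge_consecutive_ents is hypothetical, not part of spaCy. To keep the sketch independent of the EntityRuler setup, the entities are assigned by hand here; in the question's pipeline the function would be called on the doc after the ruler has run.

```python
import spacy
from spacy.tokens import Span


def merge_consecutive_ents(doc):
    """Join directly adjacent entities with the same label (hypothetical helper)."""
    merged = []
    for ent in doc.ents:  # doc.ents is ordered by start position
        if merged and merged[-1].label == ent.label and merged[-1].end == ent.start:
            # The previous entity ends exactly where this one starts: extend it
            prev = merged.pop()
            merged.append(Span(doc, prev.start, ent.end, label=ent.label))
        else:
            merged.append(ent)
    doc.ents = merged

    # Collapse every multi-token entity into a single token
    with doc.retokenize() as retokenizer:
        for ent in merged:
            if len(ent) > 1:
                retokenizer.merge(ent, attrs={"ent_type": ent.label})
    return doc


# Assumption: a blank English pipeline stands in for the pipeline from the question
nlp = spacy.blank("en")
doc = nlp("It has a brain and also might have brain parenchyma ")

# Stand-in for the EntityRuler: tag each single-token term as SCI
terms = {"brain", "parenchyma"}
doc.ents = [Span(doc, t.i, t.i + 1, label="SCI") for t in doc if t.lower_ in terms]

doc = merge_consecutive_ents(doc)
print("Entities:")
print(doc.ents)
print("Tokens:")
print([t.text for t in doc])
```

Note that this only merges entities with no token between them, so "brain and parenchyma" would stay as two entities. It also merges any adjacent same-label pair, which may be too aggressive if two unrelated SCI terms happen to sit next to each other.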
