Finding whether or not, a word is on the dependency path of two entities with spaCy

Question

I'm working on a nlp problem, given a sentence with two entities I need to generate boolean indicating for each word if it stands on the dependency path between those entities.

For example:

'A misty < e1 >ridge< /e1 > uprises from the < e2 >surge< /e2 >'

I want to iterate on each words and tell if it is on the dependency path between e1 and e2

Two important notes:

-If you try to help me (first thanks), don't bother considering the xml markup with < e1 > and < e2 >, I really am interested in how to find if a word is on the dependency path between any two given words with spaCy, I take care of which words by myself

-As I'm not a nlp expert, I'm kind of confused with the meaning of "on the dependency path" and I'm sorry if it is not clear enough (these are the words used by my tutor)

Thanks in advance

Valentin Mac&#233; · Accepted Answer

So my solution was found using that post

There is an answer dedicated to spaCy

My implementation for finding the dependency path between two words in a given sentence:

import networkx as nx
import spacy
enter code here
doc = nlp("Ships carrying equipment for US troops are already waiting off the Turkish coast")
    
def shortest_dependency_path(doc, e1=None, e2=None):
    edges = []
    for token in doc:
        for child in token.children:
            edges.append(('{0}'.format(token),
                          '{0}'.format(child)))
    graph = nx.Graph(edges)
    try:
        shortest_path = nx.shortest_path(graph, source=e1, target=e2)
    except nx.NetworkXNoPath:
        shortest_path = []
    return shortest_path

print(shortest_dependency_path(doc,'Ships','troops'))

Output:

['Ships', 'carrying', 'for', 'troops']

What it actually does is to first build a non-oriented graph for the sentence where words are the nodes and dependencies between words are the edges and then find the shortest path between two nodes

For my needs, I just then check for each word if it's on the dependency path (shortest path) generated

Finding whether or not, a word is on the dependency path of two entities with spaCy

Answers (2)

Related Questions