How do I preserve the order of the dependencies?

Question

I have the following code that opens files in a directory, runs spaCy NLP on them, and the outputs dependency parse info into a file in a new directory.

import spacy, os

nlp = spacy.load('en')

path1 = 'C:/Path/to/my/input'
path2 = '../output'
for file in os.listdir(path1):
    with open(file, encoding='utf-8') as text:
        txt = text.read()
        doc = nlp(txt)
        for sent in doc.sents:
            f = open(path2 + '/' + file, 'a+')
            for token in sent:
                f.write(file + '	' + str(token.dep_) + '	' + str(token.head) + '	' + str(token.right_edge) + '
')
    f.close()

The trouble is that this won't preserver the order of the dependencies in the output file. I can't seem to find any references to character positions in the API documentation.

How do I preserve the order of the dependencies?

Answers (1)

Related Questions