betul
betul

Reputation: 11

Stanford CoreNLP Dependency Parser Usage with Unsupported Languages

I am trying to train CoreNLP's NN based dependency parser in Turkish. I have found the command below in the documentation of the parser:

Train a parser with CoNLL treebank data: java    edu.stanford.nlp.parser.nndep.DependencyParser -trainFile trainPath
-devFile devPath -embedFile wordEmbeddingFile -embeddingSize wordEmbeddingDimensionality -model modelOutputFile.txt.gz

I couldn't exactly figure out what the modelOutputFile is. It is stated in the documentation that this file is written in the training phase. Is modelOutputFile a pregenerated file that I should create or just an empty file that will be written automatically in the training phase?

Any help will be appreciated, thank you!

Upvotes: 1

Views: 213

Answers (1)

StanfordNLPHelp
StanfordNLPHelp

Reputation: 8739

When the training process is done it should write the trained model to modelOutputFile.txt.gz You can then use that trained file to parse new text. Full documentation here: https://nlp.stanford.edu/software/nndep.shtml

Upvotes: 1

Related Questions