Reputation: 161
Difference between basic-dependencies, collapsed-dependencies and collapsed-ccprocessed-dependencies in Stanford CoreNLP, and how to use them to understand a query?
Upvotes: 4
Views: 1815
Reputation: 5749
A good way to see the difference by example is with the online demo (corenlp.run). The basic, collapsed, and cc-processed representations are roughly the older ("Stanford Dependencies") equivalents of basic, enhanced, and enhanced++ in the newer ("Universal Dependencies") representation.
At a high level, the basic dependencies are meant to be easier to parse: they're always a tree, the label set is small, etc. The enhanced[++] dependencies (like their predecessors, "collapsed" and "cc-processed") are deterministic transformations of the basic dependencies, intended to make them a bit easier to work with and a bit more semantic. For example, they put the preposition into the label on the arc (prep:of in Stanford Dependencies; nmod:of in Universal Dependencies).
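To make the "deterministic transformation" idea concrete, here is a toy sketch of the preposition case only. It is not CoreNLP's implementation: the function name enhance_prepositions, the (relation, governor, dependent) tuple format, and the use of bare words instead of token indices are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical edge format: (relation, governor, dependent).
# Real dependency graphs identify tokens by index, not by word form.
Edge = Tuple[str, str, str]


def enhance_prepositions(basic: List[Edge]) -> List[Edge]:
    """Copy each preposition into the label of the arc it marks."""
    # Map each word to the preposition attached to it by a "case" edge.
    case_of = {gov: dep.lower() for rel, gov, dep in basic if rel == "case"}

    enhanced = []
    for rel, gov, dep in basic:
        # Only nominal modifiers and obliques get the preposition appended.
        if rel in ("nmod", "obl") and dep in case_of:
            rel = f"{rel}:{case_of[dep]}"
        enhanced.append((rel, gov, dep))
    return enhanced


# Basic Universal Dependencies for the phrase "the capital of France".
basic = [
    ("det", "capital", "the"),
    ("nmod", "capital", "France"),
    ("case", "France", "of"),
]

enhanced = enhance_prepositions(basic)
for edge in enhanced:
    print(edge)
```

With the enhanced edges, a query like "capital of France" can be matched by looking for a single nmod:of arc instead of walking two arcs through the preposition.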
The full documentation of the differences (for Universal Dependencies) is in Schuster and Manning (2016), "Enhanced English Universal Dependencies: An Improved Representation for Natural Language Understanding Tasks". The original Stanford Dependencies are perhaps best documented in the Stanford Dependencies Manual.
Upvotes: 4