Ailef
Ailef

Reputation: 7906

Chunking NP, VP and PP phrases in Java (CoreNLP)

I'm using Stanford CoreNLP and I'm aware it doesn't support chunking of sentences. What I'm looking for is, given an input sentence, to have something like this as output:

 [NP He ] [VP reckons ] [NP the current account deficit ] [VP will narrow ] [PP to ] [NP only # 1.8 billion ] [PP in ] [NP September ] . 

I also know OpenNLP apparently supports this feature, but I already wrote quite a lot of code using CoreNLP and I would't like having to switch. So, what I'm looking for is either an external library that can do this for me or ideas about implementing this feature in the most simple way (references to publications, links, everything is welcome), starting from the parse tree. I don't need this to be as accurate as state of the art chunkers, at least for now, so I'm looking to implement this fast and maybe change it in the future if needed.

Upvotes: 6

Views: 1415

Answers (1)

Ailef
Ailef

Reputation: 7906

In the end I found the Illinois chunker that does just what I needed.

Upvotes: 6

Related Questions