user2609542

Reputation: 831

How to remove stopwords using stanford nlp

I want to parse a document with Stanford NLP and remove the stopwords from it. Is there an API for removing stopwords? I found a StopWords class, but I don't know how to use it. How can I do this?

Thanks

Upvotes: 9

Views: 8804

Answers (2)

gneusch

Reputation: 125

If I'm correct, the annotator mentioned by @Raju Penumatsa is available on Maven here: https://mvnrepository.com/artifact/com.zensols/stopword-annotator It is maintained in a separate Git repo here: https://github.com/plandes/stopword-annotator

With the Maven artifact, you can add the annotator to your project as a dependency using a build tool such as Maven or Gradle. You don't have to copy the library into your classpath manually, which is easier and more maintainable. The Git repo I linked moved the stopword plugin out of the jconwell/coreNlp project into a separate repo and added the metadata needed to publish it on Maven Central.

Upvotes: 1

Raju Penumatsa

Reputation: 413

I think you can use this annotator to remove stop words: https://github.com/jconwell/coreNlp
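To make the idea concrete, here is a minimal plain-Java sketch of stopword filtering. It does not use the linked annotator or any CoreNLP API. The class name `StopwordFilter`, the method `removeStopwords`, and the tiny stopword list are all assumptions made up for illustration, and the regex split is only a stand-in for a real tokenizer.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class StopwordFilter {
    // Tiny illustrative stopword list (an assumption); real lists are much longer.
    private static final Set<String> STOPWORDS =
            Set.of("a", "an", "and", "the", "is", "of", "to", "in", "from");

    // Returns the tokens of the text that are not in the stopword list.
    public static List<String> removeStopwords(String text) {
        List<String> kept = new ArrayList<>();
        // Naive tokenization: split on runs of non-letter characters.
        // In a CoreNLP pipeline the tokens would come from the tokenizer instead.
        for (String token : text.split("[^\\p{L}]+")) {
            if (token.isEmpty()) {
                continue; // split() can yield an empty leading token
            }
            // Compare case-insensitively, but keep the original token text.
            if (!STOPWORDS.contains(token.toLowerCase(Locale.ROOT))) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // prints [Remove, stopwords, parsed, document]
        System.out.println(removeStopwords("Remove the stopwords from a parsed document"));
    }
}
```

With the annotator approach, the same membership check would presumably be applied to each token the pipeline produces, rather than to the output of a regex split.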

Upvotes: 6
