Reputation: 1585
Problem: Is there an option to stem the words using stanford-core-nlp
?
I am not able to find one! I am using the stanford-corenlp-3.5.2.jar.
Code:
public class StanfordNLPTester {
public static void main (String args[]){
String paragraph = "A long paragraph here";
Properties properties = new Properties();
properties.put("annotators","tokenize,ssplit,pos,lemma,ner,depparse");
StanfordCoreNLP pipeline = new StanfordCoreNLP(properties);
Annotation annotation = new Annotation (paragraph);
pipeline.annotate(annotation);
pipeline.prettyPrint(annotation,System.out);
}
}
Upvotes: 5
Views: 6799
Reputation: 8739
You'll need to get this from GitHub: https://github.com/stanfordnlp/CoreNLP
This class will provide what you want:
https://github.com/stanfordnlp/CoreNLP/blob/master/src/edu/stanford/nlp/process/Stemmer.java
The main() method of that class shows example usage of the stemmer.
You can continue to use the stanford-corenlp-3.5.2.jar and just include that one extra class, since everything that class depends on is in the jar.
Upvotes: 8