Hossein

Reputation: 41831

Stop-word elimination and stemmer in python

I have a fairly large document and want to remove stop words and stem the remaining words using Python. Does anyone know of an off-the-shelf package for this? If not, code that is fast enough for large documents is also welcome. Thanks.

Upvotes: 4

Views: 2310

Answers (2)

Miki Tebeka

Reputation: 13850

If for some reason you don't want to use NLTK, you can try PyStemmer. For stop words, just download a list (google it) and filter them out.
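A rough sketch of the "download a list and filter" approach, in case it helps. The helper names `load_stop_words` and `filter_and_stem` are made up for illustration, the tokenizer is a deliberately crude regex, and the stop-word list here is a tiny stand-in for a real downloaded one. The stemmer is passed in as a plain callable, so nothing in the sketch depends on a particular package.

```python
import re
from typing import Callable, Iterable, Iterator

# Crude tokenizer (an assumption): lowercase ASCII letters and apostrophes only.
TOKEN_RE = re.compile(r"[a-z']+")


def load_stop_words(lines: Iterable[str]) -> frozenset:
    """Build a stop-word set from an iterable of lines, one word per line."""
    return frozenset(line.strip().lower() for line in lines if line.strip())


def filter_and_stem(
    lines: Iterable[str],
    stop_words: frozenset,
    stem: Callable[[str], str] = lambda word: word,
) -> Iterator[str]:
    """Lazily yield stemmed, non-stop-word tokens, one input line at a time."""
    for line in lines:
        for token in TOKEN_RE.findall(line.lower()):
            if token not in stop_words:
                yield stem(token)


if __name__ == "__main__":
    # Stand-in for a downloaded list (hypothetical contents).
    stop_words = load_stop_words(["the", "is", "on", "a"])
    print(list(filter_and_stem(["The cat is on the mat"], stop_words)))
```

Because it is a generator that reads one line at a time, an open file object can be passed as `lines` without loading the whole document into memory. With PyStemmer installed, `Stemmer.Stemmer("english").stemWord` should work as the `stem` argument.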

Upvotes: 4

Ken Bloom

Reputation: 58770

NLTK supports both: it ships a stop-word corpus and several stemmers.
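Something along these lines should work with NLTK, as a minimal sketch rather than a tuned pipeline. The function name `nltk_stop_and_stem` and the regex tokenizer are my own choices for illustration; the NLTK pieces used are `nltk.corpus.stopwords` and `nltk.stem.SnowballStemmer`. It assumes the stop-word corpus has already been fetched once with `nltk.download("stopwords")`.

```python
import re

# Crude tokenizer (an assumption): lowercase ASCII letters and apostrophes only.
TOKEN_RE = re.compile(r"[a-z']+")


def nltk_stop_and_stem(text, language="english"):
    """Return stemmed tokens of `text` with NLTK stop words removed."""
    # Local imports so this module still loads where NLTK is not installed.
    from nltk.corpus import stopwords
    from nltk.stem import SnowballStemmer

    # A set makes each membership test O(1), which matters on large documents.
    stop_words = set(stopwords.words(language))
    stemmer = SnowballStemmer(language)
    return [
        stemmer.stem(token)
        for token in TOKEN_RE.findall(text.lower())
        if token not in stop_words
    ]


if __name__ == "__main__":
    print(nltk_stop_and_stem("The cats are running through the gardens"))
```

For a very large document it is probably better to build the stop-word set and the stemmer once and call the filtering step per line, instead of passing the whole text in one string.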

Upvotes: 8
