Reputation: 5363
Does anybody know an open-source\free library that does term clustering?
Thanks, yaniv
Upvotes: 3
Views: 467
Reputation: 381
Python Scikit learn has some dedicated packages for text analysis. Besides they have a complete suite of Clustering Algorithms that includes K-means, AP, Mean shift, Spectral Clustering, Hierarchical Clustering and DBSCAN algorithms (with appropriate evaluation metrics). This may be helpful your term clustering task.
Link to Scikit Learn latest video tutorial
Link to Scikit Learn Book
Upvotes: 0
Reputation: 3861
If your in to python there is NLTK, as already mentioned by it's author, but there is also sklearn which provides much more than just clustering. (Link takes you to text applicable examples).
Upvotes: 0
Reputation: 519
WEKA has a whole suite of tools for text processing along with clustering.
Upvotes: 1
Reputation: 4182
Checkout NLTK. There's a number of clustering modules that might work for you.
Upvotes: 1