Reputation: 87
I have textual data that I want to discover topics it has, I used trained doc2vec on large corpus such as Wikipedia, but there is inconsistency in the results. Is there a better approach to discover topics.
Upvotes: 0
Views: 254
Reputation: 993
You could try using Hierarchical Dirichlet Process as implemented in the gensim
class gensim.models.hdpmodel.HdpModel
. From the gensim docs:
Unlike its finite counterpart, latent Dirichlet allocation, the HDP topic model infers the number of topics from the data.
Upvotes: 1