Jay Qadan

Reputation: 87

How can I detect topics in an arbitrary text file or dataset without knowing the number of topics beforehand?

I have textual data whose topics I want to discover. I used a doc2vec model trained on a large corpus such as Wikipedia, but the results are inconsistent. Is there a better approach to discovering topics?

Upvotes: 0

Views: 254

Answers (1)

WolfgangK

Reputation: 993

You could try a Hierarchical Dirichlet Process (HDP), as implemented in the gensim class gensim.models.hdpmodel.HdpModel. From the gensim docs:

Unlike its finite counterpart, latent Dirichlet allocation, the HDP topic model infers the number of topics from the data.

Upvotes: 1
