Vivek Santhosh

Reputation: 299

What does the score indicate in topic modelling?

I used gensim for LSA, following this tutorial: https://www.datacamp.com/community/tutorials/discovering-hidden-topics-python

I got the following output after running it on a list of texts:


[(1, '-0.708*"London" + 0.296*"like" + 0.294*"go" + 0.287*"dislike" + 0.268*"great" + 0.200*"romantic" + 0.174*"stress" + 0.099*"lovely" + 0.082*"good" + -0.075*"Tower" + 0.072*"see" + 0.063*"nice" + 0.061*"amazing" + -0.053*"Palace" + 0.053*"walk" + -0.050*"Eye" + 0.046*"eat" + -0.042*"Bridge" + 0.041*"Garden" + 0.040*"Covent" + -0.040*"old" + -0.039*"visit" + 0.039*"really" + 0.035*"spend" + 0.034*"watch" + 0.034*"get" + -0.032*"Buckingham" + 0.032*"Weather" + -0.032*"Museum" + -0.032*"Westminster"')]

What does -0.708*"London" indicate?

Upvotes: 0

Views: 36

Answers (1)

chefhose

Reputation: 2694

Those are the words that contribute most to your topic, both positively and negatively. One characteristic of your topic seems to be that it has nothing to do with London. You can see that other "London-related" words also contribute negatively: Westminster, Tower and Eye all have negative weights for this topic.

So, according to your model, a text that contains the word London scores lower on this topic, while a text that lacks it and uses words such as "like", "go" or "great" is more plausibly about this topic.
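To make the effect of the weights concrete, here is a minimal sketch of how a text's score on this topic could be computed by hand. It assumes a plain bag-of-words representation (raw term counts, no TF-IDF weighting) and uses only the first five weights from the output in the question; `topic_score` and the two example token lists are made up for illustration and are not part of gensim.

```python
# Weights copied from the question's output (first five terms only).
topic_weights = {
    "London": -0.708,
    "like": 0.296,
    "go": 0.294,
    "dislike": 0.287,
    "great": 0.268,
}


def topic_score(tokens, weights):
    """Add up the weight of every token occurrence.

    Tokens that are not in `weights` contribute 0. This is a hypothetical
    helper that assumes raw term counts as the document representation.
    """
    return sum(weights.get(token, 0.0) for token in tokens)


# Two made-up tokenized texts that differ only in the word "London".
with_london = topic_score(["go", "London", "great"], topic_weights)
without_london = topic_score(["go", "great"], topic_weights)

print("with London:   ", round(with_london, 3))
print("without London:", round(without_london, 3))
```

The text containing "London" ends up with a lower score because -0.708 is subtracted from the positive contributions of "go" and "great". Note that only the signs relative to each other carry meaning here: the SVD behind LSA determines each topic vector only up to an overall sign flip, so the same topic could equally be reported with every sign reversed.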

Upvotes: 1
