Jerry George

Reputation: 335

Plot each cluster by a single point (Kmeans Clustering)

I have clustered different texts into 15 clusters.

The texts are in the form:

"Oreo Biscuit is good"
"Healthy Breakfast"
"Cars are fast"
....

I converted the texts into 100-dimensional Word2Vec vectors. Now I have 15 clusters, which I want to plot.

Instead of plotting all the points, I want to plot 1 point for each cluster, so that there would be 15 points in the plot. How do I do that?

Ideas:

1) Use the cluster centre to plot each cluster.

Is there any other way (for example, converting all the Word2Vec vectors in a cluster into a Doc2Vec vector)? Or can MDS (multidimensional scaling) be used to plot the clusters?

Thank you

Upvotes: 0

Views: 294

Answers (1)

Abhishek Mishra

Reputation: 1994

You already have a vector representation of each text, and you also have clusters for these texts. You have the following fairly simple options:

  1. Plot just the cluster centroids, using some dimensionality reduction mechanism. (Pro: simple. Con: carries no information about the goodness of each individual cluster.)
  2. Still plot the cluster centroids, but this time add the variance as a third dimension using a bubble plot, as shown here. (Pro: includes both mean and variance. Con: K-means is too simple.)
  3. Apply a spectral clustering approach first, and then apply the above methods on top of that.
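Options 1 and 2 can be sketched roughly as follows. This is only a minimal illustration, assuming scikit-learn, NumPy and matplotlib are available. The random `vectors` array is a placeholder for your real 100-dimensional text vectors, the helper names `summarize_clusters` and `plot_clusters` are made up for this example, and PCA is just one possible choice of dimensionality reduction (`sklearn.manifold.MDS` could be swapped in to try the MDS idea from the question).

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA


def summarize_clusters(vectors, n_clusters=15, seed=0):
    """Return 2-D centroid coordinates and one spread value per cluster."""
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit(vectors)
    centroids = km.cluster_centers_  # shape: (n_clusters, n_dims)
    labels = km.labels_

    # Spread = mean squared distance of a cluster's members to its centroid.
    sq_dist = ((vectors - centroids[labels]) ** 2).sum(axis=1)
    spread = np.array([sq_dist[labels == k].mean() for k in range(n_clusters)])

    # Project only the centroids (not every point) down to 2-D.
    coords = PCA(n_components=2).fit_transform(centroids)
    return coords, spread


def plot_clusters(coords, spread):
    """Bubble plot: one point per cluster, bubble area scaled by spread."""
    import matplotlib.pyplot as plt

    sizes = 300 * spread / spread.max()
    plt.scatter(coords[:, 0], coords[:, 1], s=sizes, alpha=0.5)
    for k, (x, y) in enumerate(coords):
        plt.annotate(str(k), (x, y))  # label each bubble with its cluster id
    plt.show()


# Placeholder data standing in for the real 100-dimensional text vectors.
rng = np.random.default_rng(0)
vectors = rng.normal(size=(600, 100))
coords, spread = summarize_clusters(vectors)
```

Calling `plot_clusters(coords, spread)` then draws 15 bubbles, one per cluster. For option 1 alone, you can ignore `spread` and plot `coords` with a fixed marker size.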

Upvotes: 1
