getting distance matrix and features matrix from word2vec model

Question

I have generated a word2vec model using gensim for a huge corpus and I need to cluster the vocabularies using k means clustering for that i need:

cosine distance matrix (word to word, so the size of the matrix the number_of_words x number_of_words )
features matrix (word to features, so the size of the matrix is the number_of_words x number_of_features(200) )

for the feature matrix i tried to use x=model.wv and I got the object type as gensim.models.keyedvectors.KeyedVectors and its much smaller than what I expected a feature matrix will be

is there a way to use this object directly to generate the k-means clustering ?

getting distance matrix and features matrix from word2vec model

Answers (1)

Related Questions