infinity vectors in Spark MLlib word2vec

Question

I have a question about running word2vec of Spark MLlib. I run it with vocabulary size ~2.4M and corpus size ~1.4B. What is the reason to get +-infinity vectors for some words? It happens when I increase the number of iterations, namely, with 10 iteration I get a reasonable model, and with 20 iteration I get some vectors of the form [Infinity,-Infinity,Infinity,-Infinity,...]. Thanks in advance.

infinity vectors in Spark MLlib word2vec

Answers (1)

Related Questions