Ion C

Reputation: 323

How to Export Gensim Word2Vec Model with Ngram Weights for DL4J?

I'm quite new to NLP. I'm trying to use a model trained with Gensim in DL4J. I'm saving the model with

w2v_model.wv.save_word2vec_format("path/to/w2v_model.bin", binary=True)

and afterwards I'm loading it with

Word2Vec w2vModel = WordVectorSerializer.readWord2VecModel("path/to/w2v_model.bin"); 

The model works well except for the handling of out-of-vocabulary (OOV) words. In Gensim, the model seems to calculate vectors for OOV words from their character n-grams, but in DL4J it returns an empty vector for them.
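For reference, this is roughly the behavior I'm seeing on the Gensim side (the token is just a made-up example, and I'm on the Gensim 4.x API):

oov_word = "definitelynotinvocabulary"
print(oov_word in w2v_model.wv.key_to_index)  # False -- the word is not in the vocabulary
print(w2v_model.wv[oov_word][:5])             # ...yet a vector is still returned for it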

My questions are:

  1. Is there a way to export the n-gram weights along with the model from Gensim so that DL4J can use them?
  2. If exporting the n-gram weights is not possible, is there a method to reconstruct them on the DL4J side to achieve similar results for OOV words as in Gensim?

Any guidance or suggestions would be greatly appreciated.

Upvotes: 0

Views: 100

Answers (1)

gojomo

Reputation: 54208

The core original word2vec algorithm – and the Word2Vec model class in Gensim – has no ability to synthesize vectors for OOV words using character n-grams.

That's only a feature of FastText models (and the FastText model class in Gensim) – so if you're seeing that working in Gensim, your w2v_model variable may actually hold a Gensim FastText object.
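A quick way to confirm that is to check the type of the object you trained; a minimal sketch, assuming Gensim 4.x:

from gensim.models import FastText

print(type(w2v_model).__name__)         # 'FastText' here would explain the OOV behavior
print(isinstance(w2v_model, FastText))  # True -> subword n-grams are in play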

Further, the plain {word, vector}-per-line format saved by Gensim's .save_word2vec_format() (whether binary=False or binary=True) doesn't save any subword n-grams, even if used on a FastText object. (It just saves the full-word vectors for in-vocabulary words.)

Gensim's FastText can save models in the full raw model format also used by Facebook's original FastText implementation – see FastText.save_facebook_model().
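For example (a sketch, assuming your model really is a Gensim FastText instance and a Gensim 4.x install, where this helper is also exposed as a module-level function):

from gensim.models.fasttext import save_facebook_model

# Writes the full Facebook-FastText .bin format, including the subword
# n-gram buckets needed to synthesize vectors for OOV words.
save_facebook_model(w2v_model, "path/to/ft_model.bin")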

But to bring that to a Java environment, you'd need to find a true FastText implementation that also reads that format. I don't see any evidence that the Word2Vec class in DL4J supports FastText features or can load FastText models.

There is an org.deeplearning4j.models.fasttext.FastText class – which seems to wrap Facebook's native C++ FastText implementation via another library, com.github.jfasttext.JFastText. That is, it's not a true Java implementation, but it makes the model accessible to Java code.

I have no idea of the completeness/reliability of this approach; it's a little fishy to me that a class (JFastText) not from a GitHub engineer is named via a com.github path. But presumably the deeplearning4j maintainers know what they're doing, and this may be your best option for loading a fully-capable (character-n-gram features) FastText model for use in DL4J.

Upvotes: 0
