Skip to content Skip to sidebar Skip to footer
Showing posts with the label Word2vec

Gensim Word2vec Freeze Some Wordvectors And Update Others

Regarding word2vec with gensim, Suppose you already trained a model on a big corpus, and you want t… Read more Gensim Word2vec Freeze Some Wordvectors And Update Others

Why Gensim Doc2vec Give Attributeerror: 'list' Object Has No Attribute 'words'?

I am trying to experiment gensim doc2vec, by using following code. As far as I understand from tut… Read more Why Gensim Doc2vec Give Attributeerror: 'list' Object Has No Attribute 'words'?

Spacy: How To Load Google News Word2vec Vectors?

I've tried several methods of loading the google news word2vec vectors (https://code.google.com… Read more Spacy: How To Load Google News Word2vec Vectors?

How To Incrementally Train A Word2vec Model With New Vocabularies

I got a dataset over 40G. The program of my tokenizer is killed due to limited memory, so I try to … Read more How To Incrementally Train A Word2vec Model With New Vocabularies

Gensim Phrases Usage To Filter N-grams

I am using Gensim Phrases to identify important n-grams in my text as follows. bigram = Phrases(doc… Read more Gensim Phrases Usage To Filter N-grams

Loss Does Not Decrease During Training (word2vec, Gensim)

What can cause loss from model.get_latest_training_loss() increase on each epoch? Code, used for … Read more Loss Does Not Decrease During Training (word2vec, Gensim)

Word2vec - What Is Best? Add, Concatenate Or Average Word Vectors?

I am working on a recurrent language model. To learn word embeddings that can be used to initialize… Read more Word2vec - What Is Best? Add, Concatenate Or Average Word Vectors?

Gensim Word2vec Print Log Loss

how to print to log (file or stout) the loss of each epoch in the training phase, when using gensim… Read more Gensim Word2vec Print Log Loss