You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am trying to apply pretrained word2vec weights to doc2vec, but it exits with segmentation fault if I called build_vocab() after reseting the weights. There isn't any traceback printed. Perhaps the runtime error is caused by Cython code.
This appears to be a duplicate of #1019, so further discussion/investigation/fixing of the crash should happen there.
Note that the update=True incremental vocab-expansion feature has so far only been implemented/tested with a focus on Word2Vec – so could be crashy-buggy if applied to Doc2Vec.
Similarly, reset_from() was intended to borrow properties from a same-type model, so might not work in this fashion (and could also be causing mismatches in state that trigger hard seg-faults in the cython code).
Separately, you may want to look into intersect_word2vec_format() as a different option for mixing pretrained vectors into a model with an existing vocabulary - see the method comment & prior discussion on the project discussion list for more on how it could be used.
I am trying to apply pretrained word2vec weights to doc2vec, but it exits with segmentation fault if I called
build_vocab()
after reseting the weights. There isn't any traceback printed. Perhaps the runtime error is caused by Cython code.I have also tried to annotated the line
doc_vectors.wv.syn0 = pretrained.syn0
, however, it does not help.The text was updated successfully, but these errors were encountered: