-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FastText inconsistent normalization in adjust_vectors method vs word_vec #2537
Comments
The |
I agree that the two implementations appear to be inconsistent. I think this is a bug. This is what I rewrote when I worked on fastText a few months ago:
This is the old version of the FastText code:
I think the change to make here is:
|
@vackosar can you open a PR? |
@piskvorky @mpenkov I will submit the PR but are you sure about this? |
By the way, I posted couple of FastText norm visualizations here, if you are interested. |
Here is the PR #2551 |
There's no inconsistency. In the Of the many oddnesses of the gensim FT implementation, this behavior (as noted by @vackosar in comment #2537 (comment)) is consistent with the FB reference implementation, so there's no bug, and no reason for the associated code-changing PR. |
Problem description
There is inconsistency in normalization in
FastTextKeyedVectors#adjust_vectors
vsWordEmbeddingsKeyedVectors#word_vec
Steps/code/corpus to reproduce
adjust_vectors:
word_vec method:
Versions
The text was updated successfully, but these errors were encountered: