-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
build_vocab fails when calling with different trim_rule for same corpus #1187
Comments
I believe you'd get that same error even without In general, triggering The error message is poorly worded, implying taking an extra step (sorting earlier) might fix the issue. Instead, it's the sort-attempt that's failing. So perhaps the message should be: "cannot sort vocabulary after model weights already initialized". |
@gojomo Yes. Thanks for specifying that. I was trying to solve another issue involving trim_rule when I encountered this issue. I'll update the error message and submit a PR. |
Fixed in #1190 |
Dear gojomo, I agree with you. I meet the same problem when I tried to use GridSearch to find the best parameters of Doc2Vec. Do you know how to clobber the existing vocabulary & model state efficiently? Thanks |
Now, if we try to build vocabulary for the same model with a different trim_rule
we get error that "must sort before initializing vectors/weights"
Isn't it a bug ? Vocabulary should get updated according to the new trim_rule provided.
The text was updated successfully, but these errors were encountered: