-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
word2vec accuracy #10
Comments
Collection of experiment results
We got low accuracy in several tests(currency, city-in-state, capital-world, capital-common-countries) |
What machines are you using to train? My laptop isn't really cutting it. Like 0.5%/hr ... |
i7 4800 MQ or ec2 c4.4xlarge |
But corpus was generated without any splitting of comments, so each comment's body was fully on one line.
|
looks like it's got a higher semantic accuracy, not sure if it's due to a larger word vector dimension. I'm training a much larger dataset right now (entire 2015 dataset), hopefully we will get a better semantic accuracy. |
1tb on ec2? isnt the storage alone pretty expensive? |
Not a significant improvement |
Collection of experiment results
We got low accuracy in several tests(currency, city-in-state, capital-world, capital-common-countries)
The text was updated successfully, but these errors were encountered: