-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reduce gensim distribution size #1783
Comments
It would be most convenient if we could build a minimal version of the distribution (including minimal scipy and numpy modules). I just managed to squeeze gensim onto a aws lambda using python3, but it was not easy :-) |
@JensMadsen I think we can reduce the size of gensim distribution from |
Yes of course. I plan to write a blog post somewhere soon :-) A not in details procedure for squeezing gensim into AWS lambda:
In that way I get a sufficiently small zip file Actually what matters the most is to reduce the size of scipy which to my understanding has grown significantly lately |
@JensMadsen thanks for the information! |
Can't wait to finally ditch scipy! |
@JensMadsen Hi! Sorry to chime in, but did you ever write a blog post on getting gensim on AWS lambda? Trying to do that now, and gensim is quite...large, when creating a deployment package. Thanks! |
@JustinMoser sorry, no updates. As "ad-hoc" solution, you can extract & drop test data ( |
@menshikh-iv Thank you! Pardon me if I'm being dim, but when I install gensim to my deployment directory (using pip install gensim --target .), with the dependencies, it is near the 300mb mark. |
@JustinMoser wow, that sounds impossible, for example, I made a clean installation on python2
gensim with all deps takes around 72M, where 300MB comes from? Can you check please, what exactly downloaded? if you talking about installed, so, in that case, numpy & scipy still top2 (more than 150MB)
unfortunatelly, I can't help with it |
@JustinMoser I dropped lambdas. too much hazzle. Doing a service in a kubernetes cluster instead :-) This is the content of my dockwer file from back then:
|
Right now, size of gensim
wheel
/tar.gz
is~16MB
, this is less than50MB+
, but still huge.Need to "cut" big files that used for tests and rewrite the affected tests
Previous issue #1698
The text was updated successfully, but these errors were encountered: