3.7.3

mpenkov released this 08 May 03:07

69877c5

3.7.3, 2019-05-06

🔴 Bug fixes

- Fix fasttext model loading from gzip files (mpenkov, #2476)
- Clean up FastText Cython code, fix division by zero (mpenkov, #2382)
- Update legacy model loading (mpenkov, #2454, #2457)
- NMF bugfix (mpenkov, #2466)
- Fix WordEmbeddingsKeyedVectors.most_similar (Witiko, #2461)
- Fix LdaSequence model by updating to num_documents (Bharat123rox, #2410)
- Make termsim matrix positive definite even with negative similarities (Witiko, #2397)
- Fix the off-by-one bug in the TFIDF model (AMR-KELEG, #2392)
- Make matutils.unitvec always return float norm when requested (Witiko, #2419)
- Fix misleading Doc2Vec.docvecs comment (gojomo, #2472)

📚 Tutorial and doc improvements

- Update word2vec.ipynb (asyabo, #2423)

👍 Improvements

- Add type check for the corpus_file argument (saraswatmks, #2469)

⚠️ Deprecations (will be removed in the next major release)

Remove
- gensim.models.FastText.load_fasttext_format: use load_facebook_vectors to load the embeddings only (faster, lower CPU/memory usage, no support for continued training) or load_facebook_model to load the full model (slower, more CPU/memory intensive, supports continued training)
- gensim.models.wrappers.fasttext (obsoleted by the new native gensim.models.fasttext implementation)
- gensim.examples
- gensim.nosy
- gensim.scripts.word2vec_standalone
- gensim.scripts.make_wiki_lemma
- gensim.scripts.make_wiki_online
- gensim.scripts.make_wiki_online_lemma
- gensim.scripts.make_wiki_online_nodebug
- gensim.scripts.make_wiki (all of these obsoleted by the new native gensim.scripts.segment_wiki implementation)
- "deprecated" functions and attributes
Move
- gensim.scripts.make_wikicorpus ➡ gensim.scripts.make_wiki.py
- gensim.summarization ➡ gensim.models.summarization
- gensim.topic_coherence ➡ gensim.models._coherence
- gensim.utils ➡ gensim.utils.utils (old imports will continue to work)
- gensim.parsing.* ➡ gensim.utils.text_utils
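
A minimal migration sketch for the load_fasttext_format deprecation listed under Remove, assuming a gensim version that ships load_facebook_vectors and load_facebook_model in gensim.models.fasttext. The helper names pick_loader_name and load_fasttext are hypothetical and exist only for illustration; they are not part of gensim.

```python
def pick_loader_name(continue_training: bool) -> str:
    """Return the name of the gensim.models.fasttext loader to use."""
    # Full model: slower and heavier, but training can continue.
    # Vectors only: faster and lighter, but no further training.
    return "load_facebook_model" if continue_training else "load_facebook_vectors"


def load_fasttext(path: str, continue_training: bool = False):
    """Load a Facebook fastText binary with the matching loader (hypothetical helper)."""
    # Imported lazily so pick_loader_name works even without gensim installed.
    from gensim.models import fasttext

    loader = getattr(fasttext, pick_loader_name(continue_training))
    return loader(path)
```

For example, `load_fasttext("model.bin")` would load only the embeddings, while `load_fasttext("model.bin", continue_training=True)` would load the full model; `model.bin` is a placeholder path, not a file shipped with this release.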
