- Added support for Portuguese and Spanish.
- Better error message when NLTK tokenizers are missing.
- Dropped support for Python 2.6 and 3.2. Only 2.7/3.3+ are officially supported now. Time to move :)
- CLI: Better message for unknown format.
- LexRank: fixed power method computation.
- Added some extra abbreviations (English, German) to the tokenizer for better output.
- SumBasic: Added new summarization method - SumBasic. Thanks to Julian Griggs.
- KL: Added new summarization method - KL. Thanks to Julian Griggs.
- Added `requests` as a dependency to fix issues with downloading pages.
- Better documentation of expected Plaintext document format.
- Added possibility to specify format of input document for URL & stdin. Thanks to @Lucas-C.
- Added possibility to specify custom file with stop-words in CLI. Thanks to @Lucas-C.
- Added support for French language (added stopwords & stemmer). Thanks to @Lucas-C.
- Function `sumy.utils.get_stop_words` raises `LookupError` instead of `ValueError` for unknown language.
- Exception `LookupError` is raised for unknown language of stemmer instead of falling silently to `null_stemmer`.
- Fixed installation of my own readability fork. Added `breadability` to the dependencies instead of it #8. Thanks to @pratikpoddar.
- First public release.