Releases · pemistahl/lingua-py · GitHub

27 Dec 20:56

Lingua 1.2.1

Bug Fixes

Under certain circumstances, calling the method LanguageDetector.detect_multiple_languages_of() raised an IndexError. This has been fixed. Thanks to @Saninsusanin for reporting this bug. (#98)

Contributors

Saninsusanin

Assets 4

19 Dec 22:57

Lingua 1.2.0

Features

The new method LanguageDetector.detect_multiple_languages_of() has been introduced. It allows to detect multiple languages in mixed-language text. (#4)
The new method LanguageDetector.compute_language_confidence() has been introduced. It allows to retrieve the confidence value for one specific language only, given the input text. (#86)

Improvements

The computation of the confidence values has been revised and the min-max normalization algorithm is now applied to the values, making them better comparable by behaving more like real probabilities. (#78)

Miscellaneous

The library now has a fresh and colorful new logo. Why? Well, why not? (-:

Assets 4

29 Sep 20:13

Lingua 1.1.3

Improvements

An __all__ variable has been added indicating which types are exported by the library. This helps with type checking programs using Lingua. Big thanks to @bscan for the pull request. (#76)
The rule-based language filter has been improved for German texts. (#71)
A further bottleneck in the code has been removed, making language detection 30 % faster compared to version 1.1.2, approximately.

Contributors

bscan

Assets 4

06 Sep 20:30

Lingua 1.1.2

Improvements

The language models are now stored on disk as serialized NumPy arrays instead of JSON. This reduces the preloading time of the language models significantly.
A bottleneck in the language detection code has been removed, making language detection 40 % faster, approximately.

Bug Fixes

The py.typed file that actives static type checking was missing. Big thanks to @Vasniktel for reporting this problem. (#63)
The ISO 639-3 code for Urdu was wrong. Big thanks to @pluiez for reporting this bug. (#64)

Contributors

pluiez and Vasniktel

Assets 4

26 Aug 08:35

Lingua 1.1.1

Bug Fixes

For certain ngrams, wrong probabilities were returned. This has been fixed. Big thanks to @3a77 for reporting this bug. (#62)

Contributors

3a77

Assets 4

22 Aug 18:08

Lingua 1.1.0

Features

The new method LanguageDetectorBuilder.with_low_accuracy_mode() has been introduced. By activating it, detection accuracy for short text is reduced in favor of a smaller memory footprint and faster detection performance.

Improvements

The memory footprint has been reduced significantly by storing the language models in structured NumPy arrays instead of dictionaries. This reduces memory consumption from 2600 MB to 800 MB, approximately.
Several language model files have become obsolete and could be deleted without decreasing detection accuracy. This results in a smaller memory footprint.

Compatibility

The lowest supported Python version is 3.8 now. Python 3.7 is no longer compatible with this library.

Assets 4

24 Jan 15:56

Lingua 1.0.1

Compatibility

This patch release makes the library compatible with Python >= 3.7.1. Previously, it could be installed from PyPI only with Python >= 3.9. Since updates of the Python interpreter obviously take a pretty long time in certain environments, I hope that this compatibility update will make more people use Lingua successfully. Thanks to @Jourdelune for making me aware of this issue.

Contributors

Jourdelune

Assets 4

10 Jan 20:41

Lingua 1.0.0

The very first release of Lingua. Enjoy! :)

Assets 4