chore(deps): update dependency chardet to v5 #85
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR contains the following updates:
==3.0.4
->==5.2.0
Release Notes
chardet/chardet (chardet)
v5.2.0
: chardet 5.2.0Compare Source
Adds support for running chardet CLI via
python -m chardet
(0e9b7bc
, @dan-blanchard)v5.1.0
: chardet 5.1.0Compare Source
Features
should_rename_legacy
argument to most functions, which will rename older encodings to their more modern equivalents (e.g.,GB2312
becomesGB18030
) (#264, @dan-blanchard)c292b52
, Rob Speer and @dan-blanchard )--minimal
flag tochardetect
command (#214, @dan-blanchard)Fixes
Misc changes
v5.0.0
: chardet 5.0.0Compare Source
In addition to that change, it features the following user-facing changes:
SingleByteCharSetProber
confidence to match latest uchardet (#209)detect_all
return child prober confidences (#210)v4.0.0
: chardet 4.0.0Compare Source
Major Changes
This release is multiple years in the making, and provides some quality of life improvements to chardet. The primary user-facing changes are:
CharsetGroupProber
class now properly short-circuits when one of the probers in the group is considered a definite match. This lead to a substantial speedup.chardet.detect_all
function that returns a list of possible encodings for the input with associated confidences.The changes in this release have also laid the groundwork for retraining the models to make them more accurate, and to support some more encodings/languages (see #99 for progress). This is our main focus for chardet 5.0 (beyond dropping Python 2 support).
Benchmarks
Running on a MacBook Pro (15-inch, 2018) with 2.2GHz 6-core i7 processor and 32GB RAM
old version (chardet 3.0.4)
new version (chardet 4.0.0)
Thank you to @aaaxx, @edumco, @hrnciar, @hroncok, @jdufresne, @mdamien, @saintamh , @xeor for submitting pull requests, to all of our users for being patient with how long this release has taken.
Full changelog
d7c7343
) @dan-blanchardd702545
,8dccd00
,726973e
,71a0fad
) @dan-blanchard056a2a4
) @dan-blanchard1e208b7
) @dan-blanchardConfiguration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.