forked from NetBSD/pkgsrc
-
Notifications
You must be signed in to change notification settings - Fork 51
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
This will be the last release of chardet to support Python 2.7. chardet 5.0 will only support 3.6+ Major Changes This release is multiple years in the making, and provides some quality of life improvements to chardet. The primary user-facing changes are: Single-byte charset probers now use nested dictionaries under the hood, so they are usually a little faster than before. (See #121 for details) The CharsetGroupProber class now properly short-circuits when one of the probers in the group is considered a definite match. This lead to a substantial speedup. There is now a chardet.detect_all function that returns a list of possible encodings for the input with associated confidences. We have dropped support for Python 2.6, 3.4, and 3.5 as they are all past end-of-life. The changes in this release have also laid the groundwork for retraining the models to make them more accurate, and to support some more encodings/languages (see #99 for progress). This is our main focus for chardet 5.0 (beyond dropping Python 2 support).
- Loading branch information
Showing
3 changed files
with
17 additions
and
11 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,6 @@ | ||
$NetBSD: distinfo,v 1.9 2017/06/08 19:06:51 adam Exp $ | ||
$NetBSD: distinfo,v 1.10 2021/01/04 11:53:14 wiz Exp $ | ||
|
||
SHA1 (chardet-3.0.4.tar.gz) = 4766fb07e700945a7085d073257f1f320d037ce8 | ||
RMD160 (chardet-3.0.4.tar.gz) = 03913482c682bf5e2b872d7f0a25d44fc1df9a47 | ||
SHA512 (chardet-3.0.4.tar.gz) = 61a03b23447a2bfe52ceed4dd1b9afdb5784da1933a623776883ee9f297e341f633e27f0ce0230bd5fdc5fdb5382105ab42736a74a417ddeb9f83af57455dba5 | ||
Size (chardet-3.0.4.tar.gz) = 1868453 bytes | ||
SHA1 (chardet-4.0.0.tar.gz) = 2384f6cfba4685d901262e073a4455d4cf76d102 | ||
RMD160 (chardet-4.0.0.tar.gz) = 55d244d032ebdf38c1da022155699732c9d8e513 | ||
SHA512 (chardet-4.0.0.tar.gz) = ebd7f420e1094445270db993f6373ffe7370419e002b0bb13299dc6c9b0f7c4e77b0f44f871fba6371e6869e7c86728514367db377e3137487a3acf50cb81e96 | ||
Size (chardet-4.0.0.tar.gz) = 1907771 bytes |