Releases: rapidfuzz/RapidFuzz
Releases · rapidfuzz/RapidFuzz
Release 3.8.1
Fixed
- use the correct version of
rapidfuzz-cpp
when building against a system installed version
Release 3.8.0
Added
- added
process.cpdist
which allows pairwise comparison of two collection of inputs
Fixed
- fix some minor errors in the type hints
- fix potentially incorrect results of JaroWinkler when using high prefix weights
Release 3.7.0
Changed
- reduce import time
Release 3.6.2
Changed
- upgrade to
Cython==3.0.9
Fixed
- upgrade
rapidfuzz-cpp
which includes a fix for build issues on some compilers - fix some issues with the sphinx config
Release 3.6.1
Fixed
- fix overflow error on systems with
sizeof(size_t) < 8
Release 3.6.0
Fixed
- fix pure python fallback implementation of
fuzz.token_set_ratio
- properly link with
-latomic
ifstd::atomic<uint64_t>
is not natively supported
Performance
- add banded implementation of LCS / Indel. This improves the runtime from
O((|s1|/64) * |s2|)
toO((score_cutoff/64) * |s2|)
Changed
- upgrade to
Cython==3.0.7
- cdist for many metrics now returns a matrix of
uint32
instead ofint32
by default
Release 3.5.2
Fixed
- use _mm_malloc/_mm_free on macOS if aligned_alloc is unsupported
Release 3.5.1
Fixed
- fix compilation failure on macOS
Release 3.5.0
Changed
- skip pandas
pd.NA
similar toNone
- add
score_multiplier
argument toprocess.cdist
which allows multiplying the end result scores
with a constant factor. - drop support for Python 3.7
Performance
- improve performance of simd implementation for
LCS
/Indel
/Jaro
/JaroWinkler
- improve performance of Jaro and Jaro Winkler for long sequences
- implement
process.extract
withlimit=1
usingprocess.extractOne
which can be faster
Fixed
- the preprocessing function was always called through Python due to a broken C-API version check
- fix wraparound issue in simd implementation of Jaro and Jaro Winkler
Release 3.4.0
Changed
- upgrade to
Cython==3.0.3
- add simd implementation for Jaro and Jaro Winkler