polars-strsim
Polars extension for string similarity
Polars extension for string similarity
To install this package, run one of the following:
This package provides python bindings to compute various string similarity measures directly on a polars dataframe. All string similarity measures are implemented in rust and computed in parallel. The similarity measures that have been implemented are: Levenshtein Jaro Jaro-Winkler Jaccard Sørensen-Dice Each similarity measure returns a value normalized between 0.0 and 1.0 (inclusive), where 0.0 indicates the inputs are maximally different and 1.0 means the strings are maximally similar.
Summary
Polars extension for string similarity
Last Updated
Apr 29, 2025 at 20:15
License
MIT
Supported Platforms
GitHub Repository
https://github.com/foxcroftjn/polars-strsim