CMD + K

r-stringdist

Community

Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'openMP'. An API for C or C++ is exposed as well.

Installation

To install this package, run one of the following:

Conda
$conda install conda-forge::r-stringdist

Usage Tracking

0.9.15
0.9.14
0.9.12
0.9.10
0.9.9
5 / 8 versions selected
Downloads (Last 6 months): 0

About

Summary

Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'openMP'. An API for C or C++ is exposed as well.

Last Updated

Jan 10, 2025 at 17:34

License

GPL-3.0-only

Total Downloads

411.6K

Supported Platforms

macOS-arm64
win-64
macOS-64
linux-aarch64
linux-ppc64le
linux-64