CMD + K

r-stringdist

Anaconda Verified

Implements an approximate string matching version of R's native 'match' function. Also offers fuzzy text search based on various string distance measures. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'openMP'. An API for C or C++ is exposed as well. Reference: MPJ van der Loo (2014) <doi:10.32614/RJ-2014-011>.

Installation

To install this package, run one of the following:

Conda
$conda install r::r-stringdist

Usage Tracking

0.9.10
0.9.8
0.9.5.1
0.9.4.7
0.9.4.6
5 / 8 versions selected
Downloads (Last 6 months): 0

About

Summary

Implements an approximate string matching version of R's native 'match' function. Also offers fuzzy text search based on various string distance measures. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'openMP'. An API for C or C++ is exposed as well. Reference: MPJ van der Loo (2014) <doi:10.32614/RJ-2014-011>.

Last Updated

Jan 16, 2024 at 21:36

License

GPL-3

Total Downloads

18.8K

Supported Platforms

linux-64

Unsupported Platforms

win-32 Last supported version: 0.9.5.1
macOS-64 Last supported version: 0.9.5.1
linux-32 Last supported version: 0.9.4
win-64 Last supported version: 0.9.5.1