
conda-forge / packages / r-ahocorasicktrie

Aho-Corasick is an optimal algorithm for finding many keywords in a text. It can locate all matches in O(N+M) time, plus the time needed to report the matches themselves; that is, the running time scales linearly with the combined length of the keywords (N) and the size of the text (M). Compare this to the naive approach, which loops through each pattern and scans the text for it, taking O(N*M) time. This implementation builds the trie (the generic name of the data structure) and runs the search in a single function call. To search multiple texts with the same trie, pass a list or vector of texts; the function returns a list of matches for each text.

By default, all 128 ASCII characters are allowed in both the keywords and the text. A more efficient trie is possible if the alphabet size can be reduced. For example, DNA sequences use at most 19 distinct characters and usually only 4; protein sequences use at most 26 distinct characters and usually only 20. UTF-8 (Unicode) matching is not currently supported.
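
To make the idea concrete, here is a minimal Python sketch of the general Aho-Corasick technique: build a trie over the keywords, add failure links, then scan each text once. It is not the code or the API of this R package; the function names (build_automaton, search, find_all) and the 0-based match offsets are assumptions chosen for illustration only.

```python
from collections import deque


def build_automaton(keywords):
    """Build the keyword trie plus failure links (hypothetical helper)."""
    goto = [{}]   # goto[node][char] -> child node
    fail = [0]    # fail[node] -> node for the longest proper suffix in the trie
    out = [[]]    # out[node] -> keywords that end at this node
    for word in keywords:
        node = 0
        for ch in word:
            nxt = goto[node].get(ch)
            if nxt is None:
                nxt = len(goto)
                goto[node][ch] = nxt
                goto.append({})
                fail.append(0)
                out.append([])
            node = nxt
        out[node].append(word)
    # Breadth-first pass: shallower nodes are finished before deeper ones.
    queue = deque(goto[0].values())
    while queue:
        node = queue.popleft()
        for ch, child in goto[node].items():
            f = fail[node]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[child] = goto[f].get(ch, 0)
            # Inherit keywords that end at the failure target.
            out[child] = out[child] + out[fail[child]]
            queue.append(child)
    return goto, fail, out


def search(automaton, text):
    """Scan one text once; return (0-based start offset, keyword) pairs."""
    goto, fail, out = automaton
    matches = []
    node = 0
    for i, ch in enumerate(text):
        while node and ch not in goto[node]:
            node = fail[node]
        node = goto[node].get(ch, 0)
        for word in out[node]:
            matches.append((i - len(word) + 1, word))
    return matches


def find_all(keywords, texts):
    """Build the automaton once and reuse it for every text."""
    automaton = build_automaton(keywords)
    return [search(automaton, text) for text in texts]
```

Calling find_all(["he", "she", "his", "hers"], ["ushers", "history"]) builds the automaton once and returns one list of matches per text, mirroring the "same trie, many texts" usage described above.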

copied from cf-staging / r-ahocorasicktrie
Type | Size | Name | Uploaded | Downloads | Labels
conda | 82.2 kB | win-64/r-ahocorasicktrie-0.1.3-r44h8ae3a7c_0.conda | 26 days and 7 hours ago | 902 | main
conda | 78.9 kB | osx-64/r-ahocorasicktrie-0.1.3-r44h2711daa_0.conda | 26 days and 7 hours ago | 860 | main
conda | 82.0 kB | win-64/r-ahocorasicktrie-0.1.3-r43h8ae3a7c_0.conda | 26 days and 7 hours ago | 943 | main
conda | 78.2 kB | osx-64/r-ahocorasicktrie-0.1.3-r43h2711daa_0.conda | 26 days and 7 hours ago | 873 | main
conda | 88.6 kB | linux-64/r-ahocorasicktrie-0.1.3-r44h93ab643_0.conda | 26 days and 7 hours ago | 1006 | main
conda | 87.7 kB | linux-64/r-ahocorasicktrie-0.1.3-r43h93ab643_0.conda | 26 days and 7 hours ago | 983 | main
