About Anaconda Help Download Anaconda

Convert natural language text into tokens. Includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs, characters, shingled characters, lines, tweets, Penn Treebank, regular expressions, as well as functions for counting characters, words, and sentences, and a function for splitting longer texts into separate documents, each with the same number of words. The tokenizers have a consistent interface, and the package is built on the 'stringi' and 'Rcpp' packages for fast yet correct tokenization in 'UTF-8'.

copied from cf-post-staging / r-tokenizers
Type Size Name Uploaded Downloads Labels
conda 647.5 kB | osx-64/r-tokenizers-0.2.3-r42h49197e3_1.tar.bz2  3 years and 6 months ago 90 main
conda 657.0 kB | linux-64/r-tokenizers-0.2.3-r41h7525677_1.tar.bz2  3 years and 6 months ago 2881 main
conda 656.9 kB | linux-64/r-tokenizers-0.2.3-r42h7525677_1.tar.bz2  3 years and 6 months ago 3243 main
conda 657.2 kB | linux-aarch64/r-tokenizers-0.2.3-r42hf13958e_1.tar.bz2  3 years and 6 months ago 81 main
conda 657.5 kB | linux-aarch64/r-tokenizers-0.2.3-r41hf13958e_1.tar.bz2  3 years and 6 months ago 80 main
conda 658.5 kB | win-64/r-tokenizers-0.2.3-r41ha856d6a_0.tar.bz2  3 years and 6 months ago 587 main
conda 663.7 kB | win-64/r-tokenizers-0.2.3-r40ha856d6a_0.tar.bz2  3 years and 6 months ago 587 main
conda 657.0 kB | linux-ppc64le/r-tokenizers-0.2.3-r40ha35a809_0.tar.bz2  3 years and 6 months ago 45 main
conda 657.8 kB | linux-ppc64le/r-tokenizers-0.2.3-r41ha35a809_0.tar.bz2  3 years and 6 months ago 42 main
conda 647.8 kB | osx-64/r-tokenizers-0.2.3-r41h49197e3_0.tar.bz2  3 years and 6 months ago 96 main
conda 647.5 kB | osx-64/r-tokenizers-0.2.3-r40h49197e3_0.tar.bz2  3 years and 6 months ago 89 main
conda 657.5 kB | linux-aarch64/r-tokenizers-0.2.3-r41hf13958e_0.tar.bz2  3 years and 6 months ago 82 main
conda 656.2 kB | linux-aarch64/r-tokenizers-0.2.3-r40hf13958e_0.tar.bz2  3 years and 6 months ago 81 main
conda 657.0 kB | linux-64/r-tokenizers-0.2.3-r41h7525677_0.tar.bz2  3 years and 6 months ago 2701 main
conda 656.2 kB | linux-64/r-tokenizers-0.2.3-r40h7525677_0.tar.bz2  3 years and 6 months ago 3203 main
conda 655.1 kB | win-64/r-tokenizers-0.2.1-r40ha856d6a_1002.tar.bz2  4 years and 10 months ago 824 main
conda 649.9 kB | win-64/r-tokenizers-0.2.1-r41ha856d6a_1002.tar.bz2  4 years and 10 months ago 844 main
conda 643.2 kB | osx-64/r-tokenizers-0.2.1-r40h9951f98_1002.tar.bz2  4 years and 10 months ago 163 main
conda 643.2 kB | osx-64/r-tokenizers-0.2.1-r41h9951f98_1002.tar.bz2  4 years and 10 months ago 198 main
conda 651.3 kB | linux-ppc64le/r-tokenizers-0.2.1-r40h6203a36_1002.tar.bz2  4 years and 10 months ago 52 main
conda 651.5 kB | linux-ppc64le/r-tokenizers-0.2.1-r41h6203a36_1002.tar.bz2  4 years and 10 months ago 52 main
conda 648.9 kB | linux-aarch64/r-tokenizers-0.2.1-r41hecdc70b_1002.tar.bz2  4 years and 10 months ago 97 main
conda 648.8 kB | linux-aarch64/r-tokenizers-0.2.1-r40hecdc70b_1002.tar.bz2  4 years and 10 months ago 94 main
conda 646.2 kB | linux-64/r-tokenizers-0.2.1-r40h03ef668_1002.tar.bz2  4 years and 10 months ago 4213 main
conda 646.5 kB | linux-64/r-tokenizers-0.2.1-r41h03ef668_1002.tar.bz2  4 years and 10 months ago 4179 main

© 2026 Anaconda, Inc. All Rights Reserved. (v4.2.17) Legal | Privacy Policy