About Anaconda Help Download Anaconda

Convert natural language text into tokens. Includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs, characters, shingled characters, lines, tweets, Penn Treebank, regular expressions, as well as functions for counting characters, words, and sentences, and a function for splitting longer texts into separate documents, each with the same number of words. The tokenizers have a consistent interface, and the package is built on the 'stringi' and 'Rcpp' packages for fast yet correct tokenization in 'UTF-8'.

copied from cf-staging / r-tokenizers
Type Size Name Uploaded Downloads Labels
conda 621.7 kB | win-64/r-tokenizers-0.3.0-r43h8ae3a7c_2.conda  1 year and 1 month ago 338 main
conda 620.4 kB | win-64/r-tokenizers-0.3.0-r44h8ae3a7c_2.conda  1 year and 1 month ago 359 main
conda 615.9 kB | osx-64/r-tokenizers-0.3.0-r44h25d921d_2.conda  1 year and 1 month ago 281 main
conda 614.8 kB | osx-arm64/r-tokenizers-0.3.0-r44hd76f289_2.conda  1 year and 1 month ago 234 main
conda 615.8 kB | osx-arm64/r-tokenizers-0.3.0-r43hd76f289_2.conda  1 year and 1 month ago 247 main
conda 624.9 kB | linux-aarch64/r-tokenizers-0.3.0-r43h6170684_2.conda  1 year and 1 month ago 165 main
conda 626.3 kB | linux-ppc64le/r-tokenizers-0.3.0-r43h0cea4bf_2.conda  1 year and 1 month ago 138 main
conda 623.9 kB | linux-aarch64/r-tokenizers-0.3.0-r44h6170684_2.conda  1 year and 1 month ago 131 main
conda 616.9 kB | osx-64/r-tokenizers-0.3.0-r43h25d921d_2.conda  1 year and 1 month ago 299 main
conda 625.7 kB | linux-ppc64le/r-tokenizers-0.3.0-r44h0cea4bf_2.conda  1 year and 1 month ago 136 main
conda 624.8 kB | linux-64/r-tokenizers-0.3.0-r44h0d4f4ea_2.conda  1 year and 1 month ago 2069 main
conda 625.3 kB | linux-64/r-tokenizers-0.3.0-r43h0d4f4ea_2.conda  1 year and 1 month ago 2147 main
conda 616.1 kB | osx-arm64/r-tokenizers-0.3.0-r42h65f505e_1.conda  1 year and 7 months ago 215 main
conda 615.5 kB | osx-arm64/r-tokenizers-0.3.0-r43h65f505e_1.conda  1 year and 7 months ago 222 main
conda 616.5 kB | osx-64/r-tokenizers-0.3.0-r43h29979af_1.conda  1 year and 7 months ago 263 main
conda 616.8 kB | osx-64/r-tokenizers-0.3.0-r42h29979af_1.conda  1 year and 7 months ago 286 main
conda 620.6 kB | win-64/r-tokenizers-0.3.0-r41ha856d6a_1.conda  2 years and 2 months ago 576 main
conda 617.1 kB | osx-64/r-tokenizers-0.3.0-r42hac7d2d5_1.conda  2 years and 2 months ago 281 main
conda 617.0 kB | osx-64/r-tokenizers-0.3.0-r43hac7d2d5_1.conda  2 years and 2 months ago 301 main
conda 626.0 kB | linux-ppc64le/r-tokenizers-0.3.0-r42hd301276_1.conda  2 years and 2 months ago 148 main
conda 625.8 kB | linux-ppc64le/r-tokenizers-0.3.0-r43hd301276_1.conda  2 years and 2 months ago 158 main
conda 624.3 kB | linux-aarch64/r-tokenizers-0.3.0-r43ha6e910b_1.conda  2 years and 2 months ago 165 main
conda 625.5 kB | linux-64/r-tokenizers-0.3.0-r43ha503ecb_1.conda  2 years and 2 months ago 2934 main
conda 625.0 kB | linux-64/r-tokenizers-0.3.0-r42ha503ecb_1.conda  2 years and 2 months ago 2748 main
conda 625.1 kB | linux-aarch64/r-tokenizers-0.3.0-r42ha6e910b_1.conda  2 years and 2 months ago 191 main

© 2025 Anaconda, Inc. All Rights Reserved. (v4.2.0) Legal | Privacy Policy