About Anaconda Help Download Anaconda

Convert natural language text into tokens. Includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs, characters, shingled characters, lines, tweets, Penn Treebank, regular expressions, as well as functions for counting characters, words, and sentences, and a function for splitting longer texts into separate documents, each with the same number of words. The tokenizers have a consistent interface, and the package is built on the 'stringi' and 'Rcpp' packages for fast yet correct tokenization in 'UTF-8'.

copied from cf-post-staging / r-tokenizers
Type Size Name Uploaded Downloads Labels
conda 616.6 kB | osx-arm64/r-tokenizers-0.3.0-r44hc1cd577_3.conda  17 days and 17 hours ago 27 main
conda 616.1 kB | osx-arm64/r-tokenizers-0.3.0-r45hc1cd577_3.conda  17 days and 17 hours ago 25 main
conda 618.9 kB | osx-64/r-tokenizers-0.3.0-r45ha730edb_3.conda  17 days and 17 hours ago 24 main
conda 618.6 kB | osx-64/r-tokenizers-0.3.0-r44ha730edb_3.conda  17 days and 17 hours ago 25 main
conda 630.1 kB | linux-aarch64/r-tokenizers-0.3.0-r45h08b74da_3.conda  17 days and 17 hours ago 25 main
conda 630.5 kB | linux-ppc64le/r-tokenizers-0.3.0-r45hb6fba7b_3.conda  17 days and 17 hours ago 16 main
conda 622.9 kB | win-64/r-tokenizers-0.3.0-r45hd8a2815_3.conda  17 days and 17 hours ago 29 main
conda 622.9 kB | win-64/r-tokenizers-0.3.0-r44hd8a2815_3.conda  17 days and 17 hours ago 30 main
conda 630.6 kB | linux-aarch64/r-tokenizers-0.3.0-r44h08b74da_3.conda  17 days and 17 hours ago 22 main
conda 627.2 kB | linux-64/r-tokenizers-0.3.0-r45h3697838_3.conda  17 days and 17 hours ago 94 main
conda 627.7 kB | linux-64/r-tokenizers-0.3.0-r44h3697838_3.conda  17 days and 17 hours ago 99 main
conda 630.6 kB | linux-ppc64le/r-tokenizers-0.3.0-r44hb6fba7b_3.conda  17 days and 17 hours ago 18 main
conda 621.7 kB | win-64/r-tokenizers-0.3.0-r43h8ae3a7c_2.conda  1 year and 2 months ago 345 main
conda 620.4 kB | win-64/r-tokenizers-0.3.0-r44h8ae3a7c_2.conda  1 year and 2 months ago 364 main
conda 615.9 kB | osx-64/r-tokenizers-0.3.0-r44h25d921d_2.conda  1 year and 2 months ago 287 main
conda 614.8 kB | osx-arm64/r-tokenizers-0.3.0-r44hd76f289_2.conda  1 year and 2 months ago 244 main
conda 615.8 kB | osx-arm64/r-tokenizers-0.3.0-r43hd76f289_2.conda  1 year and 2 months ago 255 main
conda 624.9 kB | linux-aarch64/r-tokenizers-0.3.0-r43h6170684_2.conda  1 year and 2 months ago 169 main
conda 626.3 kB | linux-ppc64le/r-tokenizers-0.3.0-r43h0cea4bf_2.conda  1 year and 2 months ago 140 main
conda 623.9 kB | linux-aarch64/r-tokenizers-0.3.0-r44h6170684_2.conda  1 year and 2 months ago 135 main
conda 616.9 kB | osx-64/r-tokenizers-0.3.0-r43h25d921d_2.conda  1 year and 2 months ago 304 main
conda 625.7 kB | linux-ppc64le/r-tokenizers-0.3.0-r44h0cea4bf_2.conda  1 year and 2 months ago 137 main
conda 624.8 kB | linux-64/r-tokenizers-0.3.0-r44h0d4f4ea_2.conda  1 year and 2 months ago 2190 main
conda 625.3 kB | linux-64/r-tokenizers-0.3.0-r43h0d4f4ea_2.conda  1 year and 2 months ago 2244 main
conda 616.1 kB | osx-arm64/r-tokenizers-0.3.0-r42h65f505e_1.conda  1 year and 8 months ago 219 main

© 2025 Anaconda, Inc. All Rights Reserved. (v4.2.2) Legal | Privacy Policy