
Convert natural language text into tokens. Includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs, characters, shingled characters, lines, tweets, Penn Treebank, regular expressions, as well as functions for counting characters, words, and sentences, and a function for splitting longer texts into separate documents, each with the same number of words. The tokenizers have a consistent interface, and the package is built on the 'stringi' and 'Rcpp' packages for fast yet correct tokenization in 'UTF-8'.
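
To make two of the listed features concrete, here is a minimal Python sketch of shingled n-grams and of splitting a text into chunks with the same number of words. It is not the package's R implementation or API: the function names (`word_tokens`, `shingled_ngrams`, `chunk_words`) are hypothetical, and the word splitting is a deliberately simplified regular expression rather than the 'stringi'-based tokenization the description mentions.

```python
import re
from typing import List


def word_tokens(text: str) -> List[str]:
    """Lowercase the text and split it into words (simplified assumption:
    a word is a run of letters, digits, or apostrophes)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def shingled_ngrams(tokens: List[str], n_min: int, n: int) -> List[str]:
    """Return every contiguous n-gram of length n_min..n, sliding one
    token at a time so that neighbouring n-grams overlap ("shingle")."""
    out = []
    for start in range(len(tokens)):
        for size in range(n_min, n + 1):
            if start + size <= len(tokens):
                out.append(" ".join(tokens[start:start + size]))
    return out


def chunk_words(tokens: List[str], chunk_size: int) -> List[List[str]]:
    """Split a token list into consecutive chunks of chunk_size words.
    The final chunk may be shorter if the text does not divide evenly."""
    return [tokens[i:i + chunk_size] for i in range(0, len(tokens), chunk_size)]


if __name__ == "__main__":
    words = word_tokens("The quick brown fox")
    print(shingled_ngrams(words, n_min=2, n=3))
    print(chunk_words(words, chunk_size=2))
```

In R, the package exposes its own functions for these tasks with a consistent interface; consult its reference manual for the actual names and arguments.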

copied from cf-post-staging / r-tokenizers
Type | Size | Name | Uploaded | Downloads | Labels
conda | 655.1 kB | win-64/r-tokenizers-0.2.1-r40ha856d6a_1002.tar.bz2 | 4 years and 4 months ago | 804 | main
conda | 649.9 kB | win-64/r-tokenizers-0.2.1-r41ha856d6a_1002.tar.bz2 | 4 years and 4 months ago | 825 | main
conda | 643.2 kB | osx-64/r-tokenizers-0.2.1-r40h9951f98_1002.tar.bz2 | 4 years and 4 months ago | 147 | main
conda | 643.2 kB | osx-64/r-tokenizers-0.2.1-r41h9951f98_1002.tar.bz2 | 4 years and 4 months ago | 183 | main
conda | 651.3 kB | linux-ppc64le/r-tokenizers-0.2.1-r40h6203a36_1002.tar.bz2 | 4 years and 4 months ago | 48 | main
conda | 651.5 kB | linux-ppc64le/r-tokenizers-0.2.1-r41h6203a36_1002.tar.bz2 | 4 years and 4 months ago | 48 | main
conda | 648.9 kB | linux-aarch64/r-tokenizers-0.2.1-r41hecdc70b_1002.tar.bz2 | 4 years and 4 months ago | 81 | main
conda | 648.8 kB | linux-aarch64/r-tokenizers-0.2.1-r40hecdc70b_1002.tar.bz2 | 4 years and 4 months ago | 78 | main
conda | 646.2 kB | linux-64/r-tokenizers-0.2.1-r40h03ef668_1002.tar.bz2 | 4 years and 4 months ago | 3838 | main
conda | 646.5 kB | linux-64/r-tokenizers-0.2.1-r41h03ef668_1002.tar.bz2 | 4 years and 4 months ago | 3792 | main
conda | 671.0 kB | linux-ppc64le/r-tokenizers-0.2.1-r40he54295a_1002.tar.bz2 | 5 years and 2 months ago | 59 | main
conda | 666.2 kB | linux-aarch64/r-tokenizers-0.2.1-r40h0357c0b_1002.tar.bz2 | 5 years and 2 months ago | 89 | main
conda | 666.4 kB | linux-aarch64/r-tokenizers-0.2.1-r36h0357c0b_1002.tar.bz2 | 5 years and 2 months ago | 86 | main
conda | 671.3 kB | linux-ppc64le/r-tokenizers-0.2.1-r36he54295a_1002.tar.bz2 | 5 years and 2 months ago | 57 | main
conda | 656.5 kB | win-64/r-tokenizers-0.2.1-r36h796a38f_1002.tar.bz2 | 5 years and 4 months ago | 1122 | main
conda | 656.5 kB | win-64/r-tokenizers-0.2.1-r40h796a38f_1002.tar.bz2 | 5 years and 4 months ago | 1047 | main
conda | 643.6 kB | osx-64/r-tokenizers-0.2.1-r36hc5da6b9_1002.tar.bz2 | 5 years and 4 months ago | 375 | main
conda | 643.7 kB | osx-64/r-tokenizers-0.2.1-r40hc5da6b9_1002.tar.bz2 | 5 years and 4 months ago | 338 | main
conda | 652.0 kB | linux-64/r-tokenizers-0.2.1-r36h0357c0b_1002.tar.bz2 | 5 years and 4 months ago | 6489 | main
conda | 651.6 kB | linux-64/r-tokenizers-0.2.1-r40h0357c0b_1002.tar.bz2 | 5 years and 4 months ago | 4112 | main
conda | 655.0 kB | win-64/r-tokenizers-0.2.1-r36h796a38f_1001.tar.bz2 | 6 years and 2 months ago | 1413 | main cf202003
conda | 653.7 kB | win-64/r-tokenizers-0.2.1-r35h796a38f_1001.tar.bz2 | 6 years and 2 months ago | 1385 | main cf202003
conda | 640.4 kB | osx-64/r-tokenizers-0.2.1-r36hf99fc2c_1001.tar.bz2 | 6 years and 2 months ago | 389 | main cf202003
conda | 692.8 kB | osx-64/r-tokenizers-0.2.1-r35hf99fc2c_1001.tar.bz2 | 6 years and 2 months ago | 359 | main cf202003
conda | 646.6 kB | linux-64/r-tokenizers-0.2.1-r35h0357c0b_1001.tar.bz2 | 6 years and 2 months ago | 5369 | main cf202003
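
The file names in the listing follow the usual conda layout of `<platform>/<name>-<version>-<build>.tar.bz2`, with the build string ending in `_<build number>`. The sketch below splits one listed name into those parts. The `CondaArtifact` type and `parse_artifact` function are hypothetical helpers written for this illustration, not part of any conda tooling. The leading `r40` or `r41` in the build string appears to indicate the R version the package was built against, but that reading is an assumption and the code does not rely on it.

```python
from typing import NamedTuple


class CondaArtifact(NamedTuple):
    platform: str      # e.g. "win-64"
    name: str          # e.g. "r-tokenizers"
    version: str       # e.g. "0.2.1"
    build: str         # full build string, e.g. "r40ha856d6a_1002"
    build_number: int  # trailing number of the build string, e.g. 1002


def parse_artifact(path: str) -> CondaArtifact:
    """Split '<platform>/<name>-<version>-<build>.tar.bz2' into its parts."""
    suffix = ".tar.bz2"
    platform, _, filename = path.rpartition("/")
    if not filename.endswith(suffix):
        raise ValueError(f"expected a {suffix} file, got: {path}")
    stem = filename[: -len(suffix)]
    # The package name may itself contain hyphens, so split from the right:
    # the last two hyphen-separated fields are the version and build string.
    name, version, build = stem.rsplit("-", 2)
    build_number = int(build.rsplit("_", 1)[1])
    return CondaArtifact(platform, name, version, build, build_number)


if __name__ == "__main__":
    print(parse_artifact("win-64/r-tokenizers-0.2.1-r40ha856d6a_1002.tar.bz2"))
```

Splitting from the right keeps the hyphen in `r-tokenizers` intact, which a left-to-right split on `-` would break.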
