
Convert natural language text into tokens. Includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs, characters, shingled characters, lines, tweets, Penn Treebank, regular expressions, as well as functions for counting characters, words, and sentences, and a function for splitting longer texts into separate documents, each with the same number of words. The tokenizers have a consistent interface, and the package is built on the 'stringi' and 'Rcpp' packages for fast yet correct tokenization in 'UTF-8'.
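To make the term "shingled n-grams" concrete, here is a minimal Python sketch of the idea: split text into lowercase word tokens, then slide a window of n tokens across them so that consecutive n-grams overlap. It only illustrates the concept and is not the package's R API. The function names `simple_words` and `shingle` are hypothetical, and the word-splitting rule is an assumption that is much cruder than the 'stringi'-based tokenization the package describes.

```python
import re


def simple_words(text):
    """Lowercase the text and return runs of word characters as tokens.

    Assumption: a plain regex split, not Unicode-aware word boundaries.
    """
    return re.findall(r"\w+", text.lower())


def shingle(tokens, n):
    """Return every overlapping run of n consecutive tokens, joined by spaces."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


words = simple_words("The quick brown fox jumps")
bigrams = shingle(words, 2)
print(words)
print(bigrams)
```

When the text has fewer than n tokens, `shingle` returns an empty list, because the window never fits.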

copied from cf-post-staging / r-tokenizers
Type | Size | Name | Uploaded | Downloads | Labels
conda | 655.0 kB | win-64/r-tokenizers-0.2.1-r36h796a38f_1001.tar.bz2 | 6 years and 10 months ago | 1436 | main, cf202003
conda | 653.7 kB | win-64/r-tokenizers-0.2.1-r35h796a38f_1001.tar.bz2 | 6 years and 10 months ago | 1408 | main, cf202003
conda | 640.4 kB | osx-64/r-tokenizers-0.2.1-r36hf99fc2c_1001.tar.bz2 | 6 years and 10 months ago | 407 | main, cf202003
conda | 692.8 kB | osx-64/r-tokenizers-0.2.1-r35hf99fc2c_1001.tar.bz2 | 6 years and 10 months ago | 377 | main, cf202003
conda | 646.6 kB | linux-64/r-tokenizers-0.2.1-r35h0357c0b_1001.tar.bz2 | 6 years and 10 months ago | 5842 | main, cf202003
conda | 647.4 kB | linux-64/r-tokenizers-0.2.1-r36h0357c0b_1001.tar.bz2 | 6 years and 10 months ago | 5067 | main, cf202003
conda | 645.9 kB | linux-64/r-tokenizers-0.2.1-r351h29659fb_1000.tar.bz2 | 7 years and 6 months ago | 5638 | main, gcc7, cf202003
conda | 652.7 kB | win-64/r-tokenizers-0.2.1-r351h6115d3f_1000.tar.bz2 | 7 years and 6 months ago | 1667 | main, cf202003, cf201901
conda | 637.5 kB | osx-64/r-tokenizers-0.2.1-r351h466af19_1000.tar.bz2 | 7 years and 6 months ago | 419 | main, gcc7, cf202003
conda | 651.7 kB | win-64/r-tokenizers-0.2.1-r351h6115d3f_0.tar.bz2 | 7 years and 8 months ago | 1756 | main, cf202003, cf201901
conda | 634.3 kB | win-64/r-tokenizers-0.2.1-r341h6115d3f_0.tar.bz2 | 7 years and 8 months ago | 1773 | main, cf202003, cf201901
conda | 674.6 kB | osx-64/r-tokenizers-0.2.1-r351h9d2a408_0.tar.bz2 | 7 years and 8 months ago | 382 | main, cf202003, cf201901
conda | 657.4 kB | osx-64/r-tokenizers-0.2.1-r341h9d2a408_0.tar.bz2 | 7 years and 8 months ago | 385 | main, cf202003, cf201901
conda | 656.0 kB | linux-64/r-tokenizers-0.2.1-r341h9d2a408_0.tar.bz2 | 7 years and 8 months ago | 5634 | main, cf202003, cf201901
conda | 673.4 kB | linux-64/r-tokenizers-0.2.1-r351h9d2a408_0.tar.bz2 | 7 years and 8 months ago | 5627 | main, cf202003, cf201901
conda | 94.0 kB | win-64/r-tokenizers-0.1.4-r341hca4a3dc_1.tar.bz2 | 7 years and 9 months ago | 1766 | main, cf202003, cf201901
conda | 110.6 kB | osx-64/r-tokenizers-0.1.4-r341hfc679d8_1.tar.bz2 | 7 years and 9 months ago | 366 | main, cf202003, cf201901
conda | 121.9 kB | linux-64/r-tokenizers-0.1.4-r341hfc679d8_1.tar.bz2 | 7 years and 9 months ago | 5734 | main, cf202003, cf201901
conda | 110.7 kB | osx-64/r-tokenizers-0.1.4-r3.4.1_0.tar.bz2 | 8 years and 3 months ago | 4513 | main, cf202003, cf201901
conda | 94.3 kB | win-64/r-tokenizers-0.1.4-r3.4.1_0.tar.bz2 | 8 years and 9 months ago | 6428 | main, cf202003, cf201901
conda | 116.9 kB | linux-64/r-tokenizers-0.1.4-r3.4.1_0.tar.bz2 | 8 years and 9 months ago | 10401 | main, cf202003, cf201901
conda | 81.2 kB | win-64/r-tokenizers-0.1.4-r3.3.2_0.tar.bz2 | 9 years and 27 days ago | 6031 | main, cf202003, cf201901
conda | 93.3 kB | osx-64/r-tokenizers-0.1.4-r3.3.2_0.tar.bz2 | 9 years and 27 days ago | 4703 | main, cf202003, cf201901
conda | 107.0 kB | linux-64/r-tokenizers-0.1.4-r3.3.2_0.tar.bz2 | 9 years and 27 days ago | 9925 | main, cf202003, cf201901
