About Anaconda Help Download Anaconda

This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <https://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>. The toolkit also contains functionalities for commonly used data manipulations on texts which are enriched with the output of the parser. Namely functionalities and algorithms for collocations, token co-occurrence, document term matrix handling, term frequency inverse document frequency calculations, information retrieval metrics (Okapi BM25), handling of multi-word expressions, keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns) sentiment scoring and semantic similarity analysis.

copied from cf-post-staging / r-udpipe
Type Size Name Uploaded Downloads Labels
conda 3.7 MB | win-64/r-udpipe-0.8.6-r41ha856d6a_0.tar.bz2  4 years and 29 days ago 763 main
conda 3.7 MB | win-64/r-udpipe-0.8.6-r40ha856d6a_0.tar.bz2  4 years and 29 days ago 761 main
conda 3.8 MB | osx-64/r-udpipe-0.8.6-r40h9951f98_0.tar.bz2  4 years and 29 days ago 91 main
conda 3.8 MB | osx-64/r-udpipe-0.8.6-r41h9951f98_0.tar.bz2  4 years and 29 days ago 92 main
conda 3.9 MB | linux-64/r-udpipe-0.8.6-r40h03ef668_0.tar.bz2  4 years and 29 days ago 2691 main
conda 3.9 MB | linux-64/r-udpipe-0.8.6-r41h03ef668_0.tar.bz2  4 years and 29 days ago 2780 main

© 2025 Anaconda, Inc. All Rights Reserved. (v4.2.2) Legal | Privacy Policy