Learn vector representations of sentences, paragraphs or documents by using the 'Paragraph Vector' algorithms, namely the distributed bag of words ('PV-DBOW') and the distributed memory ('PV-DM') models. The techniques in the package are detailed in the paper "Distributed Representations of Sentences and Documents" by Le and Mikolov (2014), available at <arXiv:1405.4053>.

The package also provides an implementation to cluster documents based on these embeddings using a technique called top2vec. Top2vec finds clusters in text documents by combining document and word embeddings with density-based clustering. It first embeds documents in the semantic space defined by the 'doc2vec' algorithm. Next, it maps these document embeddings to a lower-dimensional space using the 'Uniform Manifold Approximation and Projection' (UMAP) dimensionality-reduction algorithm and finds dense areas in that space using the hierarchical density-based clustering technique HDBSCAN. These dense areas are the topic clusters. Each cluster can be represented by a topic vector, which is an aggregate of the embeddings of the documents in that cluster. Because words are embedded in the same semantic space, the words closest to a topic vector can be found and used as representative of the topic. More details can be found in the paper 'Top2Vec: Distributed Representations of Topics' by D. Angelov, available at <arXiv:2008.09470>.
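The last two steps of the top2vec description (building a topic vector from the documents in a cluster, then finding representative words near it) can be illustrated with a small sketch. This is not the package's R API. It is a plain Python illustration on made-up two-dimensional vectors, and it rests on several assumptions: the "aggregate" is taken to be the arithmetic mean, word similarity is taken to be cosine similarity, noise documents are assumed to carry the label -1, and all function names are hypothetical. The embedding, UMAP and HDBSCAN steps are assumed to have already produced the inputs.

```python
import math


def mean_vector(vectors):
    """Component-wise arithmetic mean of equal-length vectors."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def topic_vectors(doc_embeddings, cluster_labels, noise_label=-1):
    """Return {cluster label: mean embedding of its documents}.

    Assumption: documents the clustering step treated as noise carry
    `noise_label` and do not contribute to any topic.
    """
    clusters = {}
    for embedding, label in zip(doc_embeddings, cluster_labels):
        if label == noise_label:
            continue
        clusters.setdefault(label, []).append(embedding)
    return {label: mean_vector(members) for label, members in clusters.items()}


def nearest_words(topic_vector, word_embeddings, top_n=2):
    """Words whose embeddings are most cosine-similar to the topic vector."""
    ranked = sorted(
        word_embeddings,
        key=lambda word: cosine_similarity(topic_vector, word_embeddings[word]),
        reverse=True,
    )
    return ranked[:top_n]


# Made-up toy data: five documents, four words, one shared 2-D space.
doc_embeddings = [[1.0, 0.0], [0.8, 0.2], [0.0, 1.0], [0.2, 0.8], [0.5, 0.5]]
cluster_labels = [0, 0, 1, 1, -1]  # the last document is treated as noise
word_embeddings = {
    "engine": [0.9, 0.1],
    "wheel": [1.0, 0.3],
    "flour": [0.1, 0.9],
    "oven": [0.3, 1.0],
}

topics = topic_vectors(doc_embeddings, cluster_labels)
topic_words = {
    label: nearest_words(vector, word_embeddings)
    for label, vector in topics.items()
}
```

In this toy setup each topic vector lies between the documents of its cluster, and the words ranked highest for a topic are those pointing in nearly the same direction as that vector. In a real run the document and word embeddings would come from a trained doc2vec model and the labels from clustering the UMAP-reduced embeddings.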

copied from cf-post-staging / r-doc2vec
Type | Size | Name | Uploaded | Downloads | Labels
conda | 4.9 MB | win-64/r-doc2vec-0.2.0-r44hd8a2815_4.conda | 7 days and 33 minutes ago | 20 | main
conda | 4.9 MB | win-64/r-doc2vec-0.2.0-r45hd8a2815_4.conda | 7 days and 34 minutes ago | 22 | main
conda | 4.9 MB | osx-64/r-doc2vec-0.2.0-r44ha730edb_4.conda | 7 days and 35 minutes ago | 19 | main
conda | 4.9 MB | osx-64/r-doc2vec-0.2.0-r45ha730edb_4.conda | 7 days and 35 minutes ago | 16 | main
conda | 4.9 MB | linux-64/r-doc2vec-0.2.0-r45h3697838_4.conda | 7 days and 38 minutes ago | 48 | main
conda | 4.9 MB | linux-64/r-doc2vec-0.2.0-r44h3697838_4.conda | 7 days and 38 minutes ago | 45 | main
conda | 5.0 MB | win-64/r-doc2vec-0.2.0-r43h8ae3a7c_3.conda | 1 year and 2 months ago | 263 | main
conda | 4.9 MB | win-64/r-doc2vec-0.2.0-r44h8ae3a7c_3.conda | 1 year and 2 months ago | 284 | main
conda | 4.9 MB | linux-64/r-doc2vec-0.2.0-r44h0d4f4ea_3.conda | 1 year and 2 months ago | 1339 | main
conda | 4.9 MB | osx-64/r-doc2vec-0.2.0-r44h25d921d_3.conda | 1 year and 2 months ago | 243 | main
conda | 5.0 MB | linux-64/r-doc2vec-0.2.0-r43h0d4f4ea_3.conda | 1 year and 2 months ago | 1303 | main
conda | 5.0 MB | osx-64/r-doc2vec-0.2.0-r43h25d921d_3.conda | 1 year and 2 months ago | 246 | main
conda | 5.0 MB | win-64/r-doc2vec-0.2.0-r41ha856d6a_2.conda | 2 years and 3 months ago | 483 | main
conda | 5.0 MB | osx-64/r-doc2vec-0.2.0-r43hac7d2d5_2.conda | 2 years and 3 months ago | 227 | main
conda | 5.0 MB | osx-64/r-doc2vec-0.2.0-r42hac7d2d5_2.conda | 2 years and 3 months ago | 227 | main
conda | 5.0 MB | linux-64/r-doc2vec-0.2.0-r42ha503ecb_2.conda | 2 years and 3 months ago | 1913 | main
conda | 5.0 MB | linux-64/r-doc2vec-0.2.0-r43ha503ecb_2.conda | 2 years and 3 months ago | 1973 | main
conda | 5.0 MB | win-64/r-doc2vec-0.2.0-r41ha856d6a_1.tar.bz2 | 2 years and 11 months ago | 559 | main
conda | 5.0 MB | osx-64/r-doc2vec-0.2.0-r42h49197e3_1.tar.bz2 | 2 years and 11 months ago | 75 | main
conda | 5.0 MB | osx-64/r-doc2vec-0.2.0-r41h49197e3_1.tar.bz2 | 2 years and 11 months ago | 79 | main
conda | 5.0 MB | linux-64/r-doc2vec-0.2.0-r42h7525677_1.tar.bz2 | 2 years and 11 months ago | 2187 | main
conda | 5.0 MB | linux-64/r-doc2vec-0.2.0-r41h7525677_1.tar.bz2 | 2 years and 11 months ago | 2213 | main
conda | 5.0 MB | win-64/r-doc2vec-0.2.0-r41ha856d6a_0.tar.bz2 | 4 years and 29 days ago | 764 | main
conda | 5.0 MB | win-64/r-doc2vec-0.2.0-r40ha856d6a_0.tar.bz2 | 4 years and 29 days ago | 755 | main
conda | 5.0 MB | osx-64/r-doc2vec-0.2.0-r41h9951f98_0.tar.bz2 | 4 years and 29 days ago | 100 | main
