Fast and memory-friendly tools for text vectorization, topic modeling (LDA, LSA), word embeddings (GloVe), similarities. This package provides a source-agnostic streaming API, which allows researchers to perform analysis of collections of documents which are larger than available RAM. All core functions are parallelized to benefit from multicore machines.
copied from cf-staging / r-text2vecLabel | Latest Version |
---|---|
main | 0.6.4 |
cf201901 | 0.5.1 |
cf202003 | 0.6 |
gcc7 | 0.5.1 |