fastText + Bloom embeddings for compact, full-coverage vectors with spaCy
copied from cf-staging / floretfloret is an extended version of fastText that can produce word representations for any word from a compact vector table. It combines: - fastText's subwords to provide embeddings for any word - Bloom embeddings ("hashing trick") for a compact vector table