spark-nlp
John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML
John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML
To install this package, run one of the following:
Spark NLP is a state-of-the-art Natural Language Processing library built on top of Apache Spark. It provides simple, performant & accurate NLP annotations for machine learning pipelines that scale easily in a distributed environment. Spark NLP comes with 21000+ pretrained pipelines and models in more than 200+ languages. It also offers tasks such as Tokenization, Word Segmentation, Part-of-Speech Tagging, Word and Sentence Embeddings, Named Entity Recognition, Dependency Parsing, Spell Checking, Text Classification, Sentiment Analysis, Token Classification, Machine Translation (+180 languages), Summarization, Question Answering, Table Question Answering, Text Generation, Image Classification, Image to Text (captioning), Automatic Speech Recognition, Zero-Shot Learning, and many more NLP tasks.
Summary
John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML
Information Last Updated
Apr 22, 2025 at 15:33
License
Apache-2.0
Total Downloads
401
Platforms
GitHub Repository
https://github.com/JohnSnowLabs/spark-nlpDocumentation
https://sparknlp.org/docs/en/quickstart