Spark NLP is a state-of-the-art Natural Language Processing library built on top of Apache Spark. It provides simple, performant & accurate NLP annotations for machine learning pipelines that scale easily in a distributed environment. Spark NLP comes with 21000+ pretrained pipelines and models in more than 200+ languages. It also offers tasks such as Tokenization, Word Segmentation, Part-of-Speech Tagging, Word and Sentence Embeddings, Named Entity Recognition, Dependency Parsing, Spell Checking, Text Classification, Sentiment Analysis, Token Classification, Machine Translation (+180 languages), Summarization, Question Answering, Table Question Answering, Text Generation, Image Classification, Image to Text (captioning), Automatic Speech Recognition, Zero-Shot Learning, and many more NLP tasks.

Uploaded	Mon Mar 31 02:34:53 2025
md5 checksum	7e268ae97f1a6747ef8d141b4ab4beac
arch	x86_64
build	py310h06a4308_0
depends	jupyter, openjdk >=8, pyspark >=3.3.1, python >=3.10,<3.11.0a0
license	Apache-2.0
license_family	Apache
md5	7e268ae97f1a6747ef8d141b4ab4beac
name	spark-nlp
platform	linux
sha1	bed9d8d3b51015761b7bb46ffbc0eee122f53ff4
sha256	b10c2d1061b62cfe00d59a5b1feea5ac6c754955e12dc617efd175cd58c55dc7
size	394418
subdir	linux-64
timestamp	1696614534482
version	5.1.2

linux-64/spark-nlp-5.1.2-py310h06a4308_0.conda