datasets

HuggingFace/Datasets is an open library of NLP datasets.

Installation

To install this package, run one of the following:

Conda
$ conda install conda-forge::datasets

Description

Datasets is a lightweight library providing one-line dataloaders for many public datasets, along with one-liners to download and pre-process any of the major public datasets provided on the HuggingFace Datasets Hub. Datasets are ready to use in a dataloader for training or evaluating an ML model (NumPy/Pandas/PyTorch/TensorFlow/JAX). The library also provides an API for simple, fast, and reproducible data pre-processing, both for the public datasets above and for your own local datasets in CSV/JSON/text.

About

Last Updated

Jan 22, 2026 at 23:43

License

Apache-2.0

Total Downloads

2.6M

Version Downloads

7.0K

Supported Platforms

noarch