About Anaconda Help Download Anaconda

main / packages / datasets 2.19.1

HuggingFace community-driven open-source library of datasets

Installers

  • linux-64 v2.19.1
  • linux-s390x v2.19.1
  • osx-arm64 v2.19.1
  • win-64 v2.19.1
  • linux-aarch64 v2.19.1
  • osx-64 v2.19.1
  • noarch v1.12.1
  • linux-ppc64le v2.12.0

conda install

To install this package run one of the following:
conda install main::datasets

Description

Datasets is a lightweight library providing two main features:

  • one-line dataloaders for many public datasets: one-liners to download and pre-process any of the number of datasets major public datasets (text datasets in 467 languages and dialects, image datasets, audio datasets, etc.) provided on the HuggingFace Datasets Hub. With a simple command like squaddataset = loaddataset("squad"), get any of these datasets ready to use in a dataloader for training/evaluating a ML model (Numpy/Pandas/PyTorch/TensorFlow/JAX),
  • efficient data pre-processing: simple, fast and reproducible data pre-processing for the above public datasets as well as your own local datasets in CSV/JSON/text/PNG/JPEG/etc. With simple commands like processed_dataset = dataset.map(process_example), efficiently prepare the dataset for inspection and ML model evaluation and training.

© 2025 Anaconda, Inc. All Rights Reserved. (v4.0.6) Legal | Privacy Policy