CMD + K

docarray

Community

The data structure for unstructured data

Installation

To install this package, run one of the following:

Conda
$conda install conda-forge::docarray

Usage Tracking

0.41.0
0.40.0
0.16.5
0.16.3
0.16.2
5 / 8 versions selected
Total downloads: 0

Description

DocArray is a library for nested, unstructured data such as text, image, audio, video, 3D mesh. It allows deep learning engineers to efficiently process, embed, search, recommend, store, transfer the data with Pythonic API.

🌌 All data types: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data.

🐍 Pythonic experience: designed to be as easy as Python list. If you know how to Python, you know how to DocArray. Intuitive idioms and type annotation simplify the code you write.

🧑‍🔬 Data science powerhouse: greatly accelerate data scientists work on embedding, matching, visualizing, evaluating via Torch/Tensorflow/ONNX/PaddlePaddle on CPU/GPU.

🚡 Portable: ready-to-wire at anytime with efficient and compact serialization from/to Protobuf, bytes, JSON, CSV, dataframe.

PyPI: https://pypi.org/project/docarray

About

Summary

The data structure for unstructured data

Information Last Updated

Apr 22, 2025 at 14:58

License

Apache-2.0

Total Downloads

241.7K

Platforms

noarch Version: 0.41.0