unstructured-pdf
A library that prepares raw documents for downstream ML tasks.
A library that prepares raw documents for downstream ML tasks.
To install this package, run one of the following:
Unstructured provides a platform and tools to ingest and process unstructured documents for Retrieval Augmented Generation (RAG) and model fine-tuning.
Summary
A library that prepares raw documents for downstream ML tasks.
Information Last Updated
Mar 25, 2025 at 16:20
License
Apache-2.0
Total Downloads
3.1K
Platforms
GitHub Repository
https://github.com/Unstructured-IO/unstructuredDocumentation
https://docs.unstructured.io/welcome