About Anaconda Help Download Anaconda

High-performance PDF content extraction to Markdown, text, and JSON

copied from cf-post-staging / unpdf-rs

Installers

  • linux-64 v0.4.5
  • win-64 v0.4.5
  • osx-64 v0.4.5
  • osx-arm64 v0.4.5
  • linux-aarch64 v0.4.5

conda install

To install this package run one of the following:
conda install conda-forge::unpdf-rs

Description

unpdf is a high-performance Rust library and CLI tool for extracting content from PDF documents to structured Markdown, plain text, and JSON. It supports PDF 1.0-2.0, including compressed object streams, table detection, image extraction, CJK text, and multiple text cleanup presets for LLM training data preparation.


© 2026 Anaconda, Inc. All Rights Reserved. (v4.2.17) Legal | Privacy Policy