unpdf-rs

Community

High-performance PDF content extraction to Markdown, text, and JSON

Installation

To install this package, run one of the following:

Conda
$ conda install conda-forge::unpdf-rs

Usage Tracking

Versions shown (5 of 8): 0.6.3, 0.6.2, 0.6.1, 0.4.6, 0.4.5
Downloads (last 6 months): 0

Description

unpdf is a high-performance Rust library and CLI tool that extracts content from PDF documents into structured Markdown, plain text, and JSON. It reads PDF versions 1.0 through 2.0, including files that use compressed object streams, and provides table detection, image extraction, CJK text support, and multiple text cleanup presets for preparing LLM training data.

About

Last Updated

May 12, 2026 at 04:33

License

MIT

Supported Platforms

macOS-64
win-64
macOS-arm64
linux-64
linux-aarch64