unpdf-rs

Community

High-performance PDF content extraction to Markdown, text, and JSON

Installation

To install this package, run one of the following:

Conda
$ conda install conda-forge::unpdf-rs

Usage Tracking

0.4.5
0.4.3
0.4.1
0.2.4
Downloads (Last 6 months): 0

Description

unpdf is a high-performance Rust library and CLI tool that extracts content from PDF documents into structured Markdown, plain text, and JSON. It supports PDF versions 1.0 through 2.0, including compressed object streams, and provides table detection, image extraction, CJK text handling, and multiple text cleanup presets for preparing LLM training data.

About

Last Updated

Apr 15, 2026 at 03:48

License

MIT

Supported Platforms

macOS-64
win-64
macOS-arm64
linux-64
linux-aarch64