tabula-py
Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame
tabula-py is a simple Python wrapper of tabula-java, which reads tables from PDF files. You can read tables from a PDF and convert them into pandas DataFrames. tabula-py can also convert a PDF file into a CSV, TSV, or JSON file.
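As a rough illustration of the two uses described above, the sketch below wraps tabula-py's `read_pdf` and `convert_into` functions. It assumes a recent tabula-py release (where `read_pdf` returns a list of DataFrames) and an installed Java runtime. The helper names `output_format_for`, `extract_tables`, and `convert_pdf` are hypothetical and not part of the library.

```python
from pathlib import Path

# Output formats named in the description above (CSV, TSV, JSON).
FORMATS_BY_SUFFIX = {".csv": "csv", ".tsv": "tsv", ".json": "json"}


def output_format_for(out_path):
    """Hypothetical helper: pick the output format from the file extension."""
    suffix = Path(out_path).suffix.lower()
    if suffix not in FORMATS_BY_SUFFIX:
        raise ValueError(f"unsupported output extension: {suffix!r}")
    return FORMATS_BY_SUFFIX[suffix]


def extract_tables(pdf_path, pages="all"):
    """Return the tables found in the PDF as pandas DataFrames."""
    # Imported lazily: tabula-py needs a Java runtime to call tabula-java.
    import tabula

    return tabula.read_pdf(pdf_path, pages=pages)


def convert_pdf(pdf_path, out_path, pages="all"):
    """Write the tables found in the PDF to a CSV, TSV, or JSON file."""
    import tabula

    tabula.convert_into(
        pdf_path,
        out_path,
        output_format=output_format_for(out_path),
        pages=pages,
    )
```

With a placeholder file such as report.pdf, calling `extract_tables("report.pdf")` would give the tables as DataFrames, and `convert_pdf("report.pdf", "report.csv")` would write them to a CSV file. Deriving the format from the extension is only a convenience of this sketch; `convert_into` itself takes the format as an explicit argument.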
Summary
Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame
Information Last Updated
Nov 3, 2025 at 16:12
License
MIT
Total Downloads
334.2K
Platforms
GitHub Repository
https://github.com/chezou/tabula-py
Documentation
https://tabula.technology/