html5lib
HTML parser based on the WHATWG HTML specification
HTML parser based on the WHATWG HTML specification
To install this package, run one of the following:
html5lib is a pure-python library for parsing HTML. It is designed to conform to the WHATWG HTML specification, as is implemented by all major web browsers.
Summary
HTML parser based on the WHATWG HTML specification
Information Last Updated
Apr 22, 2025 at 14:56
License
MIT
Total Downloads
5.8M
Platforms
GitHub Repository
https://github.com/html5lib/html5lib-pythonDocumentation
http://html5lib.readthedocs.org/