https://github.com/kovidgoyal/html5-parser
Fast C based HTML 5 parsing for python
- Host: GitHub
- URL: https://github.com/kovidgoyal/html5-parser
- Owner: kovidgoyal
- License: apache-2.0
- Created: 2017-06-03T06:56:36.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2024-09-10T09:40:10.000Z (about 1 year ago)
- Last Synced: 2025-05-24T06:11:21.746Z (5 months ago)
- Language: C
- Size: 932 KB
- Stars: 688
- Watchers: 17
- Forks: 37
- Open Issues: 1
Metadata Files:
- Readme: README.rst
- License: LICENSE
Awesome Lists containing this project
- best-of-web-python (HTML Processing)
README
html5-parser
================
|pypi| |build| |docs|

A *fast*, standards-compliant, C-based HTML 5 parser for Python. Over **thirty**
times as fast as pure-Python parsers such as html5lib.

See `documentation <https://html5-parser.readthedocs.io/en/latest/>`_ for details.

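The speed claim above can be checked on your own machine with a small timing harness. The following is a minimal sketch, not the project's benchmark: the sample document, the ``available_parsers`` and ``benchmark`` helpers, and the best-of-five timing are all assumptions made for illustration. It only times the parsers that happen to be installed, calling ``html5_parser.parse`` and ``html5lib.parse`` with their default arguments.

```python
import importlib.util
import timeit

# Synthetic input: an assumption for illustration, not a representative corpus.
SAMPLE_HTML = (
    "<!DOCTYPE html><html><head><title>bench</title></head><body>"
    + "<p>hello <b>world</b></p>" * 500
    + "</body></html>"
)


def available_parsers():
    """Return {name: parse_function} for whichever parsers are installed."""
    parsers = {}
    if importlib.util.find_spec("html5_parser") is not None:
        from html5_parser import parse  # C-based parser from this project
        parsers["html5-parser"] = parse
    if importlib.util.find_spec("html5lib") is not None:
        import html5lib  # pure-Python parser named in the README
        parsers["html5lib"] = html5lib.parse
    return parsers


def benchmark(parsers, html=SAMPLE_HTML, repeat=5):
    """Return the best-of-`repeat` wall-clock time in seconds for each parser."""
    timings = {}
    for name, parse_fn in parsers.items():
        # One parse per run; keep the fastest run to reduce scheduling noise.
        runs = timeit.repeat(lambda: parse_fn(html), number=1, repeat=repeat)
        timings[name] = min(runs)
    return timings


if __name__ == "__main__":
    for name, seconds in sorted(benchmark(available_parsers()).items()):
        print(f"{name}: {seconds * 1000:.2f} ms")
```

The ratio you observe will depend on the input document, the tree builder in use, and the machine, so treat any single number as indicative only.
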

.. |pypi| image:: https://img.shields.io/pypi/v/html5-parser.svg?label=version
    :target: https://pypi.python.org/pypi/html5-parser
    :alt: Latest version released on PyPI

.. |build| image:: https://github.com/kovidgoyal/html5-parser/workflows/CI/badge.svg
    :target: https://github.com/kovidgoyal/html5-parser/actions?query=workflow%3ACI
    :alt: Build status of the master branch

.. |docs| image:: https://readthedocs.org/projects/html5-parser/badge/?version=latest
    :target: https://html5-parser.readthedocs.io/en/latest/
    :alt: Latest version of html5-parser documentation