https://github.com/hexydec/htmldoc
A token-based HTML document parser and minifier written in PHP. Extract attribute values and text using CSS selectors.
- Host: GitHub
- URL: https://github.com/hexydec/htmldoc
- Owner: hexydec
- License: MIT
- Created: 2019-06-19T22:54:05.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2025-07-15T22:48:00.000Z (7 months ago)
- Last Synced: 2025-10-02T13:55:03.848Z (4 months ago)
- Topics: html, html-dom-parser, html-parser, html5, minification, minify, minify-html, php, simplehtmldom, svg, tokenize, tokenizer
- Language: PHP
- Homepage:
- Size: 533 KB
- Stars: 24
- Watchers: 3
- Forks: 4
- Open Issues: 0
Metadata Files:
- Readme: docs/readme.md
- License: LICENSE
README
# HTMLdoc
HTMLdoc is a PHP-based HTML parser and minifier designed for extracting document content and minifying HTML. It was built as a replacement for simple_html_dom, with minification added.
- [How it works](how-it-works.md)
- [How to use and examples](how-to-use.md)
- [API Reference](api/readme.md)
- [Mitigating Side Effects of Minification](mitigating-side-effects.md)
- [About Document Recycling](recycling.md)
- [Object Performance](performance.md)