https://github.com/oblac/jodd-lagarto-site
Jodd Lagarto site hosted on Gitbook
https://github.com/oblac/jodd-lagarto-site
html jodd jodd-lagarto parser
Last synced: 4 months ago
JSON representation
Jodd Lagarto site hosted on Gitbook
- Host: GitHub
- URL: https://github.com/oblac/jodd-lagarto-site
- Owner: oblac
- License: bsd-2-clause
- Created: 2020-07-27T16:22:21.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2022-05-30T15:06:27.000Z (about 4 years ago)
- Last Synced: 2025-02-26T17:16:23.560Z (over 1 year ago)
- Topics: html, jodd, jodd-lagarto, parser
- Homepage:
- Size: 44.9 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
---
description: Java libraries for HTML/XML parsing
---
# Lagarto HTML parsers suite
**Lagarto Suite** is the family of HTML/XML parsers written in Java. It consists of the following libraries:
1. **LagartoParser** is an all-purpose fast and versatile event-based HTML parser. You can use it to modify or analyze some markup content, allowing you to assemble custom complex transformations and code analysis tools quickly. It is performant and follows the rules of the official HTML specification.
2. **LagartoDom** builds a DOM tree in memory from the input. You can manipulate a tree more conveniently, with minor performance sacrifice.
3. **Jerry** is a "jQuery in Java" - you can use the familiar syntax of JavaScript library inside of Java to parse and manipulate HTML.
4. **CSSelly** - finally, the parser of CSS3 selectors.
{% hint style="info" %}
Each of the **Lagarto** libraries has its pros and cons. You should check each and use one that suits your requirements.
{% endhint %}
Lagarto parsers are compatible with Java 8 and newer.
### License
The code is released under the `BSD-2-Clause` license. It has a minimal set of dependencies with the same or similarly open license, so you should be able to use it in any project and for any purpose.