Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/oblac/jodd-lagarto

Java HTML parsers suite.
https://github.com/oblac/jodd-lagarto

html html-parser java jquery parser

Last synced: 2 months ago
JSON representation

Java HTML parsers suite.

Awesome Lists containing this project

README

        

# Jodd Lagarto

[![Jodd](https://img.shields.io/badge/>-Jodd-orange)](https://github.com/oblac/jodd)
![GitHub release](https://img.shields.io/github/release/oblac/jodd-lagarto.svg)
![Maven Central](https://img.shields.io/maven-central/v/org.jodd/jodd-lagarto)
[![javadoc](https://javadoc.io/badge2/org.jodd/jodd-lagarto/javadoc.svg)](https://javadoc.io/doc/org.jodd/jodd-lagarto)
[![Build](https://github.com/oblac/jodd-lagarto/actions/workflows/gradle-build.yml/badge.svg)](https://github.com/oblac/jodd-lagarto/actions/workflows/gradle-build.yml)
[![codecov](https://codecov.io/gh/oblac/jodd-lagarto/branch/master/graph/badge.svg)](https://codecov.io/gh/oblac/jodd-lagarto)
[![Codacy Badge](https://app.codacy.com/project/badge/Grade/cb8b9fbd66a84265b4cf526f5887edd5)](https://www.codacy.com/gh/oblac/jodd-lagarto?utm_source=github.com&utm_medium=referral&utm_content=oblac/jodd-lagarto&utm_campaign=Badge_Grade)
[![Stack Overflow](https://img.shields.io/badge/stack%20overflow-lagarto-4183C4.svg)](https://stackoverflow.com/questions/tagged/jodd-lagarto)
[![BSD License](https://img.shields.io/badge/license-BSD--2--Clause-blue.svg)](https://github.com/oblac/jodd-lagarto/blob/master/LICENSE)

🌟 Java HTML parsers suite.

🏡 Website: [lagarto.jodd.org](https://lagarto.jodd.org)

❤️ For developers, by developer.

### 🦎 Overview

Lagarto Suite is the family of HTML/XML parsers written in Java. It consists of the following libraries:

+ `LagartoParser` is an all-purpose fast and versatile event-based HTML parser. You can use it to modify or analyze some
markup content, allowing you to assemble custom complex transformations and code analysis tools quickly. It is
performant and follows the rules of the official HTML specification.
+ `LagartoDom` builds a DOM tree in memory from the input. You can manipulate a tree more conveniently, with minor
performance sacrifice.
+ `Jerry` is a "jQuery in Java" - you can use the familiar syntax of JavaScript library inside of Java to parse and
manipulate HTML.
+ `CSSelly` - finally, the parser of CSS3 selectors.