An open API service indexing awesome lists of open source software.

https://github.com/marthym/scraphead

🤖 Scraphead allow scrapping html from URL in order to retrieve OpenGraph, Twitter Card and many other meta information from HTML head tag.
https://github.com/marthym/scraphead

html java scraper

Last synced: 3 months ago
JSON representation

🤖 Scraphead allow scrapping html from URL in order to retrieve OpenGraph, Twitter Card and many other meta information from HTML head tag.

Awesome Lists containing this project

README

          

# Scraphead [![](https://img.shields.io/github/release/Marthym/scraphead.svg)](https://GitHub.com/Marthym/scraphead/releases/) [![GitHub license](https://img.shields.io/github/license/Marthym/scraphead.svg)](https://github.com/Marthym/scraphead/blob/master/LICENSE)

[![Quality Gate Status](https://sonarcloud.io/api/project_badges/measure?project=Marthym_scraphead&metric=alert_status)](https://sonarcloud.io/dashboard?id=Marthym_scraphead)
[![Coverage](https://sonarcloud.io/api/project_badges/measure?project=Marthym_scraphead&metric=coverage)](https://sonarcloud.io/dashboard?id=Marthym_scraphead)
[![Maintainability Rating](https://sonarcloud.io/api/project_badges/measure?project=Marthym_scraphead&metric=sqale_rating)](https://sonarcloud.io/dashboard?id=Marthym_scraphead)

**Scraphead** allow scrapping html from URL in order to retrieve OpenGraph, Twitter Card and many other meta information
from HTML head tag.

## Description

**Scraphead** was divided into `core` and `netty`. The `core` contains all the logic, the HTML head parsing and the
mapping into **OpenGraph** and **Twitter Card** model. The `netty` was one of the multiple possible implementations for
the web client.

### Main features

* non blocking
* download only the `