An open API service indexing awesome lists of open source software.

https://github.com/osamikoyo/geass

web crawler for you, with some api function and configuration
https://github.com/osamikoyo/geass

docker go golang searching url web-crawler web-crawling web-scraping

Last synced: 11 months ago
JSON representation

web crawler for you, with some api function and configuration

Awesome Lists containing this project

README

          

# Geass
web crawler for you

## Usage

# Nead
* Go >= 1.23
* Docker-compose
* task(recomended)
### Instalization manual
```bash
git clone github/osamikoyo/geass
cd geass
task run
```
### Docker
```bash
git clone github/osamikoyo/geass
task run
```

### Handler
```bash
curl "localhost:PORT_IN_CONFIG/get/content?url=EXEMPLE_URL"
```
### output:
```json
{
"url": "https://example.com",
"title": "Example Domain",
"meta_description": "This domain is for use in illustrative examples in documents.",
"content": {
"headings": ["Example Domain", "This domain is for use..."],
"text": "This domain is for use in illustrative examples in documents...",
"images": [
{
"src": "https://example.com/image.png",
"alt": "Example Image"
}
]
},
"links": [
{
"text": "More information...",
"href": "https://www.iana.org/domains/example"
}
],
"technical": {
"status_code": 200,
"content_type": "text/html",
"last_modified": "2023-10-01T12:00:00Z"
},
"metadata": {
"language": "en",
"canonical": "https://example.com",
"robots": "index, follow"
}
}
```

## Handler
```bash
curl "localhost:PORT_IN_CONFIG/get/urls?url=EXEMPLE_URL"
```
### output
```
http://exemplenumber1
http://exemplenumber2
http://exemplenumber3
http://exemplenumber4
```