https://github.com/hktalent/scrapysite

ScrapySite: a Go web crawler (spider) for scraping and intelligence gathering.

Topics: crawler, elasticsearch, go, scraping, site, spider, web

# ScrapySite

# How to build
```bash
git clone git@github.com:hktalent/scrapysite.git
cd scrapysite
go build main.go
# or cross-compile for all platforms
make all -f Makefile.cross-compiles
ls -lah release/
# or build with the default Makefile
make all
ls -lah bin/
ls -lah main
```

# How to use Elasticsearch
List the existing indices:
http://127.0.0.1:9200/_cat/indices?v

1. Create the index:
```bash
./tools/CreateEs.sh scrapy
```

Document endpoint of the index:
http://127.0.0.1:9200/scrapy_index/_doc/
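
To check that the `_doc` endpoint above accepts documents before running the crawler, a minimal sketch with `curl` might look like this. The helper names `es_doc_url` and `es_post_doc` and the sample fields `url` and `title` are assumptions made for illustration; they are not part of ScrapySite, and the document layout the crawler actually writes may differ.

```bash
# Hypothetical helper: build the _doc endpoint for an index.
# $1 = Elasticsearch base URL, $2 = index name
es_doc_url() {
  printf '%s/%s/_doc/\n' "${1%/}" "$2"
}

# Hypothetical helper: POST one JSON document to the index.
# $1 = base URL, $2 = index name, $3 = JSON body
es_post_doc() {
  curl -s -X POST -H 'Content-Type: application/json' \
    -d "$3" "$(es_doc_url "$1" "$2")"
}

# Usage (needs a running Elasticsearch; the field names are made up):
# es_post_doc "http://127.0.0.1:9200" "scrapy_index" '{"url":"http://www.xxx1.cn","title":"test"}'
```

Elasticsearch assigns an ID automatically when a document is posted to `_doc/` without one.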

# How to use
```bash
./main -url="http://www.xxx1.cn;http://www.xx2.cn" -resUrl="http://127.0.0.1:9200/st_index/_doc/"
```
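
The `-url` value in the command above holds several start URLs separated by `;`. When the list is long, it may be easier to keep one URL per line in a file and join the lines. This is a small sketch: the helper `join_urls` and the file name `urls.txt` are assumptions, and it takes `;` to be the only separator the flag expects, as the example suggests.

```bash
# Hypothetical helper: read URLs from stdin (one per line),
# drop blank lines, and join the rest with ';'.
join_urls() {
  grep -v '^[[:space:]]*$' | paste -s -d ';' -
}

# Usage (urls.txt is an assumed file name):
# ./main -url="$(join_urls < urls.txt)" -resUrl="http://127.0.0.1:9200/st_index/_doc/"
```
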
Search the indexed results (here for the term `edu`):
http://127.0.0.1:9200/st_index/_search?q=edu&pretty=true
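
The same search can be run from the command line. The sketch below wraps the URL above in two helpers; `es_search_url` and `es_search` are hypothetical names, not part of the project. The query term is inserted as-is, so it has to be URL-safe already.

```bash
# Hypothetical helper: build a URI-search URL.
# $1 = Elasticsearch base URL, $2 = index name, $3 = query term (not URL-encoded here)
es_search_url() {
  printf '%s/%s/_search?q=%s&pretty=true\n' "${1%/}" "$2" "$3"
}

# Hypothetical helper: run the search and print the JSON response.
es_search() {
  curl -s "$(es_search_url "$1" "$2" "$3")"
}

# Usage (needs a running Elasticsearch with crawled data):
# es_search "http://127.0.0.1:9200" "st_index" "edu"
```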