https://github.com/zrquan/gatherer

Gatherer is a simple crawler tool

crawler infosec pentest security

Host: GitHub
URL: https://github.com/zrquan/gatherer
Owner: zrquan
License: gpl-3.0
Created: 2024-03-24T04:40:27.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2024-05-12T08:32:46.000Z (about 2 years ago)
Last Synced: 2024-05-12T09:32:28.688Z (about 2 years ago)
Topics: crawler, infosec, pentest, security
Language: Go
Homepage:
Size: 65.4 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# Gatherer

Gatherer is a simple crawler that collects resource links and APIs from various kinds of content and then visits them.

[![asciicast](https://asciinema.org/a/lv1EaQyBFkeOtI74DBjP7vFRs.svg)](https://asciinema.org/a/lv1EaQyBFkeOtI74DBjP7vFRs)

```
Gatherer v0.1.0

Usage of ./gatherer:
  -H value
        HTTP request headers (eg. -H 'Header1:value' -H 'Header2:value')
  -ch
        Run Javascript in headless Chrome
  -debug
        Debug mode
  -dep int
        Maximum path depth (default 1)
  -ef string
        Filter by extensions (separated by commas)
  -igq
        Ignore the query portion on the URL from a[href]
  -json
        Log as JSON format
  -lf string
        Filter by response length (separated by commas)
  -limit int
        Maximum number of concurrent requests (default 100)
  -nr
        Disallow auto redirect
  -proxy string
        Proxy URL
  -rod string
        Set the default value of options used by rod.
  -sf string
        Filter by status codes (separated by commas)
  -sub
        Allow to visit sub-domains
  -t int
        Request timeout (second) (default 10)
  -tt int
        Total timeout (second)
  -u string
        Target URL
  -ua
        Use random User-Agent
  -w string
        Wordlist file path
```
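
The `-H value` entry above is the form Go's standard `flag` package typically prints for a flag backed by a custom `flag.Value`, which is also how a flag can be repeated on the command line. The sketch below shows one way such a repeatable header flag could be wired up. It is an illustration only, not Gatherer's actual source: the names `headerFlags` and `parseHeaders` are hypothetical, and the `Name:value` validation is an assumption.

```go
package main

import (
	"flag"
	"fmt"
	"net/http"
	"strings"
)

// headerFlags collects every occurrence of a repeated -H flag.
// Hypothetical type for illustration; not taken from Gatherer's source.
type headerFlags []string

// String implements flag.Value.
func (h *headerFlags) String() string { return strings.Join(*h, ", ") }

// Set implements flag.Value; the flag package calls it once per -H occurrence.
func (h *headerFlags) Set(v string) error {
	if !strings.Contains(v, ":") {
		return fmt.Errorf("invalid header %q, want Name:value", v)
	}
	*h = append(*h, v)
	return nil
}

// parseHeaders parses command-line style arguments and returns the
// headers given through -H as an http.Header.
func parseHeaders(args []string) (http.Header, error) {
	fs := flag.NewFlagSet("sketch", flag.ContinueOnError)
	var raw headerFlags
	fs.Var(&raw, "H", "HTTP request headers")
	if err := fs.Parse(args); err != nil {
		return nil, err
	}
	hdr := http.Header{}
	for _, kv := range raw {
		// Split on the first colon only, so values may contain colons.
		name, value, _ := strings.Cut(kv, ":")
		hdr.Add(strings.TrimSpace(name), strings.TrimSpace(value))
	}
	return hdr, nil
}

func main() {
	hdr, err := parseHeaders([]string{"-H", "Header1:value", "-H", "Header2:value"})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(hdr.Get("Header1"), hdr.Get("Header2"))
}
```

Using a dedicated `flag.FlagSet` keeps the sketch testable without touching the process-wide flag state.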

## Features

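- Collect resource links from JS code
- Collect dynamically generated JS resource links from Webpack-bundled code
- Parse the full path, method, and parameters of each API from Swagger documents
- Collect resource links from robots.txt
- Collect resource links from XML sitemaps
- Execute JS to complete page rendering, e.g. for SPAs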
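
As a rough illustration of two of the features above, the following sketch extracts candidate links from a robots.txt body and from an XML sitemap using only the Go standard library. It is a minimal sketch under stated assumptions, not Gatherer's implementation: the function names `robotsPaths` and `sitemapLocs` are hypothetical, only `Allow`, `Disallow`, and `Sitemap` lines are considered, and sitemap index files are not handled.

```go
package main

import (
	"bufio"
	"encoding/xml"
	"fmt"
	"strings"
)

// robotsPaths returns the values of Allow, Disallow, and Sitemap lines.
// Hypothetical helper for illustration only.
func robotsPaths(body string) []string {
	var out []string
	sc := bufio.NewScanner(strings.NewReader(body))
	for sc.Scan() {
		line := sc.Text()
		// Drop trailing comments.
		if i := strings.Index(line, "#"); i >= 0 {
			line = line[:i]
		}
		// Split on the first colon so URL values keep their scheme.
		key, value, ok := strings.Cut(line, ":")
		if !ok {
			continue
		}
		value = strings.TrimSpace(value)
		switch strings.ToLower(strings.TrimSpace(key)) {
		case "allow", "disallow", "sitemap":
			if value != "" {
				out = append(out, value)
			}
		}
	}
	return out
}

// urlSet mirrors the <urlset><url><loc> layout of an XML sitemap.
type urlSet struct {
	URLs []struct {
		Loc string `xml:"loc"`
	} `xml:"url"`
}

// sitemapLocs returns every <loc> value found in a sitemap document.
// Hypothetical helper for illustration only.
func sitemapLocs(doc string) ([]string, error) {
	var set urlSet
	if err := xml.Unmarshal([]byte(doc), &set); err != nil {
		return nil, err
	}
	locs := make([]string, 0, len(set.URLs))
	for _, u := range set.URLs {
		locs = append(locs, strings.TrimSpace(u.Loc))
	}
	return locs, nil
}

func main() {
	robots := "User-agent: *\nDisallow: /admin/\nSitemap: https://example.com/sitemap.xml\n"
	fmt.Println(robotsPaths(robots))

	sitemap := `<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/a</loc></url>
</urlset>`
	locs, err := sitemapLocs(sitemap)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(locs)
}
```

A real crawler would then resolve the relative paths against the target URL before queuing them.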

## Thanks

- [colly](https://github.com/gocolly/colly)
- [hakrawler](https://github.com/hakluke/hakrawler)
- [LinkFinder](https://github.com/GerbenJavado/LinkFinder)
- [Packer-Fuzzer](https://github.com/rtcatc/Packer-Fuzzer)
- more...
