{"id":15323429,"url":"https://github.com/krishpranav/gocralwer","last_synced_at":"2026-04-24T10:31:17.451Z","repository":{"id":109907863,"uuid":"361104173","full_name":"krishpranav/gocralwer","owner":"krishpranav","description":"A awsome crawler made in go","archived":false,"fork":false,"pushed_at":"2021-04-25T05:01:49.000Z","size":200,"stargazers_count":1,"open_issues_count":0,"forks_count":1,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-12-25T22:31:28.749Z","etag":null,"topics":["crawler"],"latest_commit_sha":null,"homepage":"","language":"Go","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/krishpranav.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-04-24T07:57:48.000Z","updated_at":"2021-04-25T05:01:51.000Z","dependencies_parsed_at":"2023-05-12T23:30:53.093Z","dependency_job_id":null,"html_url":"https://github.com/krishpranav/gocralwer","commit_stats":{"total_commits":60,"total_committers":2,"mean_commits":30.0,"dds":"0.050000000000000044","last_synced_commit":"645159efec34c28941d8d23e496ca8c8749296c3"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/krishpranav/gocralwer","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/krishpranav%2Fgocralwer","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/krishpranav%2Fgocralwer/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/krishpranav%2Fgocralwer/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/krishpranav%2Fgocralwer/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/krishpranav","download_url":"https://codeload.github.com/krishpranav/gocralwer/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/krishpranav%2Fgocralwer/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32218906,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-24T10:26:35.452Z","status":"ssl_error","status_checked_at":"2026-04-24T10:25:27.643Z","response_time":64,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["crawler"],"created_at":"2024-10-01T09:19:49.033Z","updated_at":"2026-04-24T10:31:17.428Z","avatar_url":"https://github.com/krishpranav.png","language":"Go","funding_links":[],"categories":[],"sub_categories":[],"readme":"# gocralwer\nA awsome crawler made in go\n\n[![forthebadge](https://forthebadge.com/images/badges/made-with-go.svg)](https://forthebadge.com)\n\n### From GitHub\n```\ngit clone https://github.com/krishpranav/gocralwer\ncd gocrawler\ngo build .\nmv gocrawler /usr/local/bin\ngocrawler --help\n```\n\nNote: golang 1.13.x required.\n\n## Commands \u0026 Usage\n\nKeybinding                              | Description\n----------------------------------------|---------------------------------------\n\u003ckbd\u003eEnter\u003c/kbd\u003e                        | Run crawler (from URL view)\n\u003ckbd\u003eEnter\u003c/kbd\u003e                        | Display response (from Keys and Regex views)\n\u003ckbd\u003eTab\u003c/kbd\u003e       \t\t\t\t\t          | Next view\n\u003ckbd\u003eCtrl+Space\u003c/kbd\u003e                   | Run crawler\n\u003ckbd\u003eCtrl+S\u003c/kbd\u003e                       | Save response\n\u003ckbd\u003eCtrl+Z\u003c/kbd\u003e                       | Quit\n\u003ckbd\u003eCtrl+R\u003c/kbd\u003e                       | Restore to default values (from Options and Headers views)\n\u003ckbd\u003eCtrl+Q\u003c/kbd\u003e                       | Close response save view (from Save view)\n\n```bash\ngocrawler -h\n```\nIt will displays help for the tool:\n\n| flag | Description | Example |\n|------|-------------|---------|\n| -url | URL to crawl for | gocrawler -url toscrape.com |\n| -url-exclude string | Exclude URLs maching with this regex (default \".*\")  | gocrawler -url-exclude ?id= | \n| -domain-exclude string | Exclude in-scope domains to crawl. Separate with comma. default=root domain | gocrawler -domain-exclude host1.tld,host2.tld | \n| -code-exclude string | Exclude HTTP status code with these codes. Separate whit '\\|' (default \".*\") | gocrawler -code-exclude 200,201 | \n| -delay int  | Sleep between each request(Millisecond) | gocrawler -delay 300 | \n| -depth | Scraper depth search level (default 1) | gocrawler -depth 2 | \n| -thread int | The number of concurrent goroutines for resolving (default 5) | gocrawler -thread 10 |\n| -header | HTTP Header for each request(It should to separated fields by \\n). | gocrawler -header KEY: VALUE\\nKEY1: VALUE1 | \n| -proxy string | Proxy by scheme://ip:port | gocrawler -proxy http://1.1.1.1:8080 | \n| -scheme string | Set the scheme for the requests (default \"https\") | gocrawler -scheme http | \n| -timeout int | Seconds to wait before timing out (default 10) | gocrawler -timeout 15 | \n| -query string | JQuery expression(It could be a file extension(pdf), a key query(url,script,css,..) or a jquery selector($(\"a[class='hdr']).attr('hdr')\"))) | gocrawler -query url,pdf,txt |\n| -regex string | Search the Regular Expression on the page contents | gocrawler -regex 'User.+' |\n| -max-regex int | Max result of regex search for regex field (default 1000) | gocrawler -max-regex -1 | \n| -robots | Scrape robots.txt for URLs and using them as seeds | gocrawler -robots |\n| -sitemap | Scrape sitemap.xml for URLs and using them as seeds | gocrawler -sitemap |\n| -wayback | Scrape WayBackURLs(web.archive.org) for URLs and using them as seeds | gocrawler -sitemap |\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkrishpranav%2Fgocralwer","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkrishpranav%2Fgocralwer","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkrishpranav%2Fgocralwer/lists"}