Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/crwlrsoft/robots-txt

Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping
https://github.com/crwlrsoft/robots-txt

hacktoberfest robots-exclusion-protocol robots-exclusion-standard robots-txt robots-txt-parser web-crawling web-scraping

Last synced: about 11 hours ago
JSON representation

Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping

Awesome Lists containing this project

README

        

crwlr.software logo

# Robots Exclusion Standard/Protocol Parser
## for Web Crawling/Scraping

Use this library within crawler/scraper programs to parse robots.txt
files and check if your crawler user-agent is allowed to load certain
paths.

## Documentation
You can find the documentation at [crwlr.software](https://www.crwlr.software/packages/robots-txt/getting-started).

## Contributing

If you consider contributing something to this package, read the [contribution guide (CONTRIBUTING.md)](CONTRIBUTING.md).