Projects in Awesome Lists tagged with robotstxt
A curated list of projects in awesome lists tagged with robotstxt.
https://github.com/dmi3kno/polite
Be nice on the web
crawler memoise r r-package rate-limiter robotstxt rstats rvest scraper webscraping
Last synced: 14 Mar 2025
https://github.com/alextim/astro-lib
Makes it easy to add robots.txt, sitemap and web app manifest during build to your Astro app.
astro robots-txt robotstxt seo sitemap sitemap-xml webmanifest
Last synced: 06 Apr 2025
https://github.com/folyd/robotstxt
A native Rust port of Google's robots.txt parser and matcher C++ library.
google-robots-parser robotstxt rust
Last synced: 06 May 2025
https://github.com/itgalaxy/generate-robotstxt
robots.txt generator for Node.js
cli generator-robots robot robots robots-generator robots-txt robotstxt
Last synced: 09 Apr 2025
https://github.com/itgalaxy/robotstxt-webpack-plugin
A webpack plugin to generate a robots.txt file
robots-txt robotstxt webpack webpack-plugin
Last synced: 05 May 2025
https://github.com/spcanelon/2022-ccd-sips
Workshop materials for creating the Center City Sips District 2022 interactive map
geocoding ggmap interactive-map leaflet r robotstxt webscraping
Last synced: 05 Apr 2025
https://github.com/vxern/robots_txt
⚙️ A quality `robots.txt` ruleset parser to ensure your application follows the standard specification for the file.
complete dart documented fast parser robots robots-txt robots-txt-parser robotstxt simple tiny
Last synced: 10 Apr 2025
https://github.com/onuratakan/site_seo_scanner
The site SEO scanner.
robotstxt scanner scanner-web seo seotools site sitemap
Last synced: 08 Apr 2025
https://github.com/rimiti/robotizer
Robots.txt parser / generator
generator parser robots-parser robots-txt robotstxt
Last synced: 19 Feb 2025
https://github.com/marcodaniels/elm-robots-humans
Create robots.txt and humans.txt in Elm
Last synced: 09 Apr 2025
https://github.com/littlebizzy/virtual-robotstxt
Replaces the default virtual robots.txt generated by WordPress with an editable one, and deletes any physical robots.txt file that may already exist.
robotstxt virtual-robotstxt wordpress wordpress-plugin
Last synced: 07 May 2025
https://github.com/s-thom/create-robots-txt-action
An action to create a robots.txt file from different sources
action actions gh-action gh-actions github-action github-actions robots-txt robotstxt
Last synced: 02 Mar 2025
https://github.com/maximeguinard/robots.txt-viewer
🌐 A browser extension that displays the contents of a website's robots.txt and sitemap.xml files
extension extension-chrome extension-firefox extension-methods extension-pack extensions robots-txt robotstxt sitemap sitemap-xml sitemaps website website-builder website-design website-template websites
Last synced: 20 Mar 2025
https://github.com/prod3v3loper/php-honeypot-robots-bait
🍯 HoneyPot Robots Bait
backend honey-pot html ip ip-address log logging php robots robots-txt robotstxt server
Last synced: 15 May 2025
https://github.com/thomasleveil/pyrobots
Python binding for Google's robots.txt parser C++ library
googlebot python robots-exclusion-protocol robots-parser robots-txt robotstxt
Last synced: 27 Mar 2025
https://github.com/dpb587/go-sitemap
Stream decoders for sitemap.xml data and link feeds.
go golang robotstxt sitemapxml
Last synced: 05 Apr 2025