An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with robotstxt

A curated list of projects in awesome lists tagged with robotstxt .

https://github.com/crawler-commons/crawler-commons

A set of reusable Java components that implement functionality common to any web crawler

java library open-source robots-txt robotstxt sitemaps web-crawler

Last synced: 06 Mar 2026

https://github.com/alextim/astro-lib

Makes it easy to add robots.txt, sitemap and web app manifest during build to your Astro app.

astro robots-txt robotstxt seo sitemap sitemap-xml webmanifest

Last synced: 06 Apr 2025

https://github.com/folyd/robotstxt

A native Rust port of Google's robots.txt parser and matcher C++ library.

google-robots-parser robotstxt rust

Last synced: 06 May 2025

https://github.com/Folyd/robotstxt

A native Rust port of Google's robots.txt parser and matcher C++ library.

google-robots-parser robotstxt rust

Last synced: 15 Apr 2025

https://github.com/itgalaxy/robotstxt-webpack-plugin

A webpack plugin to generate a robots.txt file

robots-txt robotstxt webpack webpack-plugin

Last synced: 05 May 2025

https://github.com/kozmozio/kirby-llms

A Kirby CMS plugin that generates an llms.txt file for Large Language Models

kirby4 kirby5 kirbycms llms llmstxt robots robotstxt

Last synced: 01 Mar 2026

https://github.com/spcanelon/2022-ccd-sips

Workshop materials for creating the Center City Sips District 2022 interactive map

geocoding ggmap interactive-map leaflet r robotstxt webscraping

Last synced: 13 Aug 2025

https://github.com/j-plugins/robots-txt-plugin

Intellij IDEA Plugin for Robots.txt technology

intellij intellij-plugin kotlin robots-txt robotstxt viewer

Last synced: 07 May 2026

https://github.com/vxern/robots_txt

โš™๏ธ A quality `robots.txt` ruleset parser to ensure your application follows the standard specification for the file.

complete dart documented fast parser robots robots-txt robots-txt-parser robotstxt simple tiny

Last synced: 30 Jan 2026

https://github.com/marcodaniels/elm-robots-humans

Create robots.txt and humans.txt in Elm

elm humanstxt robotstxt

Last synced: 09 Apr 2025

https://github.com/s-thom/create-robots-txt-action

An action to create a robots.txt file from different sources

action actions gh-action gh-actions github-action github-actions robots-txt robotstxt

Last synced: 15 May 2026

https://github.com/littlebizzy/virtual-robotstxt

Replaces the default virtual robots.txt generated by WordPress with an editable one, and deletes any physical robots.txt file that may already exist.

robotstxt virtual-robotstxt wordpress wordpress-plugin

Last synced: 03 Jul 2025

https://github.com/rimiti/robotizer

Robots.txt parser / generator

generator parser robots-parser robots-txt robotstxt

Last synced: 19 Sep 2025

https://github.com/thomasleveil/pyrobots

python binding for Google robots.txt parser C++ library

googlebot python robots-exclusion-protocol robots-parser robots-txt robotstxt

Last synced: 31 Dec 2025

https://github.com/xhdndmm/one_file_search_engine

ไธ€ไธชๅ•ๆ–‡ไปถ็š„็ฎ€ๆ˜“ๆœ็ดขๅผ•ๆ“Ž

flask fts gplv3 html javascript python python3 robotstxt search-engine sqlite

Last synced: 12 Jan 2026

https://github.com/itechbear/robotstxt

A java clone of Google's robotst.txt parser: https://github.com/google/robotstxt

crawler google-robotst-parser java robotstxt

Last synced: 14 Jan 2026

https://github.com/peaceiris/docker-images

A collection of Docker images

docker robots-txt robotstxt

Last synced: 08 Jan 2026

https://github.com/dpb587/go-sitemap

Stream decoders for sitemap.xml data and link feeds.

go golang robotstxt sitemapxml

Last synced: 06 Oct 2025