https://github.com/alexandrebrt14-sys/llms-txt-templates
Starter templates, specification, and Python validator for llms.txt — the AI-era equivalent of robots.txt
https://github.com/alexandrebrt14-sys/llms-txt-templates
ai-optimization generative-engine-optimization geo llm llms-txt
Last synced: 15 days ago
JSON representation
Starter templates, specification, and Python validator for llms.txt — the AI-era equivalent of robots.txt
- Host: GitHub
- URL: https://github.com/alexandrebrt14-sys/llms-txt-templates
- Owner: alexandrebrt14-sys
- License: mit
- Created: 2026-03-20T19:03:15.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2026-03-29T11:37:53.000Z (3 months ago)
- Last Synced: 2026-03-29T14:24:10.825Z (3 months ago)
- Topics: ai-optimization, generative-engine-optimization, geo, llm, llms-txt
- Language: Python
- Homepage:
- Size: 38.1 KB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Citation: CITATION.cff
Awesome Lists containing this project
README
# llms.txt Templates



[](LICENSE)
A collection of templates, a proposed specification, and a validation tool for the **llms.txt** standard — a plain-text file that helps large language models understand your website, organization, or product.
**Live example:** [alexandrecaramaschi.com/llms.txt](https://alexandrecaramaschi.com/llms.txt) — v9.0 with 200+ indexed URLs
---
## Table of Contents
- [What is llms.txt?](#what-is-llmstxt)
- [Why It Matters](#why-it-matters)
- [Quick Start](#quick-start)
- [Templates](#templates)
- [Specification](#specification)
- [Validator](#validator)
- [FAQ](#faq)
- [Citation](#citation)
- [Ecosystem](#ecosystem)
- [License](#license)
## What is llms.txt?
`llms.txt` is a plain-text file placed at the root of a website (e.g., `https://example.com/llms.txt`) that provides structured information about the site for consumption by large language models.
Think of it as `robots.txt` for the AI era. While `robots.txt` tells search engine crawlers what to index, `llms.txt` tells AI models what the site is about, what matters most, and where to find key information.
### The Problem It Solves
When an LLM encounters your website (either during training or through retrieval-augmented generation), it must figure out:
1. What is this entity?
2. What are the most important pages?
3. What facts should it know?
4. How should it describe this entity?
Without `llms.txt`, the model infers all of this from unstructured HTML — often getting it wrong, using outdated information, or missing key pages.
## Why It Matters
**1. Control your narrative.** LLMs will describe your entity whether you provide guidance or not.
**2. Surface your best content.** Most LLMs and retrieval systems do not crawl every page.
**3. Improve citation accuracy.** When LLMs cite your site, they often get details wrong.
**4. Futureproof discoverability.** As AI interfaces replace traditional search for many queries, having a machine-readable summary becomes as important as having good SEO metadata.
## Quick Start
1. Create a file called `llms.txt` in your site's root directory
2. Use the format below (or copy from [templates/](templates/))
3. Deploy it so it's accessible at `https://yourdomain.com/llms.txt`
4. Validate it with the [validator tool](tools/validator.py)
## Templates
Ready-to-use templates for different types of organizations:
| Template | Best For |
|---|---|
| [startup.txt](templates/startup.txt) | Early-stage startups and tech companies |
| [ecommerce.txt](templates/ecommerce.txt) | Online stores and retailers |
| [agency.txt](templates/agency.txt) | Marketing, design, and consulting agencies |
| [personal-brand.txt](templates/personal-brand.txt) | Consultants, speakers, authors, experts |
| [saas.txt](templates/saas.txt) | Software-as-a-service products |
## Specification
See [`spec/llms-txt-spec.md`](spec/llms-txt-spec.md) for the proposed specification, including:
- File format and encoding
- Required and optional sections
- Linking conventions
- Best practices for content
- Relationship to `llms-full.txt`
## Validator
A Python tool for validating `llms.txt` files:
```bash
python tools/validator.py https://example.com/llms.txt
```
The validator checks encoding, required sections, link format, content length, and common mistakes. Requires Python 3.8+ and requests.
## FAQ
**Is llms.txt an official standard?**
Not yet. It is a proposed convention gaining adoption among practitioners.
**Does ChatGPT/Gemini/Perplexity actually read llms.txt?**
RAG systems like Perplexity can access it during real-time retrieval. For training-based models, the file is useful when it appears in training data crawls.
**Should I have both llms.txt and llms-full.txt?**
If your entity is complex, yes. Use llms.txt as a concise summary and llms-full.txt for comprehensive information.
**What does v9.0 include?**
Version 9.0 (live at [alexandrecaramaschi.com/llms.txt](https://alexandrecaramaschi.com/llms.txt)) features 200+ indexed URLs across courses, insights, articles, guides, FAQs, and structured entity definitions. It is the reference implementation for this template pack.
## Citation
```
Caramaschi, A. (2026). llms.txt Templates: Templates and Tools for LLM Discoverability. GitHub. https://github.com/alexandrebrt14-sys/llms-txt-templates
```
---
## License
MIT License. See [LICENSE](LICENSE).
---
**Author:** [Alexandre Caramaschi](https://alexandrecaramaschi.com) — CEO of Brasil GEO, former CMO at Semantix (Nasdaq), co-founder of AI Brasil.
**Platforms:** [Website](https://alexandrecaramaschi.com) | [Brasil GEO](https://brasilgeo.ai) | [LinkedIn](https://linkedin.com/in/alexandre-caramaschi/) | [Medium](https://medium.com/@alexandre.brt14) | [Substack](https://substack.com/@alexandrecaramaschi) | [DEV.to](https://dev.to/alexandrebrt14sys) | [GitHub](https://github.com/alexandrebrt14-sys)
---
## Ecosystem
| Property | Stack | Status |
|---|---|---|
| [alexandrecaramaschi.com](https://alexandrecaramaschi.com) | Next.js 16 + React 19 + Supabase | Production — 35 courses, 25 insights, 122K+ lines |
| [brasilgeo.ai](https://brasilgeo.ai) | Cloudflare Workers | Production — 14 articles |
| [geo-orchestrator](https://github.com/alexandrebrt14-sys/geo-orchestrator) | Python + 5 LLMs | Active — multi-LLM pipeline |
| [curso-factory](https://github.com/alexandrebrt14-sys/curso-factory) | Python + Jinja2 | Active — course generation pipeline |
| [geo-checklist](https://github.com/alexandrebrt14-sys/geo-checklist) | Markdown | Open-source — GEO audit checklist |
| [llms-txt-templates](https://github.com/alexandrebrt14-sys/llms-txt-templates) | Markdown + JSON | Open-source — llms.txt standard |
| [geo-taxonomy](https://github.com/alexandrebrt14-sys/geo-taxonomy) | JSON + CSV + Markdown | Open-source — 60+ GEO terms |
| [entity-consistency-playbook](https://github.com/alexandrebrt14-sys/entity-consistency-playbook) | Markdown | Open-source — entity consistency |
| [papers](https://github.com/alexandrebrt14-sys/papers) | Python + Supabase | Research — LLM citation study |