Documentation Index
Fetch the complete documentation index at: https://base.bangwu.me/llms.txt
Use this file to discover all available pages before exploring further.
Make your docs site agent-ready
Documentation is no longer consumed only by humans in browsers. Coding agents, AI search products, and chat tools now fetch docs directly. If your docs site is hard for agents to discover or parse, the result is slower retrieval, higher token cost, and worse answers. Cloudflare’s Agent Readiness write-up from April 17, 2026 is a useful signal:Content-Signal adoption was still around 4%, Markdown content negotiation around 3.9%, and newer protocol-discovery standards were extremely rare. That means small improvements can still make your docs stand out.
Start with the basics
Before you add AI-specific files, make sure the normal web hygiene is solid:- Keep
robots.txtvalid and intentional. - Keep
sitemap.xmlup to date. - Use clear page titles and descriptions.
- Prefer stable URLs.
- Avoid sending agents through low-value directory pages when a better landing page exists.
Publish llms.txt
/llms.txt is a curated reading list for LLMs. Unlike sitemap.xml, it should not try to list everything. It should point agents to the most useful pages in a token-efficient way.
Use it to answer three questions quickly:
- What is this site?
- Which sections matter most?
- Which Markdown-friendly pages should an agent fetch next?
A small example
For large doc sets
Do not dump thousands of links into one root file. A better pattern is:- keep a short root
llms.txt - add one
llms.txtper top-level section - link to section-level indexes from the root file
Serve Markdown directly
Agents do better with Markdown than with full HTML. Cloudflare notes that serving Markdown can reduce token usage significantly, which improves both cost and answer quality. The best options are:- Support
Accept: text/markdown- let clients request the same page as Markdown
- Provide a URL fallback like
/index.md- useful because not every agent sends the Markdown
Acceptheader by default
- useful because not every agent sends the Markdown
Be explicit about AI access
robots.txt is still where you declare crawl rules. If you want more granular AI permissions, add Content-Signal directives.
Example:
search: indexing and search resultsai-input: inference-time use such as grounding or RAGai-train: model training or fine-tuning
Publish machine-discoverable capabilities when relevant
Not every docs site needs protocol discovery. If your site only serves public content, you can stop at content readiness. Add capability endpoints when your service also exposes APIs or tools:/.well-known/api-catalogfor public API discovery/.well-known/mcp/server-card.jsonif you expose an MCP server- Web Bot Auth only if bot identity matters for your use case
Make large doc sets easier to navigate
Structure matters as much as availability.- Link agents to content pages, not only directory listings.
- Remove low-value index pages from
llms.txtif they add almost no semantic information. - Write descriptive frontmatter and headings.
- Link Markdown URLs in
llms.txt, not the HTML pages.
Roll this out in phases
Use the smallest useful sequence:- Fix
robots.txt,sitemap.xml, titles, and descriptions. - Add
llms.txt. - Add Markdown delivery via
Accept: text/markdownand/or/index.md. - Add
Content-Signalif you need explicit AI usage preferences. - Add API or MCP discovery only if your site actually exposes capabilities.
Audit your site
Use both automated and manual checks:- Run your site through
https://isitagentready.com - Test a page with:
- Ask an AI tool to answer one precise question using only your docs
- Check whether it finds the right page quickly or wastes time looping through indexes
