Ready to grow?
Book a free 30-minute strategy call. No pitch, no hard sell.
Book Your Strategy CallIf you have spent any time on GEO in 2026, you have seen the term llms.txt and probably wondered whether it actually does anything. Short version: it is a small file that hands AI engines a clean map of your most important pages, it takes about ten minutes to add, and it carries no SEO downside. At SugarNova we treat it as basic GEO hygiene rather than a magic ranking lever, and this guide explains what it is, what goes in it, the honest state of adoption, and whether it is worth your time.
What is llms.txt?
llms.txt is a plain Markdown file placed at the root of your site, at yourdomain.com/llms.txt. It gives large language models a curated, machine-readable shortlist of the content you most want them to read and cite. The format was proposed in late 2024 by Jeremy Howard to solve a specific problem: AI assistants do not crawl a site the way Google does. They have limited context, they skip a lot, and left to their own devices they can pull a summary from an outdated blog post instead of your core pages. llms.txt is your chance to point them at the right ones.
It is deliberately lean. The convention is a single H1 with your site or brand name, an optional blockquote giving a one to three sentence summary of what you do, and then a set of Markdown link lists grouped under headings such as Services, Guides or Documentation. Some sites also publish a longer companion file at llms-full.txt containing the full text of key pages for models that want more depth.
llms.txt vs robots.txt vs sitemap.xml
These three files are easy to confuse because they all live at the root and all talk to crawlers, but they do different jobs.
robots.txt sets access rules. It tells crawlers which paths they may or may not fetch, and it is where you allow or block specific AI bots such as GPTBot, ClaudeBot, Google-Extended and PerplexityBot. It is about permission.
sitemap.xml is a complete inventory. It lists every URL you want indexed so search engines can find them all. It is about coverage.
llms.txt is a curated recommendation. It does not list everything; it highlights the handful of pages you most want an AI model to use when it summarises or cites your brand. It is about priority. Think of robots.txt as the door policy, sitemap.xml as the full directory, and llms.txt as the concierge pointing guests to the best rooms.
What actually goes in the file
The syntax is Markdown because that is the native language of language models, so there is no complex parsing. A minimal, well-formed file looks like this:
# SugarNova
> SugarNova is a UK integrated growth agency running GEO, SEO, paid media, PR and CRO for ambitious DTC and high-ticket brands.
## Core services
- [GEO Agency](https://sugarnova.com/geo-agency): getting your brand cited by AI answer engines
- [AI SEO Agency](https://sugarnova.com/ai-seo-agency): technical SEO and answer engine optimisation
## Key guides
- [GEO vs SEO](https://sugarnova.com/blog/geo-vs-seo): how the two differ and why you need both
- [Schema markup for AI search](https://sugarnova.com/blog/schema-markup-for-ai-search): the structured data that helps models cite you
Two rules matter. First, the file must be named exactly llms.txt, not llm.txt, or discovery breaks. Second, keep it to your genuinely important, public pages. Confidential material such as client reports or campaign data should never be linked here, and anything you want kept private still needs proper access controls, because llms.txt is opt-in guidance, not a security mechanism.
The honest state of adoption in 2026
This is where most guides oversell, so here is the balanced view. llms.txt is gaining real traction among developer and SEO tooling. Documentation platforms and content teams have adopted it quickly, and its trajectory looks a lot like sitemap.xml before that became standard practice.
But it is not yet an official standard, and no major AI platform has publicly committed to reading it as a first-class input. As of mid-2026, OpenAI, Anthropic, Google and Perplexity have not confirmed automatic reading, Google's John Mueller has noted that major crawlers do not currently prioritise these files over standard HTML, and Semrush reported no statistical correlation between adding llms.txt and improved AI-search performance in a controlled study. So it would be dishonest to promise citations off the back of one file.
The counterpoint is equally fair. Absence of confirmation is not evidence of no value. Web conventions usually get adopted first and formalised later; robots.txt itself was a convention long before any engine officially committed to it. The file is free to add, has no downside, and future-proofs you for the point where support does formalise. That risk-reward is why we recommend it as hygiene, not as a headline tactic.
Should you add one?
For most brands, yes. Not because it guarantees a citation tomorrow, but because it is a ten-minute, zero-risk move that makes your most citable content easier to find, and it signals that your site is built with AI discovery in mind. Where it earns its keep is as one layer inside a real GEO stack, not as a standalone trick.
The layers that actually move citation share are the ones we build first: clean entity definitions so models know exactly what your brand is, question-and-answer content structured the way models quote, schema markup for AI search, and the digital PR that earns the third-party corroboration AI engines reward. llms.txt sits on top of that foundation and helps crawlers navigate it. Add it to a weak base and it does very little; add it to a strong one and it removes friction.
How llms.txt fits the wider GEO play
Getting cited by AI is not one action, it is a system. If you want the full picture, our guide to generative engine optimisation covers the strategy end to end, and how generative engine optimisation works breaks down the mechanics. For the search-versus-AI split, GEO vs SEO explains why you need both. And once your file is live and your content is structured, the practical question becomes engine-specific: our walkthrough on how to get cited by ChatGPT shows what that looks like in practice, alongside our wider work on AI search optimisation.
Run as one engine, these compound. PR earns authoritative coverage and links. That authority lifts domain strength, which improves both Google rankings and AI citation likelihood. Stronger organic visibility lowers paid acquisition cost, which frees budget for the creative and coverage that feed the loop again. llms.txt is a small, cheap cog in that machine, worth turning precisely because it costs almost nothing to turn.
Getting started
Add the file, point it at your best pages, and then measure whether it moves anything. The honest test is citation share: ask the major AI engines the questions your buyers ask and record whether your brand is named, before and after. That is the number that matters, and it is exactly what our free Growth Audit measures, alongside the entity, schema and authority work that makes an llms.txt file worth reading in the first place. If AI visibility is on your roadmap, our GEO agency and AI SEO teams build the whole stack, not just the file.
Frequently asked questions
What is llms.txt used for?
llms.txt is a Markdown file at the root of your site that gives AI models a curated shortlist of your most important pages. It helps large language models find and cite the content you actually want them to use, rather than guessing from whatever they happen to crawl.
Is llms.txt the same as robots.txt?
No. robots.txt sets access rules and tells crawlers which paths they may fetch or must avoid, including specific AI bots. llms.txt does the opposite job: it recommends your best content to AI models. One controls permission, the other guides priority.
Does llms.txt actually improve AI citations?
The honest answer in 2026 is that it is unproven. Adoption is growing, but no major AI platform has committed to reading it and one Semrush study found no measured correlation. It is best treated as free, no-downside GEO hygiene that future-proofs your site, not as a guaranteed ranking lever.
Where do I put the llms.txt file?
At the root of your domain, so it resolves at yourdomain.com/llms.txt. It must be named exactly llms.txt. You can optionally add a longer companion file at llms-full.txt containing the full text of your key pages for models that want more depth.
Should I use llms.txt or focus on schema and content first?
Content, entity definitions and schema come first, because those are what actually earn citations. llms.txt sits on top of that foundation and helps crawlers navigate it. Add it once your citable content exists, not as a substitute for building that content.