How does ChatGPT decide what to cite?
ChatGPT pulls from two sources: its training data (a snapshot of the web) and real-time web browsing (when enabled).
What makes content "citable":
- Direct, clear answers to the question being asked
- Authoritative sources — sites with strong domain authority and brand recognition
- Structured content — well-organized with headings, lists, and clear formatting
- Fresh content — 95% of citations come from content updated within 10 months
- Third-party validation — ChatGPT cites third-party sources 92.1% of the time in some categories
Critically, ChatGPT doesn't care about persuasive marketing copy. It cares about factual, verifiable, well-structured information.
Step 1: Make sure ChatGPT can crawl you
Check your robots.txt file. Many sites unknowingly block AI crawlers.
Add this to your robots.txt:
User-agent: GPTBot
Allow: /
User-agent: ChatGPT-User
Allow: /
Without this, ChatGPT's browsing feature literally cannot read your site. This is the most common AEO mistake and the easiest to fix.
Step 2: Structure content for extraction
ChatGPT extracts content in chunks. Make those chunks clean:
- Use question-format H2 headings. Instead of "Our Pricing," write "How much does [product] cost?"
- Front-load answers. Put the direct answer in the first sentence after the heading, then add context.
- Use lists and tables. ChatGPT renders these directly in responses.
- Add FAQ schema. ChatGPT's browsing feature reads structured data — give it ready-made Q&A pairs.
Step 3: Build citation authority
ChatGPT trusts content that other authoritative sources trust:
- Get cited on authoritative sites. Press coverage, industry publications, Wikipedia mentions, Reddit discussions.
- Cite authoritative sources yourself. Include links to studies, official docs, and recognized experts. Statistics boost visibility by 40%+.
- Create original research. Proprietary data, surveys, case studies — content that can't be found elsewhere.
Step 4: Create a llms.txt file
The llms.txt file is a convention — a file at your domain root that tells AI models how to understand and reference your site. Think of it as a README for AI.
Place it at yoursite.com/llms.txt with:
- Site name and description
- Key topics you cover
- How you want to be cited
- Your most important URLs
Read our complete llms.txt guide for a template you can copy.
Step 5: Keep content fresh
Pages with visible timestamps get 1.8x more ChatGPT citations than those without.
- Add
datePublishedanddateModifiedschema to every page - Show "Last updated" dates visibly on your content
- Actually update the content — refresh statistics, add new info
- Publish regularly — consistent publishing = more frequent crawling




