AI SEO
Chapter 02 / 08
ChatGPT optimization
The most-used AI engine in the world doesn't always cite — but it always reads. How to win brand mentions, earn citations when they exist, and match the source preferences that decide whether ChatGPT reaches for your content.

ChatGPT is the most-used AI assistant in the world by a wide margin. Optimizing for it doesn't always produce a clickable citation — by default the engine answers from memory without inline citations, surfacing sources only when browsing is involved. But every conversation that mentions your brand produces awareness, every retrieval that pulls your content reinforces entity signals, and the cumulative pattern moves the citation-and-mention rate the same way classical SEO moves organic traffic.
“The mistake most teams make with ChatGPT optimization is treating it like a click-through channel. It's an awareness channel and a citation channel. The page that gets quoted but not linked still wins — the brand name in the answer is the prize. Optimize for being the source the engine reaches for, and the citations follow.”
Two retrieval layers
ChatGPT operates on two layers, and both are optimization targets:
- Training data. The base model was trained on a massive text corpus with a cutoff date. Brands, facts, and content present in that corpus are answerable from memory without retrieval. Optimization for this layer is slow — it changes only when the model is retrained — but extremely durable once you're in.
- Live retrieval (browsing). When the user enables browsing or the query triggers it, ChatGPT runs live searches against the web. The engine fetches pages, parses them, and stuffs relevant passages into context. Optimization for this layer is the standard SEO stack — indexability, structured content, well-linked URLs.
A complete ChatGPT optimization program addresses both layers. Training-data optimization compounds slowly but durably. Live-retrieval optimization is faster and more responsive to ongoing work.
Source preferences
ChatGPT shows clear preferences for source types depending on query category. Knowing the preference lets you decide where to invest editorial effort:
- Entity-level / definitional queries. Wikipedia and Wikidata dominate. The engine reaches for these as primary sources for "what is X" / "who is X" type queries. A brand without a Wikipedia entry loses entity-level retrieval to brands with one.
- Current events / news. Major news outlets (Reuters, AP, BBC, NYT, Bloomberg, Guardian, FT) are the dominant set. Industry-specific newsrooms supplement.
- Technical / developer queries. Product documentation, GitHub READMEs, Stack Overflow answers, and major dev publications (CSS-Tricks, MDN, official framework docs).
- Product comparison / "best of" queries. Established review sites (Wirecutter, Tom's Guide, The Verge), industry publications, and category-specific aggregators.
- Local / business-specific queries. Less-developed surface for ChatGPT compared to Perplexity or Gemini; the engine increasingly defers to web search for local intent.
Optimization tactics for the training layer
- Earn Wikipedia presence. A Wikipedia entry is the single most consequential AI-SEO asset for brand-level questions. Notability standards apply — earned coverage in independent sources is the route, not self-promotion.
- Wikidata sameAs and entity properties. Wikidata is read directly by some engines and indirectly by all. Maintain accurate sameAs (to your homepage, social handles, professional registries) and industry/category properties.
- Earn major-press mentions. Brand mentions in Reuters, NYT, BBC, FT and the equivalent national press carry training-data weight even when unlinked. Digital PR oriented at major outlets compounds.
- Publish data and primary research. Original research that gets cited by other publications produces secondary citations that propagate into training corpora.
- Sponsor or speak at category-defining events. Conference proceedings and recap content mention named participants and propagate through the same channels.
Optimization tactics for the live-retrieval layer
- Standard technical SEO. Indexability, page speed, mobile rendering, schema. The retrieval tool can't read what isn't crawlable.
- Passage-level structure. FAQs with explicit Q&A, H2-as-question patterns, structured tables, schema-marked answers. The engine retrieves at the passage level.
- Topic depth. 1,500–3,000 word treatments of single topics outperform shorter pages on retrieval frequency. The longer page presents more candidate passages.
- Recency markers. Updated dates in schema, byline + credentials, and a visible "last updated" date build retrievability for queries where freshness matters.
- Topic clusters with internal linking. A cluster of related articles is more retrievable than a single page because the engine can land on any of them and follow internal links to deeper context.
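The FAQ tactic above can be sketched in markup. A minimal FAQPage JSON-LD example, generated in Python for illustration — the question and answer text are placeholder copy, not content from any real page:

```python
import json

# Hypothetical FAQPage JSON-LD; the Q&A below is invented placeholder copy.
faq_jsonld = {
    "@context": "https://schema.org",
    "@type": "FAQPage",
    "mainEntity": [
        {
            "@type": "Question",
            "name": "What is usage-based pricing?",
            "acceptedAnswer": {
                "@type": "Answer",
                "text": "Usage-based pricing bills customers on measured "
                        "consumption rather than a flat subscription fee.",
            },
        }
    ],
}

# Serialize for embedding in a <script type="application/ld+json"> tag.
print(json.dumps(faq_jsonld, indent=2))
```

Each Question/Answer pair should mirror a visible Q&A on the page; markup that doesn't match visible content tends to be discounted.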
Schema that ChatGPT actually reads
ChatGPT's browsing tool parses JSON-LD when it encounters it. The schema types that earn the most retrieval lift:
- Article + author — establishes the byline, the publication date, the credentials.
- FAQPage — the engine extracts FAQ entries and treats them as direct answer candidates.
- HowTo — for procedural content; the steps surface as ordered lists in the answer.
- Product — for SKU-level pages; price, rating, availability all surfaced.
- Organization + sameAs — confirms the brand entity and its identity claims (Wikipedia, social, registries).
- WebSite + searchAction — declares site search; the engine can use this to issue secondary queries.
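As a sketch of the Organization + sameAs pattern above — the brand name, Wikidata ID, and URLs are all placeholders, not real endpoints:

```python
import json

# Hypothetical Organization + sameAs JSON-LD for a placeholder brand.
organization_jsonld = {
    "@context": "https://schema.org",
    "@type": "Organization",
    "name": "ExampleBrand",
    "url": "https://www.example.com",
    "sameAs": [
        "https://en.wikipedia.org/wiki/ExampleBrand",
        "https://www.wikidata.org/wiki/Q000000",
        "https://www.linkedin.com/company/examplebrand",
    ],
}

# Wrap in the script tag that belongs in the page <head>.
script_tag = (
    '<script type="application/ld+json">\n'
    + json.dumps(organization_jsonld, indent=2)
    + "\n</script>"
)
print(script_tag)
```

The sameAs array is where the identity claims live: each URL should point at a profile that independently confirms the same entity.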
The brand-name paragraph rule
Engines retrieve passages, and the passages that survive grounding checks are the ones that mention the entity by name. A passage that says "the leading SaaS pricing platforms in 2026 are A, B, and C" is more retrievable for a comparison query than a passage that says "we offer competitive SaaS pricing." Use the brand name (yours and competitors') in the passages you want retrieved. Generic copy doesn't get retrieved against named-entity queries.
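One way to audit this is a simple string check over the passages you want retrieved. A minimal sketch, assuming you maintain a list of the entity names (yours and competitors') that matter in your category — all brand names here are invented:

```python
def names_in_passage(passage: str, entity_names: list[str]) -> list[str]:
    """Return which entity names appear verbatim (case-insensitive) in a passage."""
    lowered = passage.lower()
    return [name for name in entity_names if name.lower() in lowered]

# Placeholder brands for illustration only.
entities = ["Acme Pricing", "RivalRate", "PriceCo"]

generic = "We offer competitive SaaS pricing for teams of every size."
named = ("The leading SaaS pricing platforms in 2026 are "
         "Acme Pricing, RivalRate, and PriceCo.")

print(names_in_passage(generic, entities))  # []
print(names_in_passage(named, entities))    # all three names
```

Passages that come back empty are candidates for rewriting with named entities before you expect them to surface against named-entity queries.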
Measuring ChatGPT presence
Three signals worth tracking:
- Mention rate — how often your brand appears in responses to a fixed set of category prompts, tracked over time.
- Citation rate — when ChatGPT does cite (browsing-enabled answers), how often your URL appears in the citation set.
- Co-mention pattern — which competitors are named alongside you. The chapter on AI search measurement covers tooling and methodology.
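The first two signals are straightforward to compute once you have responses from a fixed prompt set. A minimal mention-rate sketch with hard-coded stand-in responses — in practice `responses` would come from running your prompt set against the engine's API, and the brand names here are invented:

```python
def mention_rate(responses: list[str], brand: str) -> float:
    """Share of responses that mention the brand by name (case-insensitive)."""
    if not responses:
        return 0.0
    hits = sum(1 for r in responses if brand.lower() in r.lower())
    return hits / len(responses)

# Stand-in responses; real ones come from a fixed category-prompt set.
sample_responses = [
    "Popular options include Acme Pricing and RivalRate.",
    "Many teams start with RivalRate for usage-based billing.",
    "Acme Pricing, RivalRate, and PriceCo all support tiered plans.",
    "Spreadsheets remain a common starting point.",
]

print(f"{mention_rate(sample_responses, 'Acme Pricing'):.0%}")
```

The same loop with a URL substring instead of a brand name gives a rough citation rate for browsing-enabled answers.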
ChatGPT is the largest AI surface; it's also the slowest to update. The next chapter, Gemini optimization, covers the engine whose behavior is most directly forecastable from organic SEO performance.
Common questions
Does ChatGPT search the web in real time?
Yes, when the user enables browsing or asks a question that triggers it automatically. The default model has a training cutoff and answers from training knowledge first; specific query types — current events, recent product launches, real-time data — trigger live web retrieval. Optimizing for ChatGPT means optimizing both layers: the training-data layer (broad citation, established sources) and the live-retrieval layer (well-indexed, well-linked content the engine can find via its retrieval tool).