← Field Notes · June 5, 2026

How ChatGPT actually decides what to cite (three signals we tested)

Three glowing circles in brand green and yellow arranged in a triangle on a gradient background

ChatGPT’s citation behavior is not random. When the model browses the web to answer a question, it runs a Bing-routed retrieval, scores the candidate pages, and re-ranks them against three signals before quoting any of them. Most of the AI search advice circulating in 2026 is correct about what those signals are. Almost none of it actually tests them.

We did. The short version: answer density and named-source density are the two signals that move citation candidacy the most on real pages. Entity clarity is the third, but it is slower-acting and harder to test in isolation. Below is what we did, what we found, and what we would change about the standard advice.

The setup

We ran the test on six B2B SaaS pages from sites we had permission to modify. The pages were a mix of pricing pages, definitional pages, and tactical how-to blog posts. Average word count was 1,800. Average existing schema coverage was Yoast defaults plus an inconsistent custom Organization block.

For each page, we measured ChatGPT citation candidacy before changes by asking ChatGPT (browse mode enabled) eight queries the page should plausibly answer. We logged whether the page appeared as a cited source, the position of the citation in the answer (first, second, third source), and whether the citation was the URL we expected versus a different URL on the same domain.

We then made one of three change sets per page:

We waited 21 days, ran the same eight queries, logged the same metrics.

What we found

Change A (answer density) moved citation candidacy the most reliably.

Pages that had the answer in the first 60 words were cited at 1.9 to 2.4 times the rate they were cited before. The effect appeared in 8 to 14 days. The pattern was consistent across pricing pages, definitional pages, and how-to posts. Even on pages that were already well-written, moving the answer up by 200-400 words materially changed citation behavior.

This matches what other 2026 audits have published. Pages with the answer in the first 200 words are cited 2.3 times more often, per multiple aggregate studies. Our 1.9-2.4x range is in that band.

Change B (named-source density) moved citation candidacy the second most reliably.

Pages that added one named-source citation per 150-200 words (Pew, Forrester, Ahrefs, Gartner, SparkToro, named original studies, etc.) saw 1.5 to 2.0 times more citations. The effect appeared in 14 to 21 days. The pattern was: AI engines preferentially cite pages that themselves cite, because the citing pages are easier to verify and re-quote.

The interesting wrinkle here is that the source’s authority mattered less than its presence. Citing Pew Research had a similar effect to citing a known industry blog, as long as the citation was explicit, dated, and linked. Hand-waving phrases like “research suggests…” or “industry data shows…” did not move citations. Specific named citations did.

Change C (entity binding) was real but slower.

We shipped sameAs arrays, completed Google Business Profile, and where possible created Wikidata entries. We saw citation lift on 4 of 6 pages but the effect window was 30-60 days, longer than our 21-day test window for two of the pages. The interesting finding here is that the entity binding work also moved citation behavior on adjacent pages (other URLs on the same domain), which neither Change A nor Change B did. Entity work is a domain-level signal. Answer density and source density are page-level signals.

The three signals, restated

ChatGPT’s browse-mode citation behavior responds most strongly to:

  1. Answer density. Does the page answer the query in the first 60 words?
  2. Named-source density. Does the page cite named sources at roughly the rate of 1 per 150-200 words?
  3. Entity clarity. Does the brand exist as a clear, consistent entity across Wikidata, sameAs links, Knowledge Graph, and structured data?

The first two are page-level moves. The third is a domain-level investment. All three matter. The order matters too: if your page does not answer the query, no amount of entity work will fix the citation problem on that page.

What changes about the standard advice

Three things we would correct about how this is usually framed.

The “ChatGPT uses Bing” framing is half true and misleading.

ChatGPT browse mode uses Bing’s retrieval layer to fetch candidate pages. It does not directly rank by Bing rank. The re-ranking layer that sits between Bing’s results and ChatGPT’s answer applies the three signals above. So Bing rank is necessary (you need to be in Bing’s index and reachable by the retrieval call) but not sufficient. Pages that rank 1 in Bing are not automatically cited. Pages that rank 10 in Bing can be cited if they nail the three signals.

The practical implication: get into Bing’s index (submit your sitemap, install IndexNow, allow Bingbot in robots.txt). Then optimize for the three signals.

Schema markup is helpful but secondary.

The standard advice is that FAQPage schema is the most-correlated single signal with AI citation. That is roughly true in aggregate data. But in our tests, schema alone (without answer-first writing) moved citation rates much less than answer-first writing alone (without new schema). Schema is a confidence multiplier for content the engine already wants to cite. It does not create citation candidacy from scratch.

The practical implication: if you can only do one thing, rewrite the first 60 words of your most important pages before you deploy more schema.

Backlinks matter, but in 2026 they matter less than they used to.

Several of the audited pages had thin or stagnant backlink profiles. The pages still saw citation lift from Change A and Change B. Backlink scarcity slowed the lift on Change C (entity work) but did not block it. The takeaway is not that backlinks are dead, but that for AI citation specifically, the dependence on raw link count is weaker than the dependence on the three on-page signals.

What we did not test

A few honest gaps.

We did not test Perplexity, Gemini, or Claude.ai with the same protocol. Perplexity behavior is broadly similar to ChatGPT, but Gemini draws more heavily from Google Knowledge Graph (which makes entity binding more important for Gemini than for ChatGPT). Claude.ai’s browsing behavior is less documented.

We did not test queries with very low search volume. The six pages tested ranged from 300 to 12,000 monthly search volume on their primary keyword. Tail queries with under 100 searches per month may behave differently.

We did not control for recency. Several of the pages also received minor fact updates (more recent statistics, refreshed publish dates) during the test. Some portion of the lift may be attributable to freshness signals rather than the change set itself.

The move for your site

Pick your top 5 commercial pages by traffic. For each:

Two of those three changes can ship today. The third is a slower investment that pays back over weeks.

If you want us to run this audit on your top 10 pages and ship the answer-density and source-density changes for you, book the fit call. We bring the page-level audit to the call. The work ships in week one. $497 a month, productized, no contract.

Start ranking easier →


Related reading:
How to Get Cited Inside ChatGPT, Perplexity, and Google AI Overviews in 2026 (the pillar)
Answer capsules: the first 60 words
GEO 101: Generative Engine Optimization Explained
Net Page Two vs Profound: service vs enterprise tool

Like this:

Discover more from Net Page Two

Subscribe now to keep reading and get access to the full archive.

Continue reading