What AI Actually Cites: The B2B SaaS AI-Citation Benchmark (2026)

An original benchmark of 160,240 AI citations across 23,545 buyer conversations and five B2B SaaS brands: what ChatGPT, Perplexity, and Google AI Overviews actually cite, and the AEO moves that follow.

Arnel Bukva · 8 min read

Across 160,240 AI citations from five B2B SaaS brands, company websites won half of everything AI cited (50.7%), but Reddit was the single most-cited source, with nearly twice the citations of the next domain. Corporate content wins the category; a community forum wins the leaderboard, largely through ChatGPT.

Across 160,240 citations from five B2B SaaS brands, company websites won half of everything AI cited (50.7%). One domain still beat every other. Reddit was the single most-cited source in the dataset at 11,237 citations, nearly twice the next domain's 5,766. So corporate content wins the category, and a community forum wins the leaderboard.

That tension is the whole story. Below is what we measured, every table behind it, and what you should change about your own AEO work because of it.

How we measured this

We pooled AI-citation data from five live B2B SaaS projects we run inside Peec AI: LoudFace, Toku, and three anonymized clients (a fintech-payroll client, a B2B research client, and one more). The window was 30 days ending 2026-06-01. The instrument logged 23,545 sampled conversations and 160,240 individual citations, where a citation is any link an AI engine surfaced inside an answer.

One number matters before you trust any of the rest: we sampled three engines. ChatGPT, Perplexity, and Google AI Overviews returned data this window. Claude, Copilot, and Grok were not measured. Nothing here describes them, and we will not pretend otherwise.

Source types come from Peec's own domain classification, which buckets every cited domain into one of eight categories (Corporate, UGC, Editorial, Reference, Competitor, You, Other, Institutional). "You" means a project's own owned domain. "Competitor" means a domain Peec tagged as a rival for that project. We folded null classifications into "Other," which stayed small at 2.0% of citations. Larger projects pull more weight in the pooled totals, so we checked each finding per-project to confirm it is not one big client talking.

Finding 1: company websites win half of everything

Corporate content is the most-cited source type by a wide margin. Across all three engines, company websites took 50.7% of citations. The next type, user-generated content (UGC), sat at 13.2%. Nothing else cleared 12%.

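| Source type | Citations | Share |
| --- | --- | --- |
| Corporate | 81,218 | 50.7% |
| UGC | 21,160 | 13.2% |
| Editorial | 17,822 | 11.1% |
| Reference | 12,875 | 8.0% |
| Competitor | 12,581 | 7.9% |
| You (owned) | 8,064 | 5.0% |
| Other | 3,281 | 2.0% |
| Institutional | 3,239 | 2.0% |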
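For readers who want to check the shares, here is a minimal Python sketch that recomputes them from the citation counts in the table. The variable names are ours, chosen for illustration; only the counts come from the benchmark.

```python
# Citation counts per source type, copied from the table above.
counts = {
    "Corporate": 81_218,
    "UGC": 21_160,
    "Editorial": 17_822,
    "Reference": 12_875,
    "Competitor": 12_581,
    "You (owned)": 8_064,
    "Other": 3_281,
    "Institutional": 3_239,
}

# Total citations in the pooled dataset.
total = sum(counts.values())

# Citation share = a type's citations as a percentage of all citations.
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

for name, share in shares.items():
    print(f"{name:<14}{counts[name]:>8,}{share:>7.1f}%")
```

The counts sum to the 160,240 total, and each rounded share matches the Share column.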

This held in every single project. Corporate ranged from 41.6% to 66.9% across the five brands, and it was the top type in all of them. The ordering below Corporate shifted (UGC and Editorial traded places depending on the client), but the headline never moved. If an AI answer cites a single source about your category, the odds favor a company website.

That is the good news for anyone who owns a website. You are not fighting Wikipedia for every slot. You are competing inside the source type that already wins.

Finding 2: Reddit is the single most-cited domain, and it is a ChatGPT habit

Source type is one lens. Individual domains are another, and they tell a sharper story. Corporate wins as a category because thousands of company sites add up. No single corporate domain dominates. Reddit does.

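| # | Domain | Type | Citations | Share | Top engine |
| --- | --- | --- | --- | --- | --- |
| 1 | reddit.com | UGC | 11,237 | 7.0% | ChatGPT 80.5% |
| 2 | toku.com | You | 5,766 | 3.6% | ChatGPT 53.7% |
| 3 | riseworks.io | Competitor | 2,518 | 1.6% | Google AIO 43.5% |
| 4 | remote.com | Competitor | 2,011 | 1.3% | ChatGPT 81.7% |
| 5 | youtube.com | UGC | 2,005 | 1.3% | Google AIO 72.4% |
| 6 | investopedia.com | Editorial | 1,839 | 1.1% | Google AIO 71.0% |
| 7 | businessinsider.com | Editorial | 1,782 | 1.1% | Perplexity 37.6% |
| 8 | deel.com | Competitor | 1,197 | 0.7% | ChatGPT 59.2% |
| 9 | loudface.co | You | 1,097 | 0.7% | ChatGPT 58.5% |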

Reddit took 7.0% of all citations on its own. The second-place domain, toku.com, took 3.6%. So Reddit out-cited the next domain by nearly 2x (11,237 citations against 5,766).

Look at the "top engine" column and the pattern jumps out. Reddit's lead comes mostly from one engine: of Reddit's 11,237 citations, 9,050 (80.5%) came from ChatGPT, which cited Reddit at a rate of 2.64 citations per retrieval. Perplexity barely touched it, contributing 335 citations against 1,252 retrievals. Same forum, completely different treatment depending on which engine answers.
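The arithmetic behind that comparison can be sketched in a few lines of Python. The inputs are the figures reported above; the variable names are illustrative, and ChatGPT's 2.64 rate is taken as reported rather than recomputed, because its retrieval count is not published here.

```python
reddit_total = 11_237          # all Reddit citations in the dataset
reddit_from_chatgpt = 9_050    # Reddit citations surfaced by ChatGPT
chatgpt_rate = 2.64            # ChatGPT citations per Reddit retrieval (as reported)

perplexity_citations = 335     # Reddit citations surfaced by Perplexity
perplexity_retrievals = 1_252  # times Perplexity retrieved Reddit

# Share of Reddit's citations that came from ChatGPT.
chatgpt_share = 100 * reddit_from_chatgpt / reddit_total

# Citation rate = citations per retrieval.
perplexity_rate = perplexity_citations / perplexity_retrievals

print(f"ChatGPT share of Reddit citations: {chatgpt_share:.1f}%")
print(f"Perplexity citations per Reddit retrieval: {perplexity_rate:.2f}")
print(f"ChatGPT rate vs Perplexity rate: {chatgpt_rate / perplexity_rate:.1f}x")
```

Perplexity's rate works out to roughly 0.27 citations per Reddit retrieval, about a tenth of ChatGPT's reported 2.64.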

Finding 3: the three engines have different personalities

If you optimize for "AI search" as one thing, you are averaging across machines that disagree. Each engine leans on a different source mix.

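| Engine | Corporate | UGC | Editorial | Reference | Competitor | You | Total cites |
| --- | --- | --- | --- | --- | --- | --- | --- |
| ChatGPT | 48.0% | 15.0% | 10.2% | 10.3% | 7.4% | 5.4% | 80,608 |
| Google AIO | 53.4% | 14.0% | 10.7% | 3.5% | 8.4% | 4.6% | 42,147 |
| Perplexity | 53.4% | 8.4% | 13.6% | 8.3% | 8.3% | 4.8% | 37,485 |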

ChatGPT leans hardest on UGC at 15.0%, and most of that is Reddit. It also pulls Reference content more than the others (10.3%). Perplexity goes the other way: it is the most editorial engine at 13.6% and the lightest on UGC at 8.4%. Google AI Overviews is corporate-heavy at 53.4% (level with Perplexity at one decimal place) and pulls the least Reference at 3.5%.

The Corporate-first rule held for every engine in every project, and ChatGPT's UGC tilt was consistent across clients. So the floor is the same everywhere (own your category page), but the edges reward different moves per engine.
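To make the "averaging across machines" point concrete, the sketch below rebuilds the pooled Corporate share from the per-engine rows. It assumes the pooled figure is a citation-weighted average of the three engines, which is our reading of the tables rather than a documented Peec calculation.

```python
# Per-engine Corporate share (%) and total citations, from the engine table.
engines = {
    "ChatGPT": (48.0, 80_608),
    "Google AIO": (53.4, 42_147),
    "Perplexity": (53.4, 37_485),
}

total_cites = sum(cites for _, cites in engines.values())

# Pooled share: each engine's share weighted by its citation volume.
pooled = sum(share * cites for share, cites in engines.values()) / total_cites

# Unweighted mean: treats the three engines as if they were equally sized.
simple_mean = sum(share for share, _ in engines.values()) / len(engines)

print(f"Citation-weighted Corporate share: {pooled:.1f}%")
print(f"Unweighted mean across engines:    {simple_mean:.1f}%")
```

The weighted figure reproduces the 50.7% headline because ChatGPT supplies about half of all citations and pulls the blend toward its own 48.0%. The unweighted mean comes out nearly a point higher, so any "AI search" average depends on how much each engine contributes.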

Finding 4: owned pages get cited hardest when they get pulled

Citation share tells you how often a source shows up. It does not tell you how efficiently a page converts attention into a citation. For that we use citation rate, which is citations per retrieval: when an engine pulls a page into its working set, how reliably does it actually cite it?

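| Source type | Citation rate | Retrieval rate | Gap |
| --- | --- | --- | --- |
| You (owned) | 2.038 | 0.775 | +1.263 |
| Institutional | 0.774 | 0.058 | +0.716 |
| Competitor | 0.946 | 0.315 | +0.631 |
| Corporate | 0.715 | 0.091 | +0.623 |
| Other | 0.598 | 0.037 | +0.562 |
| Reference | 0.676 | 0.127 | +0.549 |
| Editorial | 0.740 | 0.206 | +0.534 |
| UGC | 0.901 | 0.413 | +0.487 |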

Owned pages lead at a 2.04 citation rate, far ahead of every other type. When an engine reaches a well-structured owned page, it cites it roughly twice per retrieval. Competitor pages (0.95) and UGC (0.90) come next. Corporate-at-large sits lower at 0.72, which makes sense: the corporate bucket includes a lot of pages that get crawled and ignored.
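The Gap column in the table reads as citation rate minus retrieval rate. The Python sketch below recomputes it under that assumption and ranks the source types by citation rate. A few recomputed gaps differ from the published ones in the third decimal, presumably because the published gaps were derived from unrounded inputs; that explanation is our guess.

```python
# (citation_rate, retrieval_rate, published_gap) per source type, from the table.
rates = {
    "You (owned)":   (2.038, 0.775, 1.263),
    "Institutional": (0.774, 0.058, 0.716),
    "Competitor":    (0.946, 0.315, 0.631),
    "Corporate":     (0.715, 0.091, 0.623),
    "Other":         (0.598, 0.037, 0.562),
    "Reference":     (0.676, 0.127, 0.549),
    "Editorial":     (0.740, 0.206, 0.534),
    "UGC":           (0.901, 0.413, 0.487),
}

# Assumed definition: gap = citation rate - retrieval rate.
gaps = {name: round(cite - retr, 3) for name, (cite, retr, _) in rates.items()}

# Largest disagreement between recomputed and published gaps.
max_diff = max(abs(gaps[name] - rates[name][2]) for name in rates)

# Source types ordered from highest to lowest citation rate.
by_citation_rate = sorted(rates, key=lambda name: rates[name][0], reverse=True)

for name in by_citation_rate:
    cite, retr, published = rates[name]
    print(f"{name:<14} rate={cite:.3f} gap={gaps[name]:+.3f} (published {published:+.3f})")
print(f"Largest difference from published gaps: {max_diff:.3f}")
```

Sorting by citation rate puts owned pages first, then Competitor and UGC, matching the order described above.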

This shows up in real ranks, not just averages. Three of our five brands are a top-2 cited source in their own category. Toku ranks #1 of 1,425 domains in its space. LoudFace ranks #2 of 2,160. The anonymized B2B research client ranks #2 of 1,302. Owning a domain that AI trusts is not theoretical. It is happening for most of the brands we measured.

Finding 5: being cited is not the same as being retrieved

The last column of that table is the one most people miss. Every source type was cited more than it was retrieved, which means citation and retrieval are not the same signal.

Among the source types you do not own, UGC has the highest retrieval rate (0.413), so engines pull community content into context constantly. But its citation rate (0.90) is less than half that of owned pages, so a lot of what gets retrieved never makes the answer. Owned pages look different on both axes: in the table above they post the highest retrieval rate of all (0.775) and a much higher citation rate (2.04). When an engine reaches for one, the page earns the link.

The practical read: chasing retrieval (getting crawled, getting pulled) is a different job than chasing citation (getting quoted). A page can be retrieved all day and cited rarely. You want both, and the two levers are not the same page edits.

What this means for your AEO

Three moves come straight out of the data.

First, get into the Reddit conversation if you care about ChatGPT. ChatGPT drives 80.5% of Reddit's citations and cites the forum at 2.64 per retrieval. You cannot fake your way into that with a marketing account, but you can make sure your category's real Reddit threads are accurate, current, and mention you where it is honest to. For ChatGPT specifically, the community layer is part of the answer. Our guide on how to become a trusted LLM source goes deeper on building that off-site trust.

Second, structure owned pages to be quoted, not just crawled. Owned pages cite at 2.04 per retrieval, the highest of any type, but only if the engine can lift a clean, self-contained answer off the page. Short definitional blocks, direct claims, and clear headers do this. See how to structure content for AI extraction for the format that earns the quote.

Third, optimize per engine instead of for "AI" as one blob. Perplexity wants editorial-grade sourcing (13.6% editorial), ChatGPT wants community and reference signals, Google AI Overviews wants authoritative corporate pages (53.4%). A page tuned for one is not automatically tuned for the others. The full method is in our answer engine optimization guide.

Limitations

Read these before you quote the numbers.

This is a three-engine benchmark. ChatGPT, Perplexity, and Google AI Overviews are all that returned data this window. We have no measurement of Claude, Copilot, or Grok, so we make no claim about them at all.

The pooled totals are weighted by project size. Toku and one anonymized client each contributed close to 6,800 conversations, while the smallest project contributed 1,665. Bigger projects move the aggregate more. That is why every finding above was also checked per-project, and the directional patterns (Corporate first, owned pages cite hardest, ChatGPT's Reddit habit) held in each.
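As a rough illustration of why the per-project check matters, the Python sketch below compares a pooled share with per-project shares. The project names and counts are HYPOTHETICAL, invented for this example; they are not the benchmark's per-project data, which is not published in this article.

```python
# HYPOTHETICAL per-project counts, invented for illustration only.
# They are NOT the benchmark's real per-project numbers.
projects = {
    "project_a": {"corporate": 4_000, "total": 6_000},
    "project_b": {"corporate": 2_100, "total": 5_000},
    "project_c": {"corporate": 500, "total": 1_000},
}

# Pooled share: sum the counts first, so large projects dominate the result.
pooled = (
    100
    * sum(p["corporate"] for p in projects.values())
    / sum(p["total"] for p in projects.values())
)

# Per-project shares: each project judged on its own citations.
per_project = {
    name: 100 * p["corporate"] / p["total"] for name, p in projects.items()
}

# Unweighted mean of the per-project shares.
macro = sum(per_project.values()) / len(per_project)

print(f"Pooled share: {pooled:.1f}%")
for name, share in per_project.items():
    print(f"  {name}: {share:.1f}%")
print(f"Unweighted mean of projects: {macro:.1f}%")
```

With these made-up numbers the pooled share sits above the unweighted mean because the largest project has the highest share. A pooled headline only generalizes if the pattern also holds project by project.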

And this is a single 30-day snapshot. Engine behavior shifts. A number true in May is a hypothesis in August until you measure again.

Run this for your brand

We measure exactly this for the clients we run AEO for, per engine, per source type, per competitor, every month. If you want to see where your domain ranks in your own category and which engines are ignoring you, book an AEO audit. You can also read how we got Toku to a top-cited B2B pipeline or see which agencies show up in ChatGPT and Perplexity citations.

Frequently Asked Questions

What do AI engines cite most?

Company-owned corporate websites. Across 160,240 citations from five B2B SaaS brands, corporate sites took 50.7% of all citations, more than three times the next source type (user-generated content at 13.2%). Corporate was the top source type in every project measured, ranging from 41.6% to 66.9% per brand.

Does Reddit really matter for B2B?

Yes, more than any other single domain. Reddit was the most-cited domain in the entire benchmark at 11,237 citations (7.0% of all citations), nearly 2x the next domain. The catch: 80.5% of Reddit's citations came from ChatGPT specifically, which cited it at 2.64 citations per retrieval. Perplexity barely cited it.

Can my own site get cited?

Yes, and it is the highest-converting source type when it gets pulled. Owned pages had a 2.04 citation rate (citations per retrieval), the best of all eight source types. Three of our five brands rank as a top-2 cited source in their own category: Toku at #1 of 1,425 domains, LoudFace at #2 of 2,160, and the anonymized B2B research client at #2 of 1,302. Structure decides whether the page earns the quote.

Do ChatGPT and Perplexity cite differently?

Clearly. ChatGPT leans on user-generated content at 15.0% of its citations, mostly Reddit. Perplexity is the most editorial engine at 13.6% and the lightest on UGC at 8.4%. Google AI Overviews is corporate-heavy at 53.4% (level with Perplexity) and pulls the least Reference content at 3.5%. Tuning a page for one engine does not automatically tune it for the others.

How is AI citation measured?

We sampled 23,545 conversations across five B2B SaaS projects in Peec AI over 30 days ending 2026-06-01, logging all 160,240 citations the engines surfaced. Three engines returned data (ChatGPT, Perplexity, Google AI Overviews). Each cited domain is classified into a source type, and we compute both citation share and citation rate per type.

What is citation rate versus citation share?

Citation share is how often a source appears as a percentage of all citations. Citation rate is citations per retrieval: how reliably an engine quotes a page once it pulls it into context. They diverge: among sources you do not own, UGC has the highest retrieval rate (0.413) but a citation rate of only 0.90, while owned pages have the highest citation rate (2.04), so each lever needs different page work.
