AI Search Evaluation ⑦ Citation Overlap — How Much Do the Engines Reference the Same Domains

til/applied-sciences/information-retrieval/ai-search-metrics/07-citation-overlap

07-citation-overlap.mdupdated 2026-07-292457 words

ダブルクリックで英日反転

Applied Sciences · Engineering

Citation Overlap — How Much Do AI Engines Reference the Same Domains?

EN

Citation Overlap measures what fraction of domains are shared across multiple AI search engines. High-overlap domains are cited by every engine; low overlap means a fragmented market with little cross-engine agreement.

Formula

Pairwise (Jaccard): |Domains_A ∩ Domains_B| ÷ |Domains_A ∪ Domains_B|
Overall rate: unique domains cited by ≥2 engines ÷ all unique domains
6 engines → 15 pairwise combinations (6C2)

Worked Example

ChatGPT × Copilot: Jaccard 0.58 — high, due to shared Bing index
Claude × AI Overview: Jaccard 0.18 — very different citation spaces
Overall rate 0.30 → 30% are multi-engine staples; 70% are engine-specific

Business Angle

High-overlap domains (Wikipedia, official sites, industry associations) are the benchmark targets
Goal: place client content on high-overlap domains or raise the client's own domain into that group
Low overall overlap signals a fragmented market — no clear citation consensus

Role in the AI-Search Project

Applied across all 600 target queries
Used as an input to C2 Source Trustworthiness Bias and C3 Brand Visibility analyses
Also used for engine clustering — grouping engines that cite similar sources

→ Target domains cited by every engine; citation overlap quantifies exactly which those are.

Applied Sciences · Engineering

引用重複率 — AIエンジン間でどれだけ同じドメインを引用しているか

JP

Citation Overlap（引用重複率）は、複数のAI検索エンジン間で共通して引用されるドメインの割合を測る指標。重複率が高いドメインは「全エンジンから引用される勝者」、低ければ各エンジンの引用傾向が分散した市場を意味する。

計算式

ペア比較（Jaccard係数）: |A∩B| ÷ |A∪B|
全体重複率: 2つ以上のエンジンで引用されたユニークドメイン数 ÷ 全ユニークドメイン数
6エンジンの場合、ペア組み合わせは15通り（6C2）

具体例

ChatGPT × Copilot: Jaccard 0.58 — Bingインデックス共有で高い一致
Claude × AI Overview: Jaccard 0.18 — 参照空間が大きく異なる
全体重複率0.30 → 30%が複数エンジン共通の定番、70%はエンジン固有

ビジネス的意義

高重複ドメイン（Wikipedia・公式サイト・業界団体サイト）がベンチマーク目標
クライアント情報を高重複ドメインに掲載、または自社サイトをその群に引き上げることが戦略目標
全体重複率が低い市場は引用合意が薄く、特定ドメインへの集中投資効果が限定的

プロジェクト内での役割

対象クエリ600問すべてに適用
C2「情報源の信頼性バイアス」・C3「ブランド可視性」の派生分析へのインプット指標
引用傾向が近いエンジンのクラスタリング（グループ化）にも活用

→ 全エンジンから引用されるドメインを狙え——引用重複率がその対象を定量的に特定する。

Applied Sciences · Engineering

Citation Overlap — How Much Do AI Engines Reference the Same Domains?

EN

Citation Overlap measures what fraction of domains are shared across multiple AI search engines. High-overlap domains are cited by every engine; low overlap means a fragmented market with little cross-engine agreement.

Formula

Pairwise (Jaccard): |Domains_A ∩ Domains_B| ÷ |Domains_A ∪ Domains_B|
Overall rate: unique domains cited by ≥2 engines ÷ all unique domains
6 engines → 15 pairwise combinations (6C2)

Worked Example

ChatGPT × Copilot: Jaccard 0.58 — high, due to shared Bing index
Claude × AI Overview: Jaccard 0.18 — very different citation spaces
Overall rate 0.30 → 30% are multi-engine staples; 70% are engine-specific

Business Angle

High-overlap domains (Wikipedia, official sites, industry associations) are the benchmark targets
Goal: place client content on high-overlap domains or raise the client's own domain into that group
Low overall overlap signals a fragmented market — no clear citation consensus

Role in the AI-Search Project

Applied across all 600 target queries
Used as an input to C2 Source Trustworthiness Bias and C3 Brand Visibility analyses
Also used for engine clustering — grouping engines that cite similar sources

→ Target domains cited by every engine; citation overlap quantifies exactly which those are.

Applied Sciences · Engineering

引用重複率 — AIエンジン間でどれだけ同じドメインを引用しているか

JP

Citation Overlap（引用重複率）は、複数のAI検索エンジン間で共通して引用されるドメインの割合を測る指標。重複率が高いドメインは「全エンジンから引用される勝者」、低ければ各エンジンの引用傾向が分散した市場を意味する。

計算式

ペア比較（Jaccard係数）: |A∩B| ÷ |A∪B|
全体重複率: 2つ以上のエンジンで引用されたユニークドメイン数 ÷ 全ユニークドメイン数
6エンジンの場合、ペア組み合わせは15通り（6C2）

具体例

ChatGPT × Copilot: Jaccard 0.58 — Bingインデックス共有で高い一致
Claude × AI Overview: Jaccard 0.18 — 参照空間が大きく異なる
全体重複率0.30 → 30%が複数エンジン共通の定番、70%はエンジン固有

ビジネス的意義

高重複ドメイン（Wikipedia・公式サイト・業界団体サイト）がベンチマーク目標
クライアント情報を高重複ドメインに掲載、または自社サイトをその群に引き上げることが戦略目標
全体重複率が低い市場は引用合意が薄く、特定ドメインへの集中投資効果が限定的

プロジェクト内での役割

対象クエリ600問すべてに適用
C2「情報源の信頼性バイアス」・C3「ブランド可視性」の派生分析へのインプット指標
引用傾向が近いエンジンのクラスタリング（グループ化）にも活用

→ 全エンジンから引用されるドメインを狙え——引用重複率がその対象を定量的に特定する。

Related notes

148 notestil

Citation Overlap — How Much Do AI Engines Reference the Same Domains?

Formula

Worked Example

Business Angle

Role in the AI-Search Project

引用重複率 — AIエンジン間で​どれだけ​同じ​ドメインを​引用しているか

計算式

具体例

ビジネス的意義

プロジェクト内での​役割

Citation Overlap — How Much Do AI Engines Reference the Same Domains?

Formula

Worked Example

Business Angle

Role in the AI-Search Project

引用重複率 — AIエンジン間で​どれだけ​同じ​ドメインを​引用しているか

計算式

具体例

ビジネス的意義

プロジェクト内での​役割

Related notes

引用重複率 — AIエンジン間でどれだけ同じドメインを引用しているか

プロジェクト内での役割

引用重複率 — AIエンジン間でどれだけ同じドメインを引用しているか

プロジェクト内での役割