As AI systems increasingly mediate product discovery, brand evaluation, and purchasing decisions, a critical gap has emerged in marketing measurement: no standard metric captures whether a brand is visible to AI. Traditional search metrics — keyword rankings, domain authority, click-through rates — measure visibility to humans navigating search engine results pages. They do not measure whether an AI system knows a brand exists, can accurately describe it, or will recommend it when a user asks for help.
The Ghost Score is a standardized metric (0–100) that quantifies a brand's visibility across the four major AI platforms consumers use for product discovery: ChatGPT, Claude, Gemini, and Perplexity. This document defines the methodology, explains the measurement framework, establishes scoring bands with business-context definitions, and provides preliminary industry benchmarks.
Our goal is not to replace existing SEO and brand measurement tools. It is to define the metric that captures the thing they cannot: whether your brand exists in the AI layer that sits between your customer and your product.
For two decades, digital brand visibility has been synonymous with search engine visibility. Brands invested in SEO to appear on the first page of Google results. The measurement infrastructure — keyword rank tracking, domain authority scores, SERP analysis — was built entirely around this paradigm.
That paradigm is fracturing. Gartner projects a 50% decline in traditional organic search traffic by 2028. Capgemini research indicates that 58% of consumers have already replaced traditional search with generative AI tools for product research and recommendations. Google itself has acknowledged this shift by deploying AI Overviews, which synthesize answers directly rather than presenting a list of links.
When a consumer asks ChatGPT “What's the best project management tool for a 20-person startup?” the AI does not return ten blue links. It returns a synthesized recommendation — typically naming two to four products, explaining their strengths, and sometimes making a direct suggestion. The brand that appears in that answer captures the consumer's attention. The brand that doesn't appear doesn't just rank lower — it functionally doesn't exist.
This is a qualitatively different visibility problem than traditional search. A brand on page two of Google results is disadvantaged. A brand absent from an AI's synthesized answer is invisible.
Existing measurement tools were designed for a link-based discovery model. This framework breaks down in AI-mediated discovery for several reasons:
The business consequence of AI invisibility is straightforward: if an AI system mediates a purchasing decision and your brand is not mentioned, you have zero probability of being selected through that channel. As AI-mediated discovery grows, the cost of invisibility compounds.
Early evidence suggests this effect is already material. Brands that appear consistently in AI recommendations report increased direct traffic and higher conversion rates from AI-referred visitors compared to traditional search visitors — likely because AI recommendations carry an implicit endorsement that a ranked search result does not.
The Ghost Score exists to make this risk measurable, trackable, and actionable.
AI Brand Visibility is the degree to which a brand is recognized, accurately described, and recommended by AI systems when users ask questions relevant to that brand's category, use cases, or competitive set.
This definition has three components, each of which the Ghost Score captures:
AI brand visibility is distinct from several adjacent concepts:
AI language models are fundamentally nondeterministic. The same query submitted to the same model twice can produce different responses. Any credible measurement methodology must account for this variance rather than ignoring it.
Given nondeterministic outputs, the only stable measurement approach is frequency across a sufficient sample of queries. The Ghost Score asks: across N relevant queries, what proportion of responses mention the brand?
This approach works because while individual responses are noisy, the aggregate signal is stable. A brand that appears in 70% of relevant queries today will appear in approximately 65–75% tomorrow, even as individual responses vary. The frequency metric is robust to the nondeterminism that makes positional ranking meaningless.
The Ghost Score methodology uses official APIs exclusively:
The Ghost Score methodology uses a minimum of 10 queries per platform, per measurement cycle.
Queries are drawn from three intent types:
The Ghost Score is calculated as:
A “mention” is the brand name appearing in the AI's response in a context that correctly identifies the brand and its category:
In the current methodology (v1.0), all four platforms are weighted equally at 25% each. Equal weighting was chosen because reliable market share data for AI-assisted product discovery is not yet available. Future versions may introduce market-share-based weighting as usage data matures.
What it means: The brand is invisible or nearly invisible to AI systems. Across 40 queries, the brand appears in fewer than 14 responses.
Business implication: The brand is not participating in AI-mediated discovery. When potential customers ask AI systems for recommendations, competitors are named and the brand is not. This represents a complete loss of the AI discovery channel.
Typical profiles: Early-stage startups (pre-Series A), local businesses, companies in highly specialized B2B niches, brands that have relied primarily on paid acquisition with minimal organic content footprint.
What it means: The brand has partial visibility. AI systems know it exists but mention it inconsistently — reliably on some platforms but not others, or for some query types but not others.
Business implication: The brand participates in AI-mediated discovery intermittently. Competitors with “Known” scores are consistently chosen in head-to-head comparisons. The brand has a foundation to build on but is at risk of fading further as competitors invest.
Typical profiles: Mid-market SaaS companies, established SMBs with some press coverage, brands in competitive categories where AI systems rotate between many options.
What it means: The brand is reliably visible across AI systems. Across 40 queries, the brand appears in 27 or more responses. AI systems know the brand, describe it accurately, and recommend it with consistency.
Business implication: The brand is an active participant in AI-mediated discovery. This represents a functioning AI discovery channel that generates awareness without paid advertising.
Typical profiles: Category leaders, well-funded companies with significant PR and content operations, open-source projects with strong community presence.
Based on preliminary measurement across multiple B2B SaaS categories:
The distribution is heavily left-skewed. Most brands are more invisible to AI than they expect.
The four platforms fall into two architectural categories:
Perform real-time web searches to ground responses. Visibility is primarily a function of current web footprint — dynamic, responsive to content and SEO efforts.
Rely primarily on knowledge learned during training. Visibility is a function of historical web footprint at training cutoff. 6–18 month lag between web presence improvements and parametric visibility.
Measuring only one platform gives an incomplete picture. The four-platform measurement provides:
The following benchmarks are based on preliminary measurement across B2B and B2C categories. They represent estimated typical Ghost Scores, not guarantees, and will be updated as the measurement dataset grows.
| Company Profile | Est. Ghost Score | Notes |
|---|---|---|
| Pre-seed / bootstrapped, < 1 year | 0–5 | Effectively invisible. No training data on the brand. |
| Seed-stage, 1–2 years | 5–20 | Occasional mentions, usually only on Perplexity if recent web coverage exists. |
| Series A, moderate PR coverage | 15–35 | Emerging visibility; may appear on search-grounded platforms but rarely on parametric. |
| Series B+, established player | 30–55 | Inconsistent visibility; appears for broad queries but gaps on specific use cases. |
| Category leader, strong content engine | 60–85 | Reliably visible; appears in most relevant queries across most platforms. |
| Dominant market leader (e.g., Salesforce in CRM) | 80–95+ | Near-universal visibility; AI systems default to naming this brand. |
| Enterprise-only, minimal public content | 10–30 | Low visibility despite revenue scale; enterprise sales motions don't generate public web presence. |
| Open-source project, strong community | 40–75 | Often higher than revenue would suggest, due to community-generated content and GitHub presence. |
| Category Density | Leader Score | Median Score | Tail Score |
|---|---|---|---|
| Oligopoly (2–4 well-known brands) | 85–95 | 50–65 | 10–25 |
| Moderately competitive (5–15 brands) | 75–90 | 25–40 | 0–10 |
| Fragmented (15+ competitors) | 60–80 | 15–30 | 0–5 |
| Pattern | Perplexity/Gemini | ChatGPT/Claude | Interpretation |
|---|---|---|---|
| Rising brand | High (50–80) | Low (10–30) | Recent content gains haven't entered parametric training data. Brand on upward trajectory. |
| Legacy brand | Low (20–40) | High (50–70) | Strong historical presence but declining current relevance. Warning sign of fading relevance. |
| Balanced brand | Similar across all | Similar across all | Consistent long-term presence. Healthiest profile. |
| Perplexity-only brand | High (60+) | Low (< 20) | Visibility driven by recent web content. Not yet in training data. Fragile. |
The Ghost Score methodology is published openly. We encourage researchers, analysts, journalists, and marketers to:
Commercial use of the Ghost Score name in competing products requires written permission.
This is a living document. As measurement practices mature and empirical data accumulates, the methodology will be updated. We welcome feedback, critique, and collaboration from researchers, practitioners, and platform providers.
Contact: methodology@ghostedbyai.co