Volume I · Q2 2026 · AI Visibility

Volume II field re-run in progress

The 2026 Houston AI Visibility Index

How often the assistants Houston actually uses — ChatGPT, Claude, Gemini, Grok, and Perplexity — name, cite, and rank the city's businesses. A 3,750-response cross-model study, and the first standardized measure of who exists inside the AI answer.

By Shayne Beavan

Founder, Deep AI Solutions · Inventor of record, 6 USPTO filings

May 30, 2026Updated June 30, 2026 16 min

When a Houston resident opens ChatGPT and asks who to call for a root canal, a flooded kitchen, or a car-accident claim, an answer comes back in seconds. It is short. It is confident. It names two or three businesses. There is no second page.

For twenty years, being found meant ranking on Google. That problem had a tail: page two still existed, position seven still got clicks. The AI answer has no tail. You are named, or you are not in the conversation. We built the Houston AI Visibility Index to measure, for the first time and at the level of a single composite score, who the machines are naming — and who they have quietly erased.

3,750: Machine responses
31: Median HAVI score
48%: Effectively invisible
94–27%: Citation spread

Author’s note

Every local-business owner I talk to in Houston is about to lose the front door of their business, and most of them do not know it yet. I filed six patent applications on the mechanics of this shift and then did the unglamorous thing: I ran the measurement. This is Volume I of a recurring index. The numbers are uncomfortable. They are also, for the operators willing to act, the clearest map of unclaimed ground I have seen since the early days of search.

— Shayne Beavan

Executive summary

We dispatched 750 buyer-intent prompts across 5 frontier assistants — ChatGPT, Claude, Gemini, Grok, and Perplexity — and analyzed all 3,750 machine-responses for three things: was a business named, was the mention carried by a citation, and were the stated facts true. We scored 150 anonymized business cohorts across 3 Houston sectors on a single 0–100 composite. Six findings stand out:

The median Houston business scores 31 out of 100. Visibility is the exception, not the baseline. 48% of the frame is effectively invisible — named in fewer than one in five relevant answers.
AI recommendation is winner-take-most. The top five cohorts in each sector capture the majority of all mentions; everyone else competes for the remainder.
The five assistants disagree sharply on trust. Citation rate runs from 94% (Perplexity) down to 27% (ChatGPT). The most-used assistant cites the least.
Hallucination is real and uneven. Models invent verifiable facts about real businesses between 1.9% (Claude) and 5.8% (Grok) of the time.
Structured data is the strongest available predictor of being named — schema.org completeness tracks mention rate at r = 0.71, ahead of Google Business Profile completeness and classic domain authority.
Machine authority is geographic. Inner-loop ZIP codes are recommended roughly 3.2× more often than equally dense suburban codes, leaving large authority dead zones unclaimed.

What the Index measures

HAVI is a composite, not a single metric, because being recommended by an AI is not a single behavior. A business has to be retrieved, trusted, cited, and described accurately. Each cohort receives a 0–100 score built from four weighted sub-indices.

What the score is made ofHAVI v1.0

40%

25%

20%

15%

Mention Rate · Share of relevant buyer-intent prompts in which the business is named at all.
Machine Trust · Entity completeness a model can verify: schema.org Organization markup, consistent name/address/phone across the web, and a connected sameAs graph.
Citation Capture · Share of mentions that arrive with a source link the user (and the model) can follow back to the business.
Hallucination Resistance · One minus the rate at which models fabricate verifiable facts — hours, location, services, ownership — about the business.

The HAVI composite weights being named (Mention Rate) above all else, then the verifiable trust signals that drive it.

The frame is Houston metro (Harris + Fort Bend counties): 50 established providers in each of 3 sectors, assembled from public directories and scored as anonymized cohorts. The prompt bank holds 250 buyer-intent questions per sector — the phrasing residents actually type, not keywords. Every prompt ran against every model during May 1–28, 2026.

Finding 1 — The Houston visibility gap

The metro-wide median composite is 31 out of 100. Read that plainly: a typical established Houston business is named in roughly a third of the AI answers where it could belong, often without a citation, and sometimes with a fabricated detail attached.

The distribution is steep. The top decile clears 72. Just below the leaders, the floor drops out: 48% of the frame scores under 20 — businesses that exist in public records, on Google, and on their own websites, and are almost entirely absent from the AI layer now sitting in front of all three.

The Houston visibility gapHAVI v1.0

31Median

72+Top decile

020406080100

Metro-wide composite on a 0–100 scale. The median business scores 31. The shaded band marks the 48% of the frame scoring below 20 — present everywhere except the AI answer.

Finding 2 — Winner-take-most

Search returned a page of ten options. The assistant returns a list of three. Across the frame, the top five cohorts in each sector capture the majority of every mention the sector receives.

Winner-take-most, by sectorHAVI v1.0

Sector	Median	Top-5 share	Invisible
DentalGeneral, cosmetic, and implant practices	38	61%	41%
LegalPersonal-injury and family-law firms	29	56%	49%
Home ServicesHVAC, plumbing, and roofing	24	64%	55%

Median composite, the share of all sector mentions captured by just the top five cohorts, and the share of each sector that is effectively invisible.

This is the structural break between search and answer. A tenth-place result on Google still drew traffic. A tenth-place business in an AI’s consideration set is usually not mentioned at all. The cost of being average moved from *less traffic* to *no presence*. Home services is the most concentrated and the least machine-present sector in the study, with a median of just 24; dental, where the earliest operators have started publishing structured identity, leads at 38.

Finding 3 — The five assistants do not agree

Treating “AI” as one thing is the most expensive assumption an operator can make. The assistants differ most on the dimension buyers use to trust an answer: whether it is cited.

The five assistants do not agreeHAVI v1.0

Perplexity

94%

cite

2.4% halluc

Gemini

68%

cite

3% halluc

Claude

41%

cite

1.9% halluc

Grok

33%

cite

5.8% halluc

ChatGPT

27%

cite

4.1% halluc

Citation rate is the share of mentions that carry a followable source link (teal, higher is better). Hallucination rate is the share of brand references containing a fabricated fact (amber, lower is better). The most-used assistant, ChatGPT, cites the least.

Perplexity cites 94% of the time and will omit a business rather than invent one — if your entity is not on a crawlable page, you are simply left out. ChatGPT, the most-used assistant in the study, cites only 27% of the time, which makes its recommendations the hardest for a buyer to verify and the hardest for a business to trace back to a source. Grok is the most willing to assert specifics it cannot support, fabricating a verifiable fact 5.8% of the time. Claude fabricates least, at 1.9%, but is also the quickest to decline to name anyone when its confidence runs low.

Finding 4 — Structured data predicts presence

We tested which signals move with mention rate across the frame. The strongest is not a classic SEO signal. It is whether a business publishes machine-readable identity: schema.org Organization markup, a consistent name/address/phone footprint, and a connected sameAs graph.

What predicts being namedHAVI v1.0

schema.org Organization completeness

0.71

Consistent NAP across the web

0.55

Review volume + recency

0.44

Google Business Profile completeness

0.38

Classic domain authority (backlinks)

0.21

Pearson correlation between each signal and a cohort's mention rate across the frame. Teal signals are machine-readable identity; slate signals are classic SEO. These are associations, not proof of cause.

These are associations, not proof of cause, and we report them as such. But the ordering is consistent and it is actionable: the signals a retrieval system can parse directly outrank the signals built for a human skimming a results page. The AI internet weights structured data differently than the search internet did. The businesses publishing it are not winning because they are larger. They are winning because they are legible.

Finding 5 — The geography of machine authority

Machine authority clusters, and the cluster is the inner loop. ZIP codes close to the urban core concentrate AI recommendations far beyond their share of businesses, while high-population suburban codes — full of qualified providers — go thinly represented.

The geography of machine authorityHAVI v1.0

Authority clusters

Recommended far above their share of businesses

77002
77006
77019
77027
77098
77005

Authority dead zones

High density, low AI presence — unclaimed

77449
77084
77083
77072
77036

Inner-loop ZIP codes draw roughly 3.2× the AI mentions of equally dense suburban codes. The dead zones are full of qualified providers and thinly represented in AI answers.

The dead zones are the opportunity. A roofer in 77449 or a dentist in 77072 competes against far fewer machine-legible rivals than one inside the loop. Authority there is not contested. It is unclaimed.

What this means for operators

Whether an AI assistant recommends you over a competitor in the next twelve months is, to a first approximation, a function of how completely your business is represented in machine-readable form today. That is the uncomfortable conclusion of this Index, and also the hopeful one: the gap is large, it is measurable, and it is mostly unclaimed. The businesses that close it first will hold the position the way early domain owners held theirs — not because they were biggest, but because they were there first and built to be read.

The mid-2026 update — what changed since May

*Added June 30, 2026. The figures in this section are third-party measurements, cited as such. Several of them independently corroborate what this Index found in Houston.*

When Volume I went out, the claim that AI had become a front door for local businesses was still, to many operators, a forecast. The national evidence has since caught up — and in several places it matches this Index's own results almost exactly.

Why this matters now — the mid-2026 pictureSOURCED · 2026

45%: of consumers now use AI to find local businesses — up from 6% a year ago
1.2%: of a category's local businesses are surfaced by ChatGPT in controlled tests
900M: weekly active users on ChatGPT; Google's AI Mode crossed 1B monthly
15→8%: organic click-through roughly halves when an AI summary appears

Sources: BrightLocal · PolyGrowth · Search Engine Land · Google · Pew Research Center

The national evidence has caught up to the thesis: AI is now a primary local-discovery surface, and it concentrates recommendations into a tiny set of providers.

Start with demand. The share of consumers using AI to find a local business jumped from 6% to 45% in a single year, moving AI past Yelp and TripAdvisor to the #3 discovery channel — and 88% of those users still fact-check the AI's sources, which makes being *cited and accurate* the entire contest. Now the supply side: in a controlled multi-engine experiment, ChatGPT surfaced just 1.2% of a category's local businesses, against 35.9% for Google's local 3-pack. That is winner-take-most, confirmed outside Houston — the same structure Finding 2 measured here.

Winner-take-most, confirmed nationallySOURCED · 2026

Google 3-pack

35.9%

Gemini

11%

Perplexity

7.4%

ChatGPT

1.2%

Source: PolyGrowth

In a controlled multi-engine experiment, ChatGPT surfaced just 1.2% of a category's local businesses — versus 35.9% for Google's local 3-pack. AI recommendation is far more concentrated than the ten-blue-links era it replaces.

The assistants still disagree on trust, and the national audit found the same ordering this Index did. An 8-engine study of 1,600 queries put citation-error rates between 37% (Perplexity, the most accurate) and 94% (Grok-3, the least) — mirroring the citation and fabrication spread in Finding 3. Meanwhile the open web is contracting underneath all of it: when an AI summary appears, organic click-through roughly halves and only 1% of users click a link inside the summary, even as Google's AI Mode crossed a billion monthly users.

Independent corroboration — citation accuracy by engineSOURCED · 2026

Perplexity

37% wrong ·best

Grok-3

94% wrong ·worst

Source: Columbia Journalism Review / Tow Center

An 8-engine audit of 1,600 queries found over 60% of responses carried incorrect citations — with the two named endpoints running from 37% wrong (Perplexity, best) to 94% (Grok-3, worst), the same trust ordering this Index measured for Houston businesses. Lower is better.

This is also where we update our own posture honestly. Volume I found schema.org completeness the strongest *available* predictor of being named in our frame (r = 0.71). The broader 2026 evidence is more mixed than that single number suggests: a 300,000-domain analysis found earned authority, quality backlinks, branded mentions, and complete Google Business Profile data to be the consistent drivers of LLM citations, while `llms.txt` showed no measurable effect and schema's direct citation lift is genuinely contested. We report it the way the data reads — the durable levers for a Houston operator are authority, reviews, GBP completeness, and machine-legible identity working together, not any one file or markup trick.

What actually drives AI citations — and what is hypeSOURCED · 2026

Google Business Profile completeness + review depthSupported
Correlated most strongly with AI citation in dentistry — the category closest to the Index's subjects. PolyGrowth
Earned authority, backlinks (DA 60+), brand & third-party mentionsSupported
Consistent drivers of LLM citations across a 300k-domain analysis; branded web mentions topped ChatGPT and AI Overviews. Search Engine Journal / SE Ranking
Citing sources, quotations, statistics in your own content (GEO)Supported
The strongest GEO levers in the peer-reviewed benchmark — up to a 40% visibility lift, though it varies by domain. Aggarwal et al. (KDD 2024)
schema.org structured dataMixed
Worth implementing for eligibility and disambiguation, but external evidence on a direct citation lift is genuinely contradictory. The Index's own frame found it predictive; at least one third-party study found no correlation. Search Engine Journal / SE Ranking
llms.txtOverhyped
Found on ~10% of domains, with no measurable relationship to AI-citation frequency across 300k domains. Google has confirmed it has no ranking effect. Search Engine Journal / SE Ranking

The external evidence, graded honestly. We down-weight the tactics it does not support — including treating schema and llms.txt as guaranteed citation multipliers.

None of this changes Volume I's measured numbers — it raises the stakes on them. A Volume II field re-run on the same five-model instrument is built and underway. What the mid-2026 evidence settles is the question operators kept asking in May: *is this real yet?* It is.

Be the answer when AI is asked. Or be the business that wasn’t.

Methodology and limitations

Frame. 150 established providers (50 per sector) across Houston metro (Harris + Fort Bend counties), drawn from public directories. Cohorts are anonymized; no score in this report is attributed to a named third-party business.

Instrument. Deep AI Solutions’ cross-model AI Visibility Scanner dispatched 250 buyer-intent prompts per sector (750 total) to the then-current flagship consumer model from each of the five providers during May 1–28, 2026, capturing 3,750 machine-responses. Each response was parsed for mention, rank, citation, sentiment, and factual accuracy against a verified record of each cohort.

Scoring. The HAVI composite weights Mention Rate (40%), Machine Trust (25%), Citation Capture (20%), and Hallucination Resistance (15%).

Limitations. Findings describe this frame, this prompt bank, and this measurement window. Correlations are associations, not causal claims. Model behavior changes between versions — consumer defaults shifted even during the window we measured. This is Volume I of a recurring Index; method and frame will be versioned as it repeats. Figures in Findings 1–5 are Deep AI Solutions’ own measured results, not independent third-party audits — the *mid-2026 update* section above is separately sourced to named third parties and labeled as such. A Volume II field re-run on the same instrument is built and in progress; this June refresh updates the external landscape, not the Index’s primary measurements.