Score one page on the structural foundation AI engines need before they can cite it.
Five minutes. One number. One recommended next fix specific enough to action this afternoon.
You are about to score one page on your store against the five structural-foundation signals AI engines read before they can cite the page at all: sitemap inclusion, crawler permissions, schema validity, internal links, and content originality.
This tool checks the foundation, which is upstream of content optimization. If your product pages also need content-layer work (richer descriptions, comparison tables, FAQ schema, entity richness), run the Product Page Audit Scorecard on that page after the foundation passes.
A note before you start: this is a self-administered scorecard. It cannot crawl your site, so every answer is your honest read of what is on the page. If you are unsure, click the inline help on the question, then answer.
Before we score, one quick check.
Two of the four big AI engines (Gemini and Perplexity) rely on Google's search index to find pages. If Google has not indexed this page, those two engines cannot cite it regardless of how strong the structural work below is. So before we score, confirm Google has actually indexed the page.
How to check in Google Search Console:
- Open Google Search Console and select your property.
- Paste the page URL into the top search bar (the URL Inspection tool).
- Look at the result. If you see "URL is on Google", the page is indexed. If you see "URL is not on Google" or "Crawled, currently not indexed", it is not.
- Once you know the answer, come back and choose Yes or No above.
Is the page in your sitemap, and does Google see it?
sitemap.xml?Are the four AI crawlers explicitly allowed?
robots.txt file explicitly allow all four AI crawlers: GPTBot (ChatGPT), ClaudeBot (Claude), PerplexityBot (Perplexity), and Google-Extended (Gemini)?A correctly-configured robots.txt block for the four AI crawlers looks like this:
User-agent: GPTBot Allow: / User-agent: ClaudeBot Allow: / User-agent: PerplexityBot Allow: / User-agent: Google-Extended Allow: /
Does the page have valid, page-type-appropriate schema?
Open search.google.com/test/rich-results, paste the page URL, and read the result. Note: this dimension checks whether schema is present, correct-type, and valid. It does not check whether the content the schema describes is rich enough. For product-page content-layer work, run the Product Page Audit Scorecard after this tool's foundation pass.
Is the page well-connected from other indexed pages on your site?
Search site:yourdomain.com/the-linking-url in Google. If the page appears in the results, it is indexed. If it does not appear, it is not indexed yet.
Is the body copy unmistakably yours?
Two final signals AI engines weight heavily.
Q10 is a hidden-but-critical signal AI engines read; Q11 is the freshness anchor for the diagnostic clock.
Right-click the page in your browser, then choose View Page Source. Search the raw HTML for a sentence from your body copy. If the sentence appears in the source, the content is in raw HTML. If it does not appear, it is only in the rendered DOM (JavaScript-injected) and AI crawlers may not see it.
Your structural-foundation score
Answer all questions to see your tier and recommended next fix.
Dimension breakdown
What to do next
Run the recommended next fix above this week, then re-score the page next Monday. Most pages move from a clear top-leak tier to Healthy in two to three weeks once the top-leak is addressed.

