Why robots.txt is critical
Your robots.txt file configuration directly impacts your store’s accessibility to AI crawlers. A misconfigured robots.txt can make your product pages harder to retrieve, cite, validate, or recommend.
Why it matters for AI
The robots.txt file is the first thing a crawler visits before exploring your site. If it contains a blocking rule, a respectful crawler stops immediately. The important part is role separation: search bots, user-action fetchers, training bots, and ads-validation bots do not have the same business impact.
AI Crawlers to Know
| Crawler | Operator | Usage |
|---|---|---|
GPTBot | OpenAI | ChatGPT training and general AI knowledge |
OAI-SearchBot | OpenAI | ChatGPT search and citation (no training data collected) |
OAI-AdsBot | OpenAI | ChatGPT ads landing-page validation and relevance |
ChatGPT-User | OpenAI | ChatGPT with real-time browsing |
PerplexityBot | Perplexity | Perplexity Search & Shopping |
ClaudeBot | Anthropic | Claude with browsing (traffic doubled Q3 2025 - Q1 2026, SE Ranking, 2026) |
Google-Extended | AI usage/training control, not Google Search crawling | |
Amazonbot | Amazon | Amazon search & Alexa |
Bytespider | ByteDance | TikTok AI features |
Why OAI-SearchBot matters: Unlike GPTBot (used for training), OAI-SearchBot is OpenAI’s retrieval crawler for search and shopping discovery. OAI-AdsBot is different again: it validates ad landing pages and should be treated as paid-media readiness, not organic GEO.
Most Common Mistakes
- Global disallow :
Disallow: /blocks everything for all crawlers - Specific AI bot blocking : Some SEO guides recommend blocking AI crawlers. Counterproductive if you want recommendations.
- Confusion :
Disallow: /(blocks everything) vsDisallow: /policies/(blocks only policies)
Recommended Configuration
User-agent: *
Disallow: /admin
Disallow: /cart
Disallow: /orders
Disallow: /checkouts/
Disallow: /account
# OpenAI crawlers (training + search/citation)
User-agent: GPTBot
Allow: /
User-agent: OAI-SearchBot
Allow: /
User-agent: ChatGPT-User
Allow: /
# Other AI crawlers
User-agent: PerplexityBot
Allow: /
User-agent: ClaudeBot
Allow: /
User-agent: Google-Extended
Allow: /
Sitemap: https://your-store.com/sitemap.xml
Related articles
- llms.txt for Shopify: Useful AI Discovery File, Not a Primary Ranking Signal
- Agent Readiness Score: the 11 /.well-known/ files to publish in 2026
- Understanding Your GEO Score: 9 Factors Explained
- GEO vs SEO: What’s the Difference for E-commerce?
- Schema.org Product: Why and How on Shopify
- Sell on ChatGPT: The Complete Shopify Guide for 2026
Ready to check your store? Run a free GEO audit →