Why robots.txt is critical
Your robots.txt file configuration directly impacts your storeβs accessibility to AI crawlers. A misconfigured robots.txt can make your store completely invisible to AI agents.
Why it matters for AI
The robots.txt file is the first thing a crawler visits before exploring your site. If it contains a blocking rule, a respectful crawler (like GPTBot or ClaudeBot) stops immediately. Your store is never read, never indexed, never recommended.
AI Crawlers to Know
| Crawler | Operator | Usage |
|---|---|---|
GPTBot | OpenAI | ChatGPT training and general AI knowledge |
OAI-SearchBot | OpenAI | ChatGPT search and citation (no training data collected) |
ChatGPT-User | OpenAI | ChatGPT with real-time browsing |
PerplexityBot | Perplexity | Perplexity Search & Shopping |
ClaudeBot | Anthropic | Claude with browsing (traffic doubled Q3 2025βQ1 2026) |
Google-Extended | Gemini, AI Overview | |
Amazonbot | Amazon | Amazon search & Alexa |
Bytespider | ByteDance | TikTok AI features |
Why OAI-SearchBot matters: Unlike GPTBot (used for training), OAI-SearchBot is OpenAIβs real-time retrieval crawler that powers ChatGPTβs search and shopping citation features. Allowing it ensures your product pages can appear directly cited in ChatGPT Shopping results. OpenAI updated its crawler documentation in December 2025 to clarify the distinct roles of each bot (source: OpenAI Developer Docs, December 2025).
Most Common Mistakes
- Global disallow :
Disallow: /blocks everything for all crawlers - Specific AI bot blocking : Some SEO guides recommend blocking AI crawlers. Counterproductive if you want recommendations.
- Confusion :
Disallow: /(blocks everything) vsDisallow: /policies/(blocks only policies)
Recommended Configuration
User-agent: *
Disallow: /admin
Disallow: /cart
Disallow: /orders
Disallow: /checkouts/
Disallow: /account
# OpenAI crawlers (training + search/citation)
User-agent: GPTBot
Allow: /
User-agent: OAI-SearchBot
Allow: /
User-agent: ChatGPT-User
Allow: /
# Other AI crawlers
User-agent: PerplexityBot
Allow: /
User-agent: ClaudeBot
Allow: /
User-agent: Google-Extended
Allow: /
Sitemap: https://your-store.com/sitemap.xml
Related articles
- llms.txt: The New robots.txt for LLMs
- Understanding Your GEO Score: 9 Factors Explained
- GEO vs SEO: Whatβs the Difference for E-commerce?
- Schema.org Product: Why and How on Shopify
- Sell on ChatGPT: The Complete Shopify Guide for 2026
Ready to check your store? Run a free GEO audit β