# thejohndwilliams.com robots.txt
# Photography is protected intellectual property. Search engines are welcome;
# AI training crawlers are not.

# --- AI training crawlers (blocked) ------------------------------------------

# Tier 1: publicly identified AI training bots
User-agent: GPTBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: Claude-Web
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: PerplexityBot
Disallow: /

User-agent: Perplexity-User
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: Applebot-Extended
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: Amazonbot
Disallow: /

User-agent: FacebookBot
Disallow: /

User-agent: Meta-ExternalAgent
Disallow: /

User-agent: Meta-ExternalFetcher
Disallow: /

User-agent: Diffbot
Disallow: /

User-agent: ImagesiftBot
Disallow: /

User-agent: cohere-ai
Disallow: /

User-agent: YouBot
Disallow: /

User-agent: omgili
Disallow: /

User-agent: OAI-SearchBot
Disallow: /

# --- Search engines (allowed) -------------------------------------------------

User-agent: *
Allow: /

Sitemap: https://thejohndwilliams.com/sitemap-index.xml