๐ŸฆŠStackFox
Google logo

Google-Extended

Tier 1
๐Ÿ“š AI Trainingby Google โ†—ยท Since 2023

Controls whether content is used for Gemini/Vertex AI training. Does NOT affect Google Search.

User-Agent Token
Google-Extended
Respects robots.txt
Yes
Impact Level
Critical
Billions of users - OpenAI, Anthropic, Google
Estimated Reach
350M+ Gemini users, 4B+ Google users

๐ŸŽฏWhat is Google-Extended?

Google-Extended is an AI training crawler operated by Google. Collects data to train AI models.

๐Ÿ“Š How Your Data is Used

Pre-training for Gemini models. Separate from Search indexing (Googlebot).

๐Ÿšซ What Happens If You Block

Content won't train Gemini models. Google Search ranking completely UNAFFECTED.

๐Ÿ’ก Good to Know

Safe to block without SEO impact. Only affects AI training, not search visibility.

๐ŸขAbout Google

Google logo
Google

Google operates 9 known bots for AI model training. Their service reaches 350M+ Gemini users, 4B+ Google users.

๐Ÿ›ก๏ธGoogle-Extended robots.txt Configuration

Control Google-Extended access to your website using robots.txt directives.

Block Google-Extended

To completely block Google-Extended from crawling your site:

User-agent: Google-Extended
Disallow: /

Allow Google-Extended Full Access

To explicitly allow Google-Extended to crawl your entire site:

User-agent: Google-Extended
Allow: /

Selective Access for Google-Extended

To allow Google-Extended but restrict certain directories:

User-agent: Google-Extended
Disallow: /private/
Disallow: /api/
Disallow: /admin/
Allow: /

โœ“ Google-Extended respects robots.txt directives.

Google-Extended User-Agent String

The user-agent token for Google-Extended is:

Google-Extended

Check Your Site's AI Policy

See if you're blocking or allowing Google-Extended and other AI crawlers.