๐ŸฆŠStackFox
Webz.io logo

Webz.io Extended

Tier 4
๐Ÿ“š AI Trainingby Webz.io โ†—ยท Since 2015

Crawls content for Webz.io's data repository sold for AI training.

User-Agent Token
webzio-extended
Respects robots.txt
Yes
Impact Level
Niche
Smaller players and developer tools
Estimated Reach
Smaller players and developer tools

๐ŸŽฏWhat is Webz.io Extended?

Webz.io Extended is an AI training crawler operated by Webz.io. Collects data to train AI models.

๐Ÿ“Š How Your Data is Used

Webz.io sells structured web data to AI companies for training.

๐Ÿšซ What Happens If You Block

Content won't be included in datasets sold to AI companies.

๐Ÿ’ก Good to Know

Commercial data provider. Your content may end up training various AI models.

๐ŸขAbout Webz.io

Webz.io logo
Webz.io

Webz.io operates 3 known bots for AI model training.

๐Ÿ›ก๏ธwebzio-extended robots.txt Configuration

Control webzio-extended access to your website using robots.txt directives.

Block webzio-extended

To completely block Webz.io Extended from crawling your site:

User-agent: webzio-extended
Disallow: /

Allow webzio-extended Full Access

To explicitly allow Webz.io Extended to crawl your entire site:

User-agent: webzio-extended
Allow: /

Selective Access for webzio-extended

To allow Webz.io Extended but restrict certain directories:

User-agent: webzio-extended
Disallow: /private/
Disallow: /api/
Disallow: /admin/
Allow: /

โœ“ Webz.io Extended respects robots.txt directives.

webzio-extended User-Agent String

The user-agent token for Webz.io Extended is:

webzio-extended

Check Your Site's AI Policy

See if you're blocking or allowing Webz.io Extended and other AI crawlers.