🦊 StackFox

Spider

Tier 4
📚 AI Training, by Spider Cloud

Converts web data into formats optimized for AI and RAG systems.

User-Agent Token: Spider
Respects robots.txt: Yes
Impact Level: Niche (smaller players and developer tools)
Estimated Reach: Smaller players and developer tools

🎯 What is Spider?

Spider is an AI training crawler operated by Spider Cloud. It collects web data that is used to train AI models.

📊 How Your Data is Used

Spider transforms the web pages it crawls into LLM-ready formats for use in AI and RAG systems.

🚫 What Happens If You Block

If you block Spider, it cannot crawl your pages or convert your content for AI consumption.

🏢 About Spider Cloud

Spider Cloud

Spider Cloud operates 1 known bot for AI model training.

🛡️ Spider robots.txt Configuration

Control Spider access to your website using robots.txt directives.

Block Spider

To completely block Spider from crawling your site:

User-agent: Spider
Disallow: /
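
One way to sanity-check the blocking rule before deploying it is to run it through a robots.txt parser. The sketch below uses Python's standard urllib.robotparser module. The helper name is_allowed and the example.com URLs are placeholders chosen for illustration, and Spider's own parser may behave differently in edge cases.

```python
from urllib.robotparser import RobotFileParser

# The blocking rule from this section, as it would appear in robots.txt.
ROBOTS_TXT = """\
User-agent: Spider
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if user_agent may fetch url under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # example.com is a placeholder domain, not a real deployment.
    for agent, url in [
        ("Spider", "https://example.com/"),
        ("Spider", "https://example.com/blog/post"),
        ("SomeOtherBot", "https://example.com/"),  # hypothetical other crawler
    ]:
        print(agent, url, is_allowed(ROBOTS_TXT, agent, url))
```

Because the group names only Spider, crawlers with other tokens are unaffected by this rule in the sketch above.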

Allow Spider Full Access

To explicitly allow Spider to crawl your entire site:

User-agent: Spider
Allow: /

Selective Access for Spider

To allow Spider but restrict certain directories:

User-agent: Spider
Disallow: /private/
Disallow: /api/
Disallow: /admin/
Allow: /
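
To see which paths the selective rules would open or close, the same rules can be evaluated locally. This is a minimal sketch using Python's standard urllib.robotparser; the helper check_paths and the sample paths are illustrative assumptions, not part of Spider. Python's parser applies the first matching rule in file order, whereas RFC 9309 prefers the longest match. With the rule order shown above, both approaches should agree.

```python
from urllib.robotparser import RobotFileParser

# The selective rules from this section.
SELECTIVE_RULES = """\
User-agent: Spider
Disallow: /private/
Disallow: /api/
Disallow: /admin/
Allow: /
"""


def check_paths(robots_txt: str, user_agent: str, paths: list[str]) -> dict[str, bool]:
    """Map each path to True (crawlable) or False (blocked) for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {path: parser.can_fetch(user_agent, path) for path in paths}


if __name__ == "__main__":
    # Sample paths are hypothetical; substitute paths from your own site.
    sample_paths = ["/private/report.html", "/api/v1/users", "/admin/", "/blog/hello", "/"]
    for path, allowed in check_paths(SELECTIVE_RULES, "Spider", sample_paths).items():
        print(f"{path}: {'allowed' if allowed else 'blocked'}")
```

Rules are prefix matches, so Disallow: /private/ covers /private/report.html but not a bare /private without the trailing slash.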

✓ Spider respects robots.txt directives.

Spider User-Agent String

The user-agent token for Spider is:

Spider
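
If you want to spot this token in request logs, a rough check on the User-Agent header might look like the sketch below. The full header that Spider sends is not documented here, so the function name has_spider_token and all sample strings are hypothetical. The word "spider" is also generic, so a match is a hint rather than proof of origin.

```python
import re

# Match "Spider" only as a standalone token, so that longer names which
# merely contain the word (for example "ExampleSpiderBot") do not match.
# Case-insensitive matching is an assumption made for this sketch.
SPIDER_TOKEN = re.compile(r"(?<![A-Za-z0-9])spider(?![A-Za-z0-9])", re.IGNORECASE)


def has_spider_token(user_agent_header: str) -> bool:
    """Return True if the header contains the standalone token 'Spider'."""
    return SPIDER_TOKEN.search(user_agent_header) is not None


if __name__ == "__main__":
    # All of these header values are made up for illustration.
    for header in ["Spider", "Spider/2.0", "ExampleSpiderBot/1.0", "Mozilla/5.0"]:
        print(repr(header), has_spider_token(header))
```

User-Agent headers are easy to spoof, so this kind of check is better suited to reporting than to access control.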
