๐ŸฆŠStackFox
DeepSeek logo

DeepSeekBot

Tier 3
๐Ÿ“š AI Trainingby DeepSeek โ†—ยท Since 2024

DeepSeek language model training. Known to ignore robots.txt.

User-Agent Token
DeepSeekBot
Respects robots.txt
No
Impact Level
Notable
10M+ users - Regional/specialized AI companies
Estimated Reach
Growing, especially in Asia

๐ŸŽฏWhat is DeepSeekBot?

DeepSeekBot is an AI training crawler operated by DeepSeek. Collects data to train AI models.

๐Ÿ“Š How Your Data is Used

Pre-training for DeepSeek models (DeepSeek-V2, Coder, etc).

๐Ÿšซ What Happens If You Block

Robots.txt blocking may be IGNORED. Similar to Bytespider concerns.

๐Ÿ’ก Good to Know

CONTROVERSIAL: Chinese company, may ignore robots.txt. Known for impressive cost-efficient models.

๐ŸขAbout DeepSeek

DeepSeek logo
DeepSeek

DeepSeek operates 1 known bot for AI model training. Their service reaches Growing, especially in Asia.

๐Ÿ›ก๏ธDeepSeekBot robots.txt Configuration

Control DeepSeekBot access to your website using robots.txt directives.

Block DeepSeekBot

To completely block DeepSeekBot from crawling your site:

User-agent: DeepSeekBot
Disallow: /

Allow DeepSeekBot Full Access

To explicitly allow DeepSeekBot to crawl your entire site:

User-agent: DeepSeekBot
Allow: /

Selective Access for DeepSeekBot

To allow DeepSeekBot but restrict certain directories:

User-agent: DeepSeekBot
Disallow: /private/
Disallow: /api/
Disallow: /admin/
Allow: /

โš  DeepSeekBot may not fully respect robots.txt. Consider additional server-side controls if needed.

DeepSeekBot User-Agent String

The user-agent token for DeepSeekBot is:

DeepSeekBot

Check Your Site's AI Policy

See if you're blocking or allowing DeepSeekBot and other AI crawlers.