DeepSeekBot
Tier 3DeepSeek language model training. Known to ignore robots.txt.
DeepSeekBot๐ฏWhat is DeepSeekBot?
DeepSeekBot is an AI training crawler operated by DeepSeek. Collects data to train AI models.
Pre-training for DeepSeek models (DeepSeek-V2, Coder, etc).
Robots.txt blocking may be IGNORED. Similar to Bytespider concerns.
CONTROVERSIAL: Chinese company, may ignore robots.txt. Known for impressive cost-efficient models.
๐ขAbout DeepSeek
DeepSeek operates 1 known bot for AI model training. Their service reaches Growing, especially in Asia.
๐ก๏ธDeepSeekBot robots.txt Configuration
Control DeepSeekBot access to your website using robots.txt directives.
Block DeepSeekBot
To completely block DeepSeekBot from crawling your site:
User-agent: DeepSeekBot
Disallow: /Allow DeepSeekBot Full Access
To explicitly allow DeepSeekBot to crawl your entire site:
User-agent: DeepSeekBot
Allow: /Selective Access for DeepSeekBot
To allow DeepSeekBot but restrict certain directories:
User-agent: DeepSeekBot
Disallow: /private/
Disallow: /api/
Disallow: /admin/
Allow: /โ DeepSeekBot may not fully respect robots.txt. Consider additional server-side controls if needed.
DeepSeekBot User-Agent String
The user-agent token for DeepSeekBot is:
DeepSeekBot๐Who Blocks DeepSeekBot?
Check Your Site's AI Policy
See if you're blocking or allowing DeepSeekBot and other AI crawlers.