Leipzig Corpora Collection
Tier 4Academic linguistic corpus builder from University of Leipzig.
LCC๐ฏWhat is Leipzig Corpora Collection?
Leipzig Corpora Collection is an AI training crawler operated by University of Leipzig. Collects data to train AI models.
Academic linguistic research and NLP model training.
Your content won't be included in linguistic research corpora.
Non-profit academic research. Data used for linguistic studies.
๐ขAbout University of Leipzig
University of Leipzig operates 1 known bot for AI model training.
๐ก๏ธLCC robots.txt Configuration
Control LCC access to your website using robots.txt directives.
Block LCC
To completely block Leipzig Corpora Collection from crawling your site:
User-agent: LCC
Disallow: /Allow LCC Full Access
To explicitly allow Leipzig Corpora Collection to crawl your entire site:
User-agent: LCC
Allow: /Selective Access for LCC
To allow Leipzig Corpora Collection but restrict certain directories:
User-agent: LCC
Disallow: /private/
Disallow: /api/
Disallow: /admin/
Allow: /โ Leipzig Corpora Collection respects robots.txt directives.
LCC User-Agent String
The user-agent token for Leipzig Corpora Collection is:
LCC๐Who Blocks Leipzig Corpora Collection?
Check Your Site's AI Policy
See if you're blocking or allowing Leipzig Corpora Collection and other AI crawlers.