🦊StackFox

llms.txt Directory

Websites that have implemented llms.txt to declare their AI permissions

📋What is llms.txt?

llms.txt is a proposed standard that allows website owners to declare permissions for AI systems.

Similar to robots.txt for search engines, llms.txt specifies whether AI models can use site content for training, RAG (retrieval-augmented generation), summarization, and more.

#3,449
mongodb.com
Found at /llms.txt
Unknown
#3,451
visualwebsiteoptimizer.com
Found at /llms.txt·OpenAI, Google...
Permissive
#3,468
trip.com
Found at /llms.txt
✓ TrainingPermissive
#3,485
iubenda.com
Found at /llms.txt·Google
Permissive
#3,523
bigcommerce.com
Found at /llms.txt
✓ TrainingPermissive
#3,526
x-cdn-static.com
Found at /llms.txt·Google, Microsoft...
Permissive
#3,534
tenki.jp
Found at /llms.txt·OpenAI, Anthropic...
Permissive
#3,539
ringcentral.com
Found at /llms.txt·Microsoft
Unknown
#3,540
nykaa.com
Found at /llms.txt·Amazon
Permissive
#3,548
prezi.com
Found at /llms.txt·Google, Microsoft
✓ TrainingPermissive
#3,549
mountain.com
Found at /llms.txt·Google
Permissive
#3,588
cafe24.com
Found at /llms.txt·Google, Meta
✓ TrainingPermissive
#3,610
okx.com
Found at /llms.txt
Unknown
#3,611
slashdot.org
Found at /llms.txt
Unknown
#3,617
woocommerce.com
Found at /llms.txt
Permissive
#3,670
pubnative.net
Found at /llms.txt
Permissive
#3,672
c6gj-static.net
Found at /llms.txt·Google, Microsoft...
Permissive
#3,695
kochava.com
Found at /llms.txt·OpenAI, Anthropic...
✓ TrainingPermissive
#3,739
auth0.com
Found at /llms.txt·Anthropic, Google...
✓ RAGPermissive
#3,949
allyononewgames.com
Found at /llms.txt
Unknown
#4,032
inmotionhosting.com
Found at /llms.txt
Unknown
#4,036
rudderstack.com
Found at /llms.txt
Unknown
#4,042
yg5sjx5kzy.com
Found at /llms.txt·Google, Microsoft...
Permissive
#4,055
made-in-china.com
Found at /llms.txt·Google
✓ TrainingPermissive
#4,101
webflow.com
Found at /llms.txt·Meta
✓ RAGPermissive
#4,159
ably.io
Found at /llms.txt·OpenAI, Anthropic...
Permissive
#4,262
nsw.gov.au
Found at /llms.txt
✓ TrainingPermissive
#4,295
weatherbug.net
Found at /llms.txt
Permissive
#4,459
kpmg.com
Found at /llms.txt·Microsoft
Permissive
#4,488
square.site
Found at /llms.txt·Google
Unknown
#4,523
postman.com
Found at /llms.txt·OpenAI, Google...
✓ TrainingPermissive
#4,529
ex.co
Found at /llms.txt·OpenAI, Anthropic...
Permissive
#4,582
threatlocker.com
Found at /llms.txt
✓ TrainingPermissive
#4,628
hootsuite.com
Found at /llms.txt
Unknown
#4,638
domainmarket.com
Found at /llms.txt
✓ TrainingPermissive
#4,713
pushwoosh.com
Found at /llms.txt
Permissive
#4,730
zoominfo.com
Found at /llms.txt·Microsoft
Selective
#4,760
guard.io
Found at /llms.txt·OpenAI, Google...
Permissive
#4,768
hola.org
Found at /llms.txt·Google, Microsoft...
Permissive
#4,789
redis.io
Found at /llms.txt
Permissive
#4,825
intercomcdn.com
Found at /llms.txt
Unknown
#4,852
supabase.co
Found at /llms.txt
Unknown
#4,901
smartsheet.com
Found at /llms.txt
Unknown
#4,910
clickup.com
Found at /llms.txt
✓ TrainingPermissive
#4,951
cyberark.com
Found at /llms.txt
Unknown
#4,958
traveloka.com
Found at /llms.txt
Unknown
#4,969
laracasts.com
Found at /llms.txt
Selective
#4,990
domaintools.com
Found at /llms.txt·Google, Meta...
✗ Training✗ RAGSelective
#4,996
zscaler.com
Found at /llms.txt·OpenAI, Google...
Permissive
#5,001
dynadot.com
Found at /llms.txt
Permissive

Showing 101-150 of 200 sites

Check any website's AI policy

See their llms.txt, robots.txt rules for AI bots, and more.

Check AI Policy