llms.txt Directory
Websites that have implemented llms.txt to declare their AI permissions
📋 What is llms.txt?
llms.txt is a proposed standard that allows website owners to declare permissions for AI systems.
Much as robots.txt addresses search engine crawlers, llms.txt states whether AI models may use a site's content for training, retrieval-augmented generation (RAG), summarization, and other purposes.
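Every entry in this directory was found at the root path /llms.txt, so checking a site comes down to requesting that one URL. The following is a minimal sketch in Python using only the standard library. The function names, the User-Agent string, and the choice of HTTPS are assumptions made for illustration; they are not part of the llms.txt proposal or of this directory's tooling.

```python
from typing import Optional
from urllib.request import Request, urlopen


def llms_txt_url(domain: str) -> str:
    """Build the root-level /llms.txt URL for a bare domain (HTTPS is assumed)."""
    return f"https://{domain.strip().strip('/')}/llms.txt"


def fetch_llms_txt(domain: str, timeout: float = 10.0) -> Optional[str]:
    """Return the file body as text, or None if the request fails."""
    request = Request(
        llms_txt_url(domain),
        headers={"User-Agent": "llms-txt-check/0.1"},  # hypothetical identifier
    )
    try:
        with urlopen(request, timeout=timeout) as response:
            return response.read().decode("utf-8", errors="replace")
    except OSError:
        # URLError, HTTPError, and socket timeouts all derive from OSError.
        return None


if __name__ == "__main__":
    body = fetch_llms_txt("stripe.com")
    print("no llms.txt retrieved" if body is None else body[:200])
```

A successful response does not by itself show that a site publishes a meaningful llms.txt: some servers answer unknown paths with an HTML error page and a 200 status, so the returned text still needs to be inspected.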
#43 digicert.com · Found at /llms.txt · Unknown
#83 adobe.com · Found at /llms.txt · ✓ RAG · Permissive
#107 opera.com · Found at /llms.txt · Permissive
#118 wordpress.com · Found at /llms.txt · Google, Meta... · ✓ Training · Permissive
#127 sentry.io · Found at /llms.txt · Unknown
#158 dropbox.com · Found at /llms.txt · Google, Microsoft · ✓ RAG · Permissive
#160 gravatar.com · Found at /llms.txt · OpenAI, Anthropic... · Permissive
#194 paypal.com · Found at /llms.txt · Google · Unknown
#196 kaspersky.com · Found at /llms.txt · ✓ Training · Permissive
#219 shopify.com · Found at /llms.txt · Unknown
#241 avast.com · Found at /llms.txt · Permissive
#265 salesforce.com · Found at /llms.txt · Google, Microsoft... · ✓ Training · ✓ RAG · Permissive
#274 sourceforge.net · Found at /llms.txt · OpenAI, Anthropic... · ✗ Training · ✓ RAG · Selective
#289 weather.com · Found at /llms.txt · Permissive
#335 stripe.com · Found at /llms.txt · ✓ Training · Permissive
#338 wyzecam.com · Found at /llms.txt · Google, Amazon · Permissive
#370 oxylabs.io · Found at /llms.txt · OpenAI, Anthropic... · ✓ Training · ✓ RAG · Permissive
#411 slack.com · Found at /llms.txt · Google, Amazon · Permissive
#466 nvidia.com · Found at /llms.txt · Unknown
#467 dailymotion.com · Found at /llms.txt · ✓ Training · Permissive
#499 plesk.com · Found at /llms.txt · Permissive
#512 dell.com · Found at /llms.txt · OpenAI, Microsoft · Unknown
#520 wp.com · Found at /llms.txt · Google, Meta... · ✓ Training · Permissive
#546 calendly.com · Found at /llms.txt · Google, Microsoft · ✓ Training · Permissive
#658 paloaltonetworks.com · Found at /llms.txt · Google · Unknown
#684 sophos.com · Found at /llms.txt · Microsoft · ✓ Training · Permissive
#689 dynatrace.com · Found at /llms.txt · Google, Microsoft... · Unknown
#699 trendmicro.com · Found at /llms.txt · ✓ Training · Permissive
#711 playstation.com · Found at /llms.txt · ✓ Training · Permissive
#715 target.com · Found at /llms.txt · Permissive
#717 braze.com · Found at /llms.txt · OpenAI, Anthropic... · Unknown
#733 okta.com · Found at /llms.txt · OpenAI, Google... · ✓ RAG · Permissive
#736 dreamhost.com · Found at /llms.txt · Google, Microsoft · Unknown
#741 intercom.io · Found at /llms.txt · Unknown
#770 onetrust.com · Found at /llms.txt · Unknown
#798 optimizely.com · Found at /llms.txt · Unknown
#879 mailchimp.com · Found at /llms.txt · Google, Meta... · ✓ Training · ✓ RAG · Permissive
#933 allaboutcookies.org · Found at /llms.txt · Google, Meta... · Selective
#957 prnewswire.com · Found at /llms.txt · Permissive
#990 cursor.sh · Found at /llms.txt · Amazon · Permissive
#1,000 datadoghq.com · Found at /llms.txt · OpenAI, Google... · Permissive
#1,021 box.com · Found at /llms.txt · Google, Microsoft · ✓ Training · Permissive
#1,044 singular.net · Found at /llms.txt · OpenAI, Anthropic... · Permissive
#1,070 clever.com · Found at /llms.txt · OpenAI, Google... · ✓ Training · Permissive
#1,074 qualtrics.com · Found at /llms.txt · Google · ✓ Training · Permissive
#1,175 kslawin.com · Found at /llms.txt · Unknown
#1,187 classlink.com · Found at /llms.txt · Google, Microsoft · ✓ Training · ✓ RAG · Permissive
#1,191 repubblica.it · Found at /llms.txt · Unknown
#1,194 typeform.com · Found at /llms.txt · Unknown
#1,197 netgear.com · Found at /llms.txt · ✓ Training · Permissive
Showing 1-50 of 200 sites