> Couldn't this be handled in robots.txt? That would require knowing all the use...

AnthonyMouse · on March 18, 2024

How are you proposing to distinguish between "AI" and "search engines"? Most of the search engines now have a summarizer at the top which is presumably LLM output, and search engines operate on the basis of ML in general.

xigoi · on March 19, 2024

> Most of the search engines now have a summarizer at the top which is presumably LLM output, and search engines operate on the basis of ML in general.

There is still a difference between scraping content for the purpose of searching it and training on it.

AnthonyMouse · on March 20, 2024

A search engine is an AI model that outputs search results. Creating the index is training it. There is no obvious principled way to distinguish them.