Yes and no. There is a file called, "robots.txt", which sets crawling limits for the site. Nothing stops crawlers from crawling past (unless account restrictions exist), but it also sets a legal standard. Many sites' contents are crawled or indexed because of this defacto standard.
That said, copyright, which is the actual claim here, is pretty cut and dry. The AI is digesting the copyrighted contents to form at least part of its language model. This legally means the language model is a derivative work, which means the AI is in violation of copyright laws.
Yes and no. There is a file called, "robots.txt", which sets crawling limits for the site. Nothing stops crawlers from crawling past (unless account restrictions exist), but it also sets a legal standard. Many sites' contents are crawled or indexed because of this defacto standard.
That said, copyright, which is the actual claim here, is pretty cut and dry. The AI is digesting the copyrighted contents to form at least part of its language model. This legally means the language model is a derivative work, which means the AI is in violation of copyright laws.
(post is archived)