396
Authors Are Furious After Finding Their Works on List of Books Used To Train AI
(www.themarysue.com)
This is a most excellent place for technology news and articles.
Here's an idea, legally force companies like OpenAI to rely on opt-in data, rather then build their entire company on stealing massive amounts of data. That includes requiring to retrain from scratch. Sam Altman was crying for regulations for scary AI, right?
Would search engines only be allowed to show search results for sources that had opted in? They "train" their search engine on public data too, after all.
They aren't reselling their information, they're linking you to the source which then the website decides what to do with your traffic. Which they usually want your traffic, that's the point of a public site.
That's like trying to say it's bad to point to where a book store is so someone can buy from it. Whereas the LLM is stealing from that bookstore and selling it to you in a back alley.