this post was submitted on 03 Nov 2025
167 points (100.0% liked)
Technology
40635 readers
373 users here now
A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.
Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.
Subcommunities on Beehaw:
This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.
founded 3 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Imagine you have a magic box that can generate any video you want. Some people ask it to generate fan fiction-like videos, some ask it to generate meme-like videos, and a whole lot of people ask it to generate porn.
Then there's a few people that ask it to generate videos using trademarked and copyrighted stuff. It does what the user asks because there's no way for it to know what is and isn't copyrighted. What is and isn't parody or protected fair use.
It's just a magic box that generates videos... Whatever the human asks for.
This makes some people and companies very, very upset. They sue the maker of the magic box, saying it's copying their works. They start PR campaigns, painting the magic box in a bad light. They might even use the magic box quite a lot themselves but it doesn't matter. To them, the magic box is pure evil; indirectly preventing them from gaining more profit... Somehow. Just like Sony was sued for making a machine that let people copy whatever videos they wanted (https://en.wikipedia.org/wiki/Sony_Corp._of_America_v._Universal_City_Studios%2C_Inc.).
Before long, other companies make their own magic boxes and then, every day people get access to their own, personal magic boxes that no one can see the output from unless they share.
Why is this different from the Sony vs Universal situation? The AI magic box is actually worse at copying videos than a VCR.
When a person copies—and then distributes—a movie do we say the maker of the VCR/DVD burner/computer is at fault for allowing this to happen? No. It's the person that distributed the copyrighted work.
Yes, this is in fact a good argument for not banning AI.
It's not an argument for not holding companies legally accountable for using copyrighted material to do it.
These suits aren't actually equivalent to Sony v UCS, they're equivalent to someone suing a bootleg video company.
If you believe AI companies should NOT be allowed to train AI with copyrighted works you should stop using Internet search engines. Because the same rules that allow Google to train their search with everyone's copyrighted websites are what allow the AI companies to train their models.
Every day, Google and others download huge swaths of the Internet directly into their servers and nobody bats an eye. An AI company does the same thing and now people say that's copyright infringement.
What the fuck! I don't get it. It's the exact same thing. Why is an AI company doing that any different‽
It'd be one thing if people were bitching about just the output of AI models but they're not. They're bitching about the ingress step!
The day we ban ingress of copyrighted works into whatever TF people want is the day the Internet stops working.
My comment right here is copyrighted. So is yours! I didn't ask your permission before my Lemmy client downloaded it. I don't need to ask your permission to use your comment however TF I want until I distribute it. That's how the law works. That's how it's always worked.
The DMCA also protects the sites that host Lemmy instances from copyright lawsuits. Because without that, they'd be guilty of distribution of copyrighted works without the owner's permission every damned day.
People who hate AI are supporting an argument that the movie and music studios made in the 90s: That "downloading is theft." It is not! In fact, because that is not theft, we're all able to enjoy the Internet every day.
Ever since the Berne convention, literally everything is copyrighted. Everything.
This is false, by omission. Many of the AI companies have been downloading content through means other than scraping, such as bittorrent, to access and compile copyrighted data that is not publicly scrape-able. That includes Meta, OpenAI, and Google.
That is also false. Just because you don't understand the legal distinction between scraping content to summarize in order to direct people to a site (there was already a lawsuit against Google that established this, as well as its boundaries), versus scraping content to generate a replacement that obviates the original content, doesn't mean the law doesn't understand it.
And none of this matters, because AI companies aren't just reading content, they're taking it and using it for commercial purposes.
Perhaps you are unaware, but (at least in the US) while it is legal for you to view a video on YouTube, if you download it for offline use that would constitute copyright infringement if the owner objects. The video being public does not grant anyone and everyone the right to use it however they wish. Ditto for something like making an mp3 of a song on Spotify using Audacity.
First off, I do not hate AI, I use it myself (locally-run). My issue is with AI companies using it to generate profit at the expense of the actual creators whose art AI companies are trying to replace (i.e. not directing people to it, like search results).
Secondly, no one is arguing that it is theft, they are arguing that it is copyright infringement, which is what all of us are also subject to under the DMCA. So we're actually arguing that AI companies should be held to the same standard that we are.
Also, note that AI companies have argued in court (in the case brought by Steven King et al) that their use of copyrighted material shouldn't fall under DMCA at all (i.e. arguing that it's not about Fair Use), because their argument is that AI training is not the 'intended use' of the source material, so this is not eating into that commercial use. That argument leaves copyright infringement liability intact for the rest of us, while solely exempting them from liability. No thanks.
Luckily, them arguing they're apart and separate from Fair Use also means that this can be rejected without affecting Fair Use! Double-win!
Now imagine it's not a magic box. Imagine it is a computer program, written with intent, that was intentionally fed copyrighted material so it could make those things people asked for.
Giant companies operating outside the law at "the cost of doing business" built plagiarism machines off the life's work of thousands of people so that horny weirdos could jerk it to Pikachu with tits.
To be fair, a horny weirdo would still probably pay a human artist for a Pikachu with tits. Image generators probably wouldn't get the nuance exactly right. An image generator is better for like, Azula getting her toes sucked by a Dai Li agent or something. Probably. I'm only speculating
as a totally off-topic aside, I find that people tend to dislike AI in areas they are deeply familiar with.
Silence, AI defender
There's one difference you ignored: to copy a video with a VCR, the user needs to supply the copyrighted material. I'm sure the manufacturers would've been in more legal trouble if they shipped VCRs packed with pirated content.
You obviously never used a VCR to record a live broadcast before. When people were using a VCR to record things, that's what they were doing 99% of the time. Nobody had two VCRs hooked up to each other to copy tapes. That was a super rare situation that you'd typically only find in professional studios.
Mate, what?
It is the end user who receives the broadcast and feeds it into the VCR, is it not?
Your VCR is hooked up to your TV (coaxial into the VCR and from the VCR into the TV). Just before the broadcast starts, you press the record button (which was often mechanically linked to the play button). When it's done, you press stop. Then you rewind and can play it back later.
The end user is sort of passively recording it. The broadcast happens regardless of the user's or the VCR's presence.
Well derailed, succesfully avoided going anywhere near the actual argument.
VCRs do not contain such copyrighted material.
And you need a blank tape to copy to, the makers of which did have to pay a charge to content firms to cover piracy.
In this tortured analogy the blanks are also included in the ai firm product, so yes they would have had to pay
That "blank tape charge" was only implemented in Canada. Not the US.
This is easily top 5 dumbest takes ever about AI.
The difference between Gen AI and Sony v. Universal feels pretty substantial to me: VCRs did not require manufacturers to use any copyrighted material to develop and manufacture them. They only could potentially infringe copyright if the user captured a copyrighted signal and used it for commercial purposes.
If you read the title and the description of the article, it admittedly does make it sound like the studios are taking issue with copyrighted IPs being able to be generated. But the first paragraph of the body states that the problem is actually the usage of copyrighted works as training inputs:
You compare Gen AI to “magic boxes”… but they’re not magic. They have to get their “knowledge” from somewhere. These AI tools are using many patterns far more subtle and complex than humans can recognize, and they aren’t storing the training inputs using them— it’s just used to strengthen connections within the neural net (afaik, as I’m not an ML developer). I think that’s why it’s so unregulated: how to you prove they used your content? And even so, they aren’t storing or outputting it directly. Could it fall under fair use?
Still, using copyrighted information in the creation of an invention has historically been considered infringement (I may not be using the correct terminology in this comparison, since maybe it’s more relevant to patent law), even if it didn’t end up in the invention— in software, for example, reverse engineers can’t legally rely on leaked source code to guide their development.
Also, using a VCR for personal use wouldn’t be a problem, which I’d say was a prominent use-case. And using it commercially wouldn’t involve any copyrighted material, unless the owner inputs any. Those aren’t the case with Gen AI: regardless of what you generate, non-commercially or commercially, the neural network was built using a majority of unauthorized, copyrighted content.
That said, copyright law functions largely to protect corporations anyways— an individual infringing the copyright of a corporation for personal or non-commercial use causes very little harm, but can usually be challenged and stopped. A corporation infringing copyright of an individual often can’t be stopped. Most individuals can’t even afford the legal fees, anyways.
For that reason, I’m glad to see companies taking legal action against OpenAI and other megacorps which are (IMO) infringing the copyright of individuals and corporations at this kind of a massive scale. Individuals certainly can’t stop it, but corporations may be able to get some justice or encourage more to be done to safeguard the technology.
Much damage is already done, though. E-waste and energy usage from machine learning have skyrocketed. Websites struggle to fight crawlers and lock down their APIs, both harming legit users. Non-consensual AI pornography is widely accessible. Many apps encourage people, including youth, to forgo genuine connection, both platonic and romantic, in exchange for AI chatbots. Also LLMs are fantastic misinformation machines. And we have automated arts, arguably the most “human” thing we can do, and put many artists out of work in doing so.
Whether the lack of safety guards is because of government incompetence, corruption, or is inherent to free-market capitalism, I’m not sure. Probably all of those reasons.
In summary, I disagree with you. I think companies training AI with unauthorized material are at fault. And personally, I think the entire AI industry as it exists currently is unethical.