Jaded

joined 2 years ago
[–] Jaded@lemmy.dbzer0.com -5 points 2 years ago (1 children)

To avoid being sued? The Internet Archive shouldn't be acting like a new-age LimeWire. I hate record companies as much as the next guy, but I use torrents and youtube-dl. No need for the Internet Archive to be offering the service at such risk.

They hold a lot of important stuff, I just don't want open season to be declared on suing them. Pick your battles kind of moment.

[–] Jaded@lemmy.dbzer0.com 21 points 2 years ago* (last edited 2 years ago) (2 children)

https://en.m.wikipedia.org/wiki/Julian_Assange

I highly suggest that anyone not knowledgeable on the subject quickly read his wiki to get an idea of what he leaked.

We wouldn't know his name if the US had kept its nose clean. He isn't the bad guy; the country drone striking and killing civilians while illegally spying on its citizens is. State secrets don't deserve to be kept secret if they're literally poison and corruption.

[–] Jaded@lemmy.dbzer0.com 1 points 2 years ago

Ignoring the fact that training an AI is insanely transformative and definitely fair use, people would not get any kind of pay. The data is owned by websites and corporations.

If AI training were to be highly restricted, Microsoft and Google would just pay each other for the data and pay the few websites they don't own (Stack, GitHub, Reddit, Shutterstock, etc). A bit of money would go to publishing houses and record companies, not enough for the actual artist to get anything over a few dollars.

And they would happily do it, since they would be the only players in the game and could easily overcharge for a product that is eventually going to replace 30% of our workforce.

Your emotional, short-sighted response kills all open source and literally gives our economy to Google and Microsoft. They become the sole owners of AI tech. Don't be stupid, please. They want you to be mad; it literally only helps them.

[–] Jaded@lemmy.dbzer0.com 2 points 2 years ago

Check out Fusion 360. There is a free version for personal use, but you have to search for it on their website since they hide it.

[–] Jaded@lemmy.dbzer0.com 11 points 2 years ago

The purchased service is internet. I should be able to use it how I want, including supplying it to other devices through my phone. This is the equivalent of Netflix not letting us cast onto TVs.

Not sure what you are defending here, this is clearly unethical and gross corporate behavior.

[–] Jaded@lemmy.dbzer0.com 1 points 2 years ago

There is no open source future if all we have is the blender and nothing else

[–] Jaded@lemmy.dbzer0.com 0 points 2 years ago

It depends on what kind of AI, but no, giving sources and building with just volunteer data is just not possible at our current technological level. I'm mostly talking about LLMs because that's what's really at stake, and they train on huge amounts of data. Like ALL of Stack, GitHub, Reddit, etc. Just fine tuning them on a consumer level takes more than 50,000 question and answer pairs, and that's just one tiny superficial layer that's added on top.

Grammarly should absolutely add an opt out option to gain consumers' trust, but forcing the whole industry to do so is a disaster.

If individuals can opt out, so will websites to "protect their users". Then we get data hoarding, where Stack and GitHub opt out of all open source options but sell the data to the only ones that can now afford to build AIs, Microsoft and Google. It won't include the data of certain individuals, the few that opt out, but I'm guessing eventually the opt in will be written directly into the terms of service of websites: you opt in or you fuck off.

How does anyone except corporations benefit from this kind of circus? In 10 years, AI will be doing most office work. Google isn't dumb and wants that profit. They and OpenAI have all the data, and they can strong-arm or buy what they are missing. Restricting and legislating only widens their moat.

[–] Jaded@lemmy.dbzer0.com 2 points 2 years ago* (last edited 2 years ago) (2 children)

Most of the data is scraped; it's not up to the website. You can't give a list of citations since it isn't a search engine. It doesn't know where the information comes from, and it's highly transformative: it melds information from hundreds if not thousands of different sources.

If it worked only with volunteer work, there simply would not be enough data.

Any law restricting data use in AI is only going to benefit corporations; there isn't a solution for individual content creators. You can't pay them for the drop in the bucket they add, the logistics are insane. You can let them opt out, but then you need to do the same for whole websites, which leads to a corporate hellscape where three companies own our whole economy since they are the only ones who can train AIs.

[–] Jaded@lemmy.dbzer0.com 1 points 2 years ago (2 children)

What happens when every corporation and website closes their doors to AI? There isn't any open source if we can't use scraped information from Stack Overflow, GitHub, Reddit, etc.

Sure, some users will opt out, but most won't. Every single website will restrict, though, and then they will sell it to Google and Microsoft, who will be the only companies able to build AIs.

[–] Jaded@lemmy.dbzer0.com -2 points 2 years ago* (last edited 2 years ago) (9 children)

Models need vast amounts of data. Paying individual users isn't feasible, and like you said, most of it can be scraped.

The only way I see this working is if scraped content is a no-go and you pay the website, publishing house, record company, etc., which kills any open source solution and doesn't really help any of the users or creators that much. It also paves the way for certain companies to own a lot of our economy as we move towards an AI-driven society.

It's definitely a hot mess but the way I see it, the more restrictive we are with it, the more gross monopolies we create for no real gains.

[–] Jaded@lemmy.dbzer0.com 10 points 2 years ago (3 children)

It's because certain companies are stirring the pot and manipulating. They want people mad so they can put restrictions on training AI, to stifle the open source scene.
