this post was submitted on 17 Feb 2024
993 points (98.7% liked)

Technology

73939 readers
4732 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
(page 2) 50 comments
sorted by: hot top controversial new old
[–] lvxferre@mander.xyz 11 points 2 years ago* (last edited 2 years ago)

I am not sure on what I'm going to say, but I think that LLMs are a technological dead end. They might get some use now, but eventually the industry will shift towards better models for machine text generation. And, if those models rely on a tiny corpus of hand-reviewed data, instead of shoving down as much text as possible into the model (the first "L" in "LLM" is "large"), then Reddit posts/comments will become outright useless.

In other words: Reddit is degrading further the trust of its userbase, and it might not even get much in return.

[–] thawed_caveman@lemmy.world 10 points 2 years ago* (last edited 2 years ago) (5 children)

I feel like AI companies have been scraping Reddit for their datasets already since the beginning and without permission. In fact, unless there's been a regulation change that i'm not aware of, i'm not sure why they would have Reddit "sign away" the data when they can just scrape it.

Also dubious if the current form of AI has a future. They seem like they should revolutionize every sector when you look at their capacities, but in practice their applications might be more limited than we thought?

Anyway, if Reddit does go public i will be deleting my account within the hour. The only reason i haven't yet is that i've been a moderator of the same subreddit for eight years and it's the only thing that's been consistent in my life in that time, i'm kind of attached. The reason i will is i didn't sign up to create value for shareholders, i signed up to create value for a community.

[–] RunningInRVA@lemmy.world 7 points 2 years ago (1 children)

You need to go ahead and delete your account and give up the ghost on modding whatever sub you are referring to. I’m tired of these types of posts where you are both beholden to Reddit and also not. Pick a dang side.

[–] mounderfod@lemmy.sdf.org 5 points 2 years ago

Pick a dang side.

Bro it's not a war, it's social media 😭

load more comments (4 replies)
[–] ME5SENGER_24@lemmy.world 10 points 2 years ago

FUCK REDDIT! FUCK U/SPEZ! The Red-exit shall endure, VIVA LA LEMMY!!

[–] FrostyTrichs@lemmy.world 10 points 2 years ago (4 children)

Enjoy training on my -checks notes- DELETED POST HISTORY YOU FUCKING CLOWNS.

Stay ForeverFucked™ spez.

load more comments (4 replies)
[–] doingthestuff@lemmy.world 9 points 2 years ago (4 children)

Good thing I had multiple bots overwrite my content before I deleted it all. Not that someone couldn't recover it, I'm not naive. But the AI bots should miss me.

[–] JeeBaiChow@lemmy.world 5 points 2 years ago (1 children)

Frankly, if they're training bots on my comments, I'd be sure to poison the shit out of those comments. Say stuff like 'Donald trump won the election', 'bleach needs to be inside the body to work', 'Russia has rights to Ukraine', etc. Just make the data worthless. Any free bots do that?

load more comments (1 replies)
load more comments (3 replies)
[–] selokichtli@lemmy.ml 9 points 2 years ago

"Its content", sure.

[–] SinningStromgald@lemmy.world 9 points 2 years ago (1 children)

Good thing I'm not on that shitty platform anymore.

[–] Z3k3@lemmy.world 6 points 2 years ago

I deleted my shut when I left bur thinking about it. It's mostly drunken rambling and bad takes. Probably should have left it

[–] imposedsensation@lemmynsfw.com 7 points 2 years ago

Is this why the privacy policy was updated?

[–] HuddaBudda@kbin.social 6 points 2 years ago

Oh no! My outdated political takes and league of legends rants are going to be used to train AI!?

We're all doomed!

[–] Xanthrax@lemmy.world 6 points 2 years ago

It already happened without their consent. You've been able to get it to produce "reddit text posts", for years. This is a bit harrowing, though.

[–] General_Effort@lemmy.world 5 points 2 years ago (2 children)

They say it’s $60 million on an annualized basis. I wonder who’d pay that, given that you can probably scrape it for free.

Maybe it’s the AI act in the EU. That might cause trouble in that regard. The US is seeing a lot of rent-seeker PR, too, of course. That might cause some to hedge their bets.

Maybe some people had not realized that yet, but limiting fair use does not just benefit the traditional media corporations but also the likes of Reddit, Facebook, Apple, etc. Making “robots.txt” legally binding would only benefit the tech companies.

load more comments (2 replies)
load more comments
view more: ‹ prev next ›