this post was submitted on 05 Jul 2023
You realize that "conversation" is fake, right? There is no increased load on Twitter, Reddit, or other web services due to "AI data scraping". That was made up to distract from the material causes of Twitter's failure, namely:
Big tech companies that run search engines already have a copy of all public Web pages, which they use for search indexing. They don't need to make a second copy for AI training; they can just use the same one.
Google can train Bard with the same copy of the public Web that they use to create Google Search; same with Microsoft, Baidu, or any other big company that runs a search engine.
And for everyone else, there's Common Crawl.
“Fake” from the side of data load, sure, I can see that, but there’s plenty of interest in trying to stave off the “dead internet” by incorporating new systems where bots and AI generated content aren’t profitable. That’s more what I was referring to.