this post was submitted on 29 Jul 2025
27 points (70.8% liked)

Technology

74055 readers
3813 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

It’s interesting how almost every model released in 2025 has specifically targeting coding. That focus has clearly been paying off: these coding models are getting really good now.

I still contend that this sort of task is uniquely positioned to show off LLMs. The idea that they'll turn into agents that can do real-world tasks remains a fantasy. Despite how impressive this is, they're losing money and have no real path to profitability.

Look up Ed Zitron's newsletter and podcast for more info on why the industry is a bubble. I'm genuinely impressed with this specific example, but our economy is gonna suffer when the bubble bursts.

you are viewing a single comment's thread
view the rest of the comments
[–] some_guy@lemmy.sdf.org 4 points 2 weeks ago

A demo that was open to the public (as in, not stage managed) where people could have the "agents" perform complex tasks without failing on a regular basis. Large training models are notoriously bad at anything they haven't been trained to do. They're worlds away from being able to interpret a new situation and "figure it out."