this post was submitted on 29 Jul 2025
27 points (70.8% liked)

Technology

73534 readers
2433 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

It’s interesting how almost every model released in 2025 has specifically targeting coding. That focus has clearly been paying off: these coding models are getting really good now.

I still contend that this sort of task is uniquely positioned to show off LLMs. The idea that they'll turn into agents that can do real-world tasks remains a fantasy. Despite how impressive this is, they're losing money and have no real path to profitability.

Look up Ed Zitron's newsletter and podcast for more info on why the industry is a bubble. I'm genuinely impressed with this specific example, but our economy is gonna suffer when the bubble bursts.

top 6 comments
sorted by: hot top controversial new old
[–] Gemini24601@lemmy.world 11 points 4 days ago (1 children)

So can my 12 year old ThinkPad. You do not need brand new hardware to run ai models, average hardware can do the same.

[–] panda_abyss@lemmy.ca 2 points 4 days ago

Which is objectively awesome

We’ve come a long way in 3 years.

[–] danzabia 5 points 4 days ago (1 children)

I still contend that this sort of task is uniquely positioned to show off LLMs. The idea that they’ll turn into agents that can do real-world tasks remains a fantasy. Despite how impressive this is, they’re losing money and have no real path to profitability.

I'm curious. What would it take to change your mind? I'd like to check in with you in two years to see what you think then.

[–] some_guy@lemmy.sdf.org 4 points 4 days ago

A demo that was open to the public (as in, not stage managed) where people could have the "agents" perform complex tasks without failing on a regular basis. Large training models are notoriously bad at anything they haven't been trained to do. They're worlds away from being able to interpret a new situation and "figure it out."

[–] GreenKnight23@lemmy.world 2 points 4 days ago (1 children)

but can it do it in assembly?

[–] some_guy@lemmy.sdf.org 4 points 4 days ago

That would impress me so much I'd die and come back to life.