this post was submitted on 02 Feb 2026
344 points (97.0% liked)

Technology

83295 readers
4581 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] WanderingThoughts@europe.pub 185 points 1 month ago* (last edited 1 month ago) (26 children)

Only until AI investor money dries up and vibe coding gets very expensive quickly. Kinda how Uber isn't way cheaper than a taxi now.

[–] percent 6 points 1 month ago (16 children)

I wouldn't be surprised if that's only a temporary problem - if it becomes one at all. People are quickly discovering ways to use LLMs more effectively, and open source models are starting to become competitive with commercial models. If we can continue finding ways to get more out of smaller, open-source models, then maybe we'll be able to run them on consumer or prosumer-grade hardware.

GPUs and TPUs have also been improving their energy efficiency. There seems to be a big commercial focus on that too, as energy availability is quickly becoming a bottleneck.

[–] XLE@piefed.social 4 points 1 month ago (3 children)

Can you cite some sources on the increased efficiency? Also, can you link to these lower priced, efficient (implied consumer grade) GPUs and TPUs?

[–] percent 2 points 1 month ago (1 children)

Oh, sorry, I didn't mean to imply that consumer-grade hardware has gotten more efficient. I wouldn't really know about that, but I assume most of the focus is on data centers.

Those were two separate thoughts:

  1. Models are getting better, and tooling built around them are getting better, so hopefully we can get to a point where small models (capable of running on consumer-grade hardware) become much more useful.
  2. Some modern data center GPUs and TPUs compute more per watt-hour than previous generations.
[–] XLE@piefed.social 2 points 1 month ago (1 children)

Can you provide evidence the "more efficient" models are actually more efficient for vibe coding? Results would be the best measure.

It also seems like costs for these models are increasing, and companies like Cursor had to stoop to offering people services below cost (before pulling the rug out from them).

[–] percent 1 points 1 month ago* (last edited 1 month ago)

Can you provide evidence the "more efficient" models are actually more efficient for vibe coding? Results would be the best measure.

Did I claim that? If so, then maybe I worded something poorly, because that's wrong.

My hope is that as models, tooling, and practices evolve, small models will be (future tense) effective enough to use productively so we won't need expensive commercial models.

To clarify some things:

  • I'm mostly not talking about vibe coding. Vibe coding might be okay for quickly exploring or (in)validating some concept/idea, but they tend to make things brittle and pile up a lot of tech debt if you let them.
  • I don't think "more efficient" (in terms of energy and pricing) models are more efficient for work. I haven't measured it, but the smaller/"dumber" models tend to require more cycles before they reach their goals, as they have to debug their code more along the way. However, with the right workflow (using subagents, etc.), you can often still reach the goals with smaller models.

There's a difference between efficiency and effectiveness. The hardware is becoming more efficient, while models and tooling are becoming more effective. The tooling/techniques to use LLMs more effectively also tend to burn a LOT of tokens.

TL;DR:

  • Hardware is getting more efficient.
  • Models, tools, and techniques are getting more effective.
load more comments (1 replies)
load more comments (13 replies)
load more comments (22 replies)