this post was submitted on 19 Dec 2025
177 points (99.4% liked)
Programming
24097 readers
182 users here now
Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!
Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.
Hope you enjoy the instance!
Rules
Rules
- Follow the programming.dev instance rules
- Keep content related to programming in some way
- If you're posting long videos try to add in some form of tldr for those who don't want to watch videos
Wormhole
Follow the wormhole through a path of communities !webdev@programming.dev
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Bad article, the number of issues aren't normalized to anything other than PRs. I expect AI authorized PRs to be somewhat bigger than a fully human authorized one.
Ideally they'd compare time to write + time to fix. My experience is that if you use test driven development, LLM isn't too bad. No worse than an intern.
I think it comes down to who is using the LLM. I had a junior dev once "presumably" AI gen a ton of code (broken trash). Then to fix it, they wrapped each function in a try catch block that dropped the error. Unit tests were mocked out to the extent they didn't test anything.
When I use an LLM, I have tests and hard constraints on the LLM. It isn't good enough to do everything, but it can generate about 80% of a simple app
For LLM generated code, it can also take a whole to read and understand. When I write code myself, I understand the intention, architecture, and so on. Machine written code is very different. I need to understand how it works. There’s often extraneous stuff in there or weird patterns.
I find that it basically can't do decent architecture. My last attempt to use it ended with it using casbin, but then rewriting it's own authorization framework and trying to use both at the same time 😶.
I think there is a lot of power here, but it needs very heavy guidance and handholding to do it well. Otherwise it makes very stupid intern level decisions
A friend has had good results using AIDD as an agent framework. It’s basically a built in project/product/scrum master that creates tickets and with that constraints.
Have you tried something like this?