this post was submitted on 20 Jul 2025
18 points (100.0% liked)

Hacker News

2664 readers
238 users here now

Posts from the RSS Feed of HackerNews.

The feed sometimes contains ads and posts that have been removed by the mod team at HN.

founded 1 year ago
MODERATORS
all 8 comments
sorted by: hot top controversial new old
[–] Xulai@mander.xyz 9 points 2 months ago

As someone who works in tech, currently testing AI integration into healthcare EHR- the current state of AI is simply not safe for anything outside transcription- and even that is error prone without strict re-reading (not scanning!) for error correction.

The errors can be subtle but life threatening. I highly recommended against integrating it - but the most lazy providers were already using AI illegally for their notes so this was seen as a middle road.

Medical care and provider training in the USA is not ok right now, and getting worse. AI and misinformation is accelerating the decline.

[–] Outwit1294@lemmy.today 3 points 2 months ago

Interesting. I have never seen the economic side of it being discussed outside of nvidia stock prices.

[–] hansolo@lemmy.today 3 points 2 months ago* (last edited 2 months ago) (1 children)

TL;DR: Three Hard Truths About AI Agents After building 12+ production systems, here's what I've learned: -Error rates compound exponentially in multi-step workflows. 95% reliability per step = 36% success over 20 steps. Production needs 99.9%+. Context windows create quadratic token costs. -Long conversations become prohibitively expensive at scale. -The real challenge isn't AI capabilities, it's designing tools and feedback systems that agents can actually use effectively.

The TL;DR of the TL;DR is compounding expensive, error-prone results.

[–] jbloggs777@discuss.tchncs.de 3 points 2 months ago* (last edited 2 months ago) (1 children)

It sounds like one should be building deliberate AI workflows with extra checks (automated or human in the loop) that make careful and cost efficient incremental progress toward a measurable goal.

Sounds like hard work... when we could just build 1,000,000 MCP servers instead. (raises pinkie to corner of mouth)

[–] Dadifer@lemmy.world 3 points 2 months ago (1 children)

I keep having the same question: would it benefit to have a separate agent whose job was to error-check the first agent?

[–] scribbler@lemmy.world 2 points 2 months ago

The three stooges didn't seem any less likely to get into trouble despite their strength in numbers

[–] PattyMcB@lemmy.world 1 points 2 months ago

Also... how many security vulnerabilities have those agents introduced?