this post was submitted on 15 Dec 2025
270 points (99.3% liked)

ChatGPT

716 readers
9 users here now

Welcome to the ChatGPT community! This is a place for discussions, questions, and interactions with ChatGPT and its capabilities.

General discussions about ChatGPT, its usage, tips, and related topics are welcome. However, for technical support, bug reports, or feature requests, please direct them to the appropriate channels.

!chatgpt@lemdro.id

Rules

  1. Stay on topic: All posts should be related to ChatGPT, its usage, and relevant discussions.
  2. No support questions/bug reports: Please refrain from posting individual support questions or bug reports. This community is focused on general discussions rather than providing technical assistance.
  3. Describe examples: When discussing or sharing examples of ChatGPT interactions, please provide proper context and explanations to facilitate meaningful discussions.
  4. No self-promotion: Avoid excessive self-promotion, spamming, or advertising of external products or services.
  5. No inappropriate content: Do not post or request explicit, offensive, or inappropriate content. Keep the discussions respectful and inclusive.
  6. No personal information: Do not share personal information, including real names, contact details, or any sensitive data.
  7. No harmful instructions: Do not provide or request instructions for harmful activities, illegal actions, or unethical behaviour.
  8. No solicitation: Do not solicit or engage in any form of solicitation, including but not limited to commercial, political, or donation requests.
  9. No unauthorized use: Do not use ChatGPT to attempt unauthorized access, hacking, or any illegal activities.
  10. Follow OpenAI usage policy: Adhere to the OpenAI platform usage policy and terms of service.

Thank you for being a part of the ChatGPT community and adhering to these rules!

founded 2 years ago
MODERATORS
top 46 comments
sorted by: hot top controversial new old
[–] Zachariah@lemmy.world 110 points 5 days ago (1 children)

… tests from earlier this year found that AI agents failed to complete tasks up to 70% of the time, making them almost entirely redundant as a workforce replacement tool. At best, they're a way for skilled employees to be more productive and save time on low-level tasks, but those tasks were already being handed off to lower-level employees. Having an AI do it and fail half the time isn't exactly a winning alternative.

[–] mr_account@lemmy.world 45 points 5 days ago (3 children)

"AI agents failed to complete tasks up to 70% of the time."

"Having AI do it and fail half the time"

Did AI write this article too? Fails again at basic math

[–] Psaldorn@lemmy.world 19 points 5 days ago* (last edited 5 days ago)

50% is up to 70%.

Probably the are different use cases and some fail more than others.

Shit, there are more that 2 copilots at Microsoft alone

[–] Saledovil@sh.itjust.works 3 points 4 days ago

Could be 70% overall, but only half of the tasks it's good at.

[–] Lon3star@lemmy.world 58 points 5 days ago (3 children)

Copilots numbers likely that high merely because it rides along with Office365... I tried using it a few times, and was completely useless. Even failed at sorting a spreadsheet with a few parameters

[–] Jhex@lemmy.world 14 points 5 days ago* (last edited 5 days ago) (1 children)

I wrote a page long documentation on a project. I asked Copilot to "format it to look nice but do not change a word"... it told me how to make some headings bold (would not do it itself) and what not... that's the "assist" I got

I couldn't even copy/paste the format since it's reply window does not apply and the text it provided was interlaced with its own stupid comments letting me know bold headings are more visible than regular font

I get better results just bouncing ideas off my cats

[–] ikidd@lemmy.world 6 points 5 days ago* (last edited 5 days ago) (1 children)

That was my experience. Wife had work telling her to use it, she asked me for help. I tried to get it to do things and all it would do is suggest stuff that we both knew perfectly well how to do with shortcuts. As for anything complex like have a chat and generate a document: fuck no. Might as well go to chat-jippity and copypasta it's result and format it yourself. Utter waste of time. I don't see why it's there, I can't find a use-case.

[–] Jhex@lemmy.world 4 points 5 days ago

I work with sensitive data... so I often grab a real message, gut all the PHI and refill with some fake data.

I had a project where I needed to do a lot of these, so I got DeepSeek to give me a list of superhero "real names", DOBs, gender and a few other fake things so I could automate filling these messages with fake data. This is the most success I have had with AI and even then it messed up (minimally) thinking for some reason that Wanda was a guy hahahaha

[–] Tollana1234567@lemmy.today 15 points 5 days ago (1 children)

i think MS is just putting AI into everything, just for the sake of it, they dont really care if its useful or not at this point. they just need to buy time to soften the blow when the bubble bursts.

[–] Rooster326@programming.dev 5 points 5 days ago (1 children)

We put it in everything, and bet all of our money on it to "soften the blow"

[–] swagmoney@lemmy.ca 2 points 5 days ago

yeah it really feels like incompetence rather than a strategy to weather the bubble pop :p

[–] frizzo@piefed.social 0 points 5 days ago* (last edited 5 days ago) (3 children)

Did you try asking copilot tips on how to use it properly? Edit* this was a joke. Ai is garbageware that nobody wants, just like ads and capitalism for capital sake. Now untwist your panties.

[–] confusedwiseman@lemmy.dbzer0.com 19 points 5 days ago (1 children)

Yes, I did. Everything I tried on copilot wanted me to upload corporate data to the cloud. (Yeah, NO). It told me it could help with my email…if I uploaded them individually. (Still bad practices here and breaking corp policy).

I expect LLMs should be really good pattern driven activity, but I’ve yet to figure out how to make this useful.

I tried to use a local LLM to summarize outline and discuss my *.md notes for annual review. It sucked at it if it didn’t completely crash the model. It couldn’t even provide a unique list of all tags in the files. (It took me about 30min manually). I thought that it would be good at that. I would have been better off spending to learn find | grep commands or spent time learning python.

I’m still searching, but maybe one day I’ll figure out a use for these.

[–] Rhaedas@fedia.io 10 points 5 days ago

We use it during down time for amusement to see how badly it can do things. Actually we did, it's gotten a bit boring as it's not even good at doing bad. It just sucks. Carrying on the MS tradition.

[–] Bluegrass_Addict@lemmy.ca 5 points 5 days ago

yeah, it errored out too

[–] BigBenis@lemmy.world 27 points 5 days ago (2 children)
[–] JeeBaiChow@lemmy.world 36 points 5 days ago (1 children)

HP: if your non original printer cartridge fails 7 out of 10 times, is it really a savings?

Microsoft: we want 80% of our work to be handled by these ais that fail 70% of the time.

Also fuck HP.

And Microsoft.

[–] Dwayne_Elizondo_Mountain_Dew_Camacho@sh.itjust.works 12 points 5 days ago* (last edited 5 days ago)

Say you hire an employee and you know he fucks up 50% of his tasks, that means you still have to do 50% of the work PLUS examine 100% of his output in great detail to figure out which 50% he fucked up.

Even if the employee was paid 0, I would want him gone.

[–] oppy1984@lemdro.id 24 points 5 days ago

My employer is all in on Microsoft, copilot is terrible, it can't even find a word in a document. Cntl+F find's it no problem.

Now when I have a tech issue that I need an answer for, the bing AI generally gets me a detailed answer on the first try. But it's my understanding that bings AI is just a reskined ChatGPT.

[–] Psythik@lemmy.world 19 points 5 days ago (2 children)

What's with all the shills in that comment section? Yeesh.

[–] SanctimoniousApe@lemmings.world 12 points 5 days ago* (last edited 5 days ago)

MS middle management trying to save their lucrative jobs.

[–] anyhow2503@lemmy.world 8 points 5 days ago (1 children)

All of them have exactly 1 post too.

[–] SaveTheTuaHawk@lemmy.ca 7 points 5 days ago (1 children)
[–] ikidd@lemmy.world 2 points 5 days ago (1 children)
[–] Natanael 2 points 4 days ago

Sorry used copilot for the math

[–] demizerone@lemmy.world 5 points 4 days ago

Gihub died for this.

[–] SanctimoniousApe@lemmings.world 16 points 5 days ago (2 children)

I hope all the money thrown at this "AI" (misnomer, IMHO - it's really just extremely overwrought pattern matching) causes at least some significant humbling (if not outright downfall) of some tech giants. I haven't programmed in a couple decades, and yet even I could tell they weren't gonna get to AGI offa this crap - I can't believe how badly some of these supposed techies fell for their own hype.

[–] massive_bereavement@fedia.io 8 points 5 days ago (2 children)

When discussing it, I often call it "simulated intelligence", because at the end of the day that's what neural networks are.

Edit: only to non-technical people, as simulations are a different thing.

[–] leftzero@lemmy.dbzer0.com 4 points 5 days ago* (last edited 4 days ago)

In science fiction I've often seen the term VI (Virtual Intelligence) to refer to machines that look intelligent, and could probably pass a Turing test, but aren't really intelligent (normally VI coexists with actual AI, often used as interfaces, where it would be a waste, or too risky, to use a proper AI).

LLMs look a bit like that, though they're probably too unreliable to use as an interface for anything important.

[–] ooli3@sopuli.xyz 4 points 5 days ago (1 children)

the correct term is Stochastic Parrot... that is what LLM do. It sound even more cool that AI imho

[–] Zexks@lemmy.world 0 points 5 days ago (4 children)
[–] MehBlah@lemmy.world 4 points 5 days ago

Doesn't matter. There is no cognition. Just word salads mixed and matched with no possibility of receiving "I don't know" for a answer.

[–] ooli3@sopuli.xyz 2 points 4 days ago

So they no more use probability to choose next word? I wonder how they do it now

[–] okamiueru@lemmy.world 2 points 5 days ago (1 children)

That's was a remarkably uninsightful way to approach that topic. Please link to more of these "studies", that one was way too short.

[–] mfed1122@discuss.tchncs.de -1 points 5 days ago (1 children)

The virgin cited study vs the Chad Ad Hominem

[–] okamiueru@lemmy.world 6 points 5 days ago

Did you read the study? It's hilarious. They're using LLMs to "grade" the number of observed "skills" based on the output of LLMs. They're using a stochastic parrot to evaluate another stochastic parrot, and concluding that there is some kind of emergent "skill" going on. Sheeeesh. It'd assume the authors of the paper are just having a laugh. But, one thing is for sure, the AI stupidity train keeps chugging along.

[–] Natanael 1 points 4 days ago

Still stochastic. Even now they still can't reliably do repeat tasks

[–] Jankatarch@lemmy.world 2 points 5 days ago

Comments in the post contain so much cope.