this post was submitted on 03 Aug 2025
268 points (87.2% liked)
Fuck AI
3612 readers
718 users here now
"We did it, Patrick! We made a technological breakthrough!"
A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Nope, I'm not ignoring them, but the post is specifically about exceptions. The OOP claims there are no exceptions and there is no ethical generative AI, which is false. Your comment only applies to the majority of massive LLMs hosted by massive corporations.
The CommonCorpus dataset is less than 8TB, so fits on a single hard drive, not a data center, and contains 2 trillion tokens, which is a relatively similar amount of tokens that small local LLMs are typically trained with (OLMo 2 7B and 13B were trained on 5 trilion tokens).
These local LLMs don't have high electricity use or environmental impact to train, and don't require a massive data center for training. The training cost in energy is high, but nothing like GPT4, and is only a one time cost anyway.
So, the OOP is wrong, there is ethical generative AI, trained only on data available in the public domain, and without a high environmental impact.