this post was submitted on 28 Jan 2025
845 points (94.7% liked)

memes

Office space meme:

"If y'all could stop calling an LLM "open source" just because they published the weights... that would be great."

[–] Prunebutt@slrpnk.net 8 points 6 months ago (1 children)

Did "they" publish the training data? And the hyperparameters?

[–] thespcicifcocean@lemmy.world -2 points 6 months ago (1 children)

I mean, I downloaded it from the repo.

[–] Prunebutt@slrpnk.net 10 points 6 months ago (1 children)

You downloaded the weights. That's something different.
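
(To illustrate the distinction: what a model repo typically ships is the weight files plus an inference config and tokenizer, not the training corpus or the training pipeline. A minimal sketch, assuming the repo is hosted on the Hugging Face Hub; the repo id below is a placeholder, not a real model.)

```python
# Minimal sketch: fetching a published model "from the repo".
# Assumption: the repo lives on the Hugging Face Hub; the repo id is hypothetical.
from huggingface_hub import list_repo_files, snapshot_download

repo_id = "some-org/some-open-weights-model"  # placeholder repo id

# What's actually in the repo: weight shards (*.safetensors), tokenizer files,
# and an inference config -- not the training data or the training code.
print(list_repo_files(repo_id))

# Downloading "it" really means downloading the weight files.
local_dir = snapshot_download(repo_id, allow_patterns=["*.safetensors", "*.json"])
print("weights downloaded to", local_dir)
```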

[–] thespcicifcocean@lemmy.world 1 point 6 months ago (1 children)

I may be misunderstanding, but are the weights typically several hundred gigabytes in size?

[–] Prunebutt@slrpnk.net 8 points 6 months ago* (last edited 6 months ago) (1 children)

Yes. The training data is probably a few hundred petabytes.
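
(Back-of-the-envelope arithmetic, for illustration: the size of the weights is roughly parameter count times bytes per parameter, so a 70B-parameter model stored in 16-bit precision comes to on the order of 140 GB, while the raw training corpora are orders of magnitude larger.)

```python
# Rough, illustrative arithmetic: weight size scales with parameter count,
# not with the size of the training data.
params = 70e9            # e.g. a 70B-parameter model (illustrative figure)
bytes_per_param = 2      # 16-bit (fp16/bf16) storage
weight_size_gb = params * bytes_per_param / 1e9
print(f"~{weight_size_gb:.0f} GB of weights")  # ~140 GB
```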

[–] thespcicifcocean@lemmy.world 2 points 6 months ago (1 children)
[–] BradleyUffner@lemmy.world 3 points 6 months ago

Yeah, some models are trained on pretty much the entire content of the publicly accessible Internet.