this post was submitted on 24 Jul 2024
65 points (94.5% liked)

Tech

1762 readers
2 users here now

A community for high quality news and discussion around technological advancements and changes

Things that fit:

Things that don't fit

Community Wiki

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Naz@sh.itjust.works 39 points 1 year ago* (last edited 1 year ago) (3 children)

Anyone can download, but practically no one can run it.

With the absolute highest memory compression settings, the largest model that you can fit inside of 24GB of VRAM is 109 billion parameters.

Which means even with crazy compression, you need at the very least, ~100GB of VRAM to run it. That's only in the realm of the larger workstation cards which cost around $24,000 - $40,000 each, so y'know.

[–] xionzui@sh.itjust.works 9 points 1 year ago* (last edited 1 year ago) (1 children)

You can get usable performance on a CPU with good memory bandwidth. Apple studios are the best way to get that right now, but a good Epyc with 256GB of RAM works too.

Of course, you could also just run 5 GPUs

[–] IsThisAnAI@lemmy.world 4 points 1 year ago

"usable" sure if you want to wait 10 minutes a word.

[–] Varyk@sh.itjust.works 6 points 1 year ago

Thanks, I was wondering