The link is in the webpage, but it might be bugged? It's not visible in vanilla chrome or firefox for me (or maybe just not visible on linux).
https://webinstallers.gog-statics.com/download/GOG_Galaxy_2.0.exe
That's from the page's source.
The link is in the webpage, but it might be bugged? It's not visible in vanilla chrome or firefox for me (or maybe just not visible on linux).
https://webinstallers.gog-statics.com/download/GOG_Galaxy_2.0.exe
That's from the page's source.
It’s a GitHub link, what did you expect?
It should work in any generic cuda container, but yeah it’s more of a hobbyist engine. Honestly I just run it raw since it’s dependency free, except for system CUDA.
Vllm absolutely cannot CPU offload AFAIK, but small models will fit in your vram with room to spare.
I have not had to really mess with CachyOS for over a year, while “stable” distros were a nightmare for me.
…Yeah it just depends what you’re trying to get your system to do. Arch can range from incredibly hazardous to “it just works” depending on the person and thing, and so can Mint. I think most distros should be viewed that way.
I have trouble looking at my life when things are bad, TBH, especially dull/isolated kind of bad. It is not pleasant to write.
Writing some fiction for the sake of writing (and weaving personal issues in) has actually lead to some insights, though. And maybe some feeling/reality processing going forward. And happiness.
And when things get better, I want to journal (which I have failed to make time for in the past).
We will eventually have to change our economic system and adapt one with a much much lower consumption rate, figure ways to limit our population growth, or more than likely both.
Of course. 100% agree with this, even if better technology helps. It will have to be pretty soon.
But in the very short term? This is going to be a disaster, and the human population is shooting itself in the foot by not accepting immigration from extreme birth rate countries (where overpopulation is indeed an issue).
I get the joke, but how does onlyfans slide by?
WTF? Do they have dirt on finance execs or something?
…Actually that would make a lot of sense…
Thanks, I nearly choked on my drink imagining that…
Thanks for the TED talk (really)
You have to “unlock” them with a lot of tweaks. And to be clear, I’m just saying they’re better than Windows. Ugh, trying to compile anything on Windows…
Hardware wise, they’re far better for local code assistants, too, with the exception of a few exotic AMD laptops just now coming out.
I cannot concentrate with music playing.
Literally anything else? Like a truck crashing through the window or someone literally trying to engage me? Zero lost focus, to the extent I was initially diagnosed with a hearing disorder.
Not that that helps get stuff done…
Is it 3000 series or newer?
If so, with exllamav3, you can squeeze 32Bs in that 16GB card with relatively little loss. For instance: https://huggingface.co/turboderp/EXAONE-4.0-32B-exl3/tree/3.0bpw
The 3bpw weights are 13 GB, say another 1.5GB for some q5_q4 context, and you are looking at 14.5GB-15GB or so. It will be tight, but it will be leagues smarter than 14Bs.
24B Mistral models will fit much more easily. No need to CPU offload those on a 16GB card, you just need to be careful with your settings.