SocialistVibes01@lemmy.ml to Linux@lemmy.ml · 10 hours ago
Which specs are as low as reasonably possible for local LLM models? Do you recommend some distro in particular?
meowmeow@quokk.au · 10 hours ago
A budget build is going to run you $4k+ for something like qwen3-coder:30b, and you'll probably be annoyed at the speed if you're used to Codex or Claude.
infinitevalence@discuss.online · 10 hours ago
I disagree. Gemma 4 easily runs inside a 16 GB GPU and is really pretty fast.
meowmeow@quokk.au · 9 hours ago
Fast is relative. I'm also commenting on the cost of the entire system, not just the GPU, FYI.
infinitevalence@discuss.online · 9 hours ago
That's fair, but nearly any modern CPU, at least 32 GB of RAM, and a current GPU with 16 GB is plenty. No need for a $4k system when $1k-$1.5k will do it. If you're willing to Frankenstein things, some of the used AI/ML/mining cards can be a decent value.
meowmeow@quokk.au · 8 hours ago
Yes, but when you compare it to Codex and Claude, it's significantly slower, especially over time. Better crank that AC. I think in a few years we will have current cloud levels running pretty efficiently on current computers.
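For anyone weighing the "30B model vs. 16 GB card" question above, here is a rough back-of-the-envelope sketch of VRAM requirements. The 4-bit quantization figure (~0.5 bytes per parameter) and the ~20% allowance for KV cache and runtime overhead are assumptions, not numbers from this thread; real footprints vary by quantization format and context length.

```python
# Rough estimate of GPU memory needed to run a quantized local LLM.
# Assumptions: ~0.5 bytes/param (4-bit quantization) and ~20% extra
# for KV cache and runtime overhead. Treat results as ballpark only.

def vram_needed_gb(params_billion: float,
                   bytes_per_param: float = 0.5,
                   overhead: float = 0.20) -> float:
    """Estimated GB of VRAM to load and run the model."""
    weights_gb = params_billion * bytes_per_param
    return weights_gb * (1 + overhead)

# A 30B model (e.g. qwen3-coder:30b) at 4-bit: ~18 GB,
# which spills past a 16 GB card, so layers offload to system RAM.
print(round(vram_needed_gb(30), 1))

# A smaller ~12B model at 4-bit: ~7 GB, fits a 16 GB GPU comfortably.
print(round(vram_needed_gb(12), 1))
```

This is why the thread's two positions can both be right: a 16 GB GPU runs smaller models entirely on-card and feels fast, while a 30B model partially offloads to CPU and slows down noticeably.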