I think gemma-4-26b-a4b and Qwen3.6-35B-A3B show that there's something very int...

chrisweekly · 2026-06-16T15:50:24 1781625024

Obtaining that 64GB RAM is a meaningful obstacle for many.

simonw · 2026-06-16T16:03:06 1781625786

I'm still amazed that you can run LLMs of this quality on a machine that costs less than $3,000.

I used to assume that anything GPT-4 equivalent or higher would need $30,000+ of server-class hardware.

That said... gemma-4-12b-qat is 7.15GB on disk so should run reasonably well in 16GB, that takes it down to MacBook Air territory https://lmstudio.ai/models/google/gemma-4-12b-qat

verdverm · 2026-06-17T14:30:36 1781706636

Second this notion. After picking up an OEM Spark and running qwen36moe/dense, I was thoroughly impressed with what such small models can do and the (reasonable) speeds you can get. I'm back to using open weight models via an API (wanted more capability for the time being), but will be getting more hardware soon (re: ds4-flash and the fable shot heard round the world)

frollogaston · 2026-06-16T19:09:28 1781636968

Not just RAM, VRAM, right? Though they're one and the same on the Mac.