JeremyNT 6 hours ago

I also don't understand why people in this price bracket are buying Mac laptops instead of desktop computers with GPUs? Just to flex that it's portable?

mft_ 3 hours ago | parent | next [-]

(I'm not one of the people you're speaking of with a 128gb M5 but) if you want to run one of the medium-sized open-weights models (Qwen 27b, 35b, Gemma 4 26b, 31b) or larger, you get into an interesting optimisation space.

* yes, you can run it on an older/smaller GPU plus system RAM but performance will suffer

* if you want optimal GPU performance you need the model plus context in VRAM, so 24GB (3090, 4090) or 32GB (5090) cards, plus a reasonably powerful system to plug them into. Ideally you'd have multiple cards working together, but for optimal performance this means either 2x 3090 or Nvidia's workstation cards.

* you can go for a 128GB Strix Halo system, but the memory bandwidth isn't great and they're becoming increasingly expensive (5.5k EUR for HP laptop, 3.9k EUR for GMKtec EVO-X2 mini PC)

* you can go for a 128GB DGX Spark (5k EUR+) which also has unspectacular memory bandwidth or RTX Spark (price unclear but probably not cheaper)

* or go for a Mac with a decent CPU and a good amount of RAM (bandwidth varies by model, but typically a bit better than Strix Halo/DGX Spark and worse than bespoke GPUs).

As usual with such questions, there are of course cheaper paths (if you want to accept the tradeoffs) but Macs are reasonable vs. competition for these workloads.
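
A rough way to see which of these options can hold a given model is to estimate the size of the weights from the parameter count and the quantization width. The sketch below is back-of-the-envelope arithmetic only: the function names are made up for illustration, and the 4 GB `overhead_gb` default for context and runtime buffers is an assumed placeholder, since real context memory depends on the model architecture and context length.

```python
def weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (10^9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         memory_gb: float, overhead_gb: float = 4.0) -> bool:
    """True if weights plus an ASSUMED fixed overhead fit in memory_gb.

    overhead_gb stands in for context (KV cache) and runtime buffers;
    4 GB is a placeholder, not a measured value.
    """
    return weights_gb(params_billion, bits_per_weight) + overhead_gb <= memory_gb


if __name__ == "__main__":
    # A 27B-parameter model at common quantization widths,
    # checked against 24 GB, 32 GB and 128 GB of memory.
    for memory_gb in (24, 32, 128):
        for bits in (4, 8, 16):
            size = weights_gb(27, bits)
            verdict = "fits" if fits(27, bits, memory_gb) else "does not fit"
            print(f"27B @ {bits}-bit = {size:.1f} GB weights: {verdict} in {memory_gb} GB")
```

Under these assumptions a 27B model only fits a 24GB card at 4-bit, while the 128GB machines can hold it unquantized.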

ctkhn 4 hours ago | parent | prev | next [-]

I don't even travel a ton but portability is huge. It's not a flex, it's a functional thing that lets me move around within my house, or work while I'm at my parents' or traveling or anywhere else. Other than my media collection that lives on my home server, I want most of my files to come with me on my laptop.

jeroenhd 6 hours ago | parent | prev | next [-]

A Mac with a boatload of RAM can run models that exceed the limits of any GPU that doesn't cost at least twice as much as the Apple hardware itself.

You get fewer tokens per second, but at some point the balance between quality and quantity makes the large model size worth the spend.

When you're spending this kind of money, you may as well treat yourself to a pretty screen and some decent speakers. Nothing the competition doesn't offer these days, but you get them for free with the car-priced RAM upgrade so why go for less.

bastardoperator 4 hours ago | parent | prev | next [-]

I have a bunch of computers and gadgets, why settle on one?

LeBit 6 hours ago | parent | prev | next [-]

I think it is because desktop computers with GPUs with enough VRAM to run interesting models are insanely expensive, hard to source, consume a lot of electricity, and dissipate a lot of heat.

redox99 5 hours ago | parent | prev | next [-]

Yeah, it's a much better idea to buy many used 3090s. 4090s or 5090s if you can afford it. Way faster.

aurareturn 3 hours ago | parent [-]

Probably depends on what you're trying to do.

You need an expensive motherboard, cooling, PSU(s) to use multiple high end GPUs together. Then there is the noise and the fact that you can't bring it on an airplane.

ilogik 6 hours ago | parent | prev [-]

What GPU can I buy with >100GB of memory?

verdverm 5 hours ago | parent [-]

DGX Spark is one, but it really depends on how much you want to spend

aurareturn 3 hours ago | parent [-]

273 GB/s bandwidth vs. 614 GB/s for the M5 Max. And you're getting a whole laptop.

$5k for DGX Spark as well.
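
Why bandwidth matters here: for a dense model, generating each token means reading roughly all of the weights from memory once, so memory bandwidth divided by model size gives a crude upper bound on tokens per second. The sketch below applies that rule of thumb to the two bandwidth figures above. The 13.5 GB model size is an assumed example (about a 27B model at 4-bit), and real throughput will be lower than this ceiling.

```python
def max_tokens_per_second(bandwidth_gb_s: float, weights_gb: float) -> float:
    """Crude ceiling on generation speed for a dense model.

    Assumes every token requires streaming all weights from memory once
    and ignores compute, KV-cache reads and other overheads.
    """
    return bandwidth_gb_s / weights_gb


if __name__ == "__main__":
    weights_gb = 13.5  # ASSUMED example size, not a measured model
    spark = max_tokens_per_second(273, weights_gb)
    m5_max = max_tokens_per_second(614, weights_gb)
    print(f"DGX Spark ceiling: {spark:.1f} tok/s")
    print(f"M5 Max ceiling:    {m5_max:.1f} tok/s")
    print(f"Ratio:             {m5_max / spark:.2f}x")
```

The ratio is independent of the model size chosen: it is simply 614 / 273, about 2.25x.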

verdverm 3 hours ago | parent [-]

Prompt processing time is better on the Spark, which aligns more with coding (more reading than writing).

I spent less than $4k, the OEM boxes are better for cooling, there's no Apple markup, and I get a real Linux system for stuff like k3s.

aurareturn an hour ago | parent [-]

Yes, it's better on the Spark but the M5 is a lot closer than before with neural accelerators. After prompt processing, token generation speed on the M5 Max is 2.3x faster.

No Apple markup but you get the Nvidia markup instead. Prior to the recent Apple price increase due to the RAM shortage, an M5 Max 128GB was a bargain if you want to run local LLMs.
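
The prompt processing vs. token generation trade-off in this subthread can be put into a simple model: total response time is prompt tokens divided by prompt-processing speed, plus output tokens divided by generation speed. The sketch below uses two hypothetical machines, and every throughput number in it is invented for illustration, not a benchmark of the Spark or the M5 Max.

```python
def total_seconds(prompt_tokens: int, output_tokens: int,
                  prompt_tps: float, gen_tps: float) -> float:
    """Time to read the prompt plus time to write the answer."""
    return prompt_tokens / prompt_tps + output_tokens / gen_tps


# HYPOTHETICAL machines: A reads prompts faster, B generates faster.
MACHINE_A = {"prompt_tps": 2000.0, "gen_tps": 20.0}
MACHINE_B = {"prompt_tps": 1000.0, "gen_tps": 40.0}

if __name__ == "__main__":
    workloads = {
        "coding (long prompt, short answer)": (50_000, 500),
        "chat (short prompt, long answer)": (1_000, 2_000),
    }
    for name, (prompt_tokens, output_tokens) in workloads.items():
        a = total_seconds(prompt_tokens, output_tokens, **MACHINE_A)
        b = total_seconds(prompt_tokens, output_tokens, **MACHINE_B)
        print(f"{name}: A = {a:.1f}s, B = {b:.1f}s")
```

With these made-up numbers, the machine with faster prompt processing wins on the read-heavy workload and the machine with faster generation wins on the write-heavy one, so which box is "faster" depends on the mix.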