Borealid 3 hours ago
Did you casually glance at how the hardware in the Framework Desktop (Strix Halo) works before commenting?
michaelanckaert 3 hours ago
I didn't glance at it, I read it :-) The architecture is a 'unified memory bus', so yes, the GPU has access to that memory. My comment was a bit unfortunate, as it implied I didn't agree with yours; sorry for that. I simply wanted to clarify that there's a difference between 'GPU memory' and 'system memory'. The Framework Desktop is a nice deal. I wouldn't buy the Ryzen AI+ myself; from what I read, it maxes out at about 60 tokens/sec, which is too low for my use cases.
ramon156 3 hours ago
These don't run 200B models at all; results show they can run 13B at best. 70B is ~3 tokens/sec according to someone on Reddit.