| ▲ | AzN1337c0d3r 2 hours ago |
| Most workstation class laptops (i.e. Lenovo P-series, Dell Precision) have 4 DIMM slots and you can get them with 256 GB (at least, before the current RAM shortages). There's also the Ryzen AI Max+ 395 that has 128GB unified in laptop form factor. Only Apple has the unique dynamic allocation though. |
|
| ▲ | the_pwner224 an hour ago | parent | next [-] |
| Yep, I have a 13" gaming tablet with the 128 GB AMD Strix Halo chip (Ryzen AI Max+ 395, what a name). Asus ROG Flow Z13. It's a beast; the performance is totally disproportionate to its size & form factor. I'm not sure what exactly you're referring to with "Only Apple has the unique dynamic allocation though." On Strix Halo you set the fixed VRAM size to 512 MB in the BIOS, and you set a few Linux kernel params that enable dynamic allocation to whatever limit you want (I'm using 110 GB max at the moment). LLMs can use up to that much when loaded, but it's shared fully dynamically with regular RAM and is instantly available for regular system use when you unload the LLM. |
| |
| ▲ | wilkystyle 7 minutes ago | parent [-] | | What operating system are you using? I was looking at this exact machine as a potential next upgrade. |
|
|
| ▲ | lambda an hour ago | parent | prev [-] |
| > Only Apple has the unique dynamic allocation though. What do you mean? On Linux I can dynamically allocate memory between CPU and GPU. Just have to set a few kernel parameters to set the max allowable allocation to the GPU, and set the BIOS to the minimum amount of dedicated graphics memory. |
| |
| ▲ | AzN1337c0d3r 37 minutes ago | parent [-] | | Maybe things have changed but the last time I looked at this, it was only max 96GB to the GPU. And it isn't dynamic in the sense you still have to tweak the kernel parameters, which require a reboot. Apple has none of this. | | |
| ▲ | the_pwner224 29 minutes ago | parent | next [-] | | Strix Halo you can get at least 115 GB to the GPU (out of 128 GB total), I've tested that configuration. I don't have any reason to believe you couldn't use 120+ GB of VRAM. Setting the kernel params is a one-time initial setup thing. You have 128 GB of RAM, set it to 120 or whatever as the max VRAM. The LLM will use as much as it needs and the rest of the system will use as much it needs. Fully dynamic with real-time allocation of resources. Honestly I literally haven't even thought of it after setting those kernel args a while ago. So: "options ttm.pages_limit=31457280 ttm.page_pool_size=31457280", reboot, and that's literally all you have to do. Oh and even that is only needed because the AMD driver defaults it to something like 35-48 GB max VRAM allocation. It is fully dynamic out of the box, you're only configuring the max VRAM quota with those params. I'm not sure why they choice that number for the default. | |
| ▲ | lambda 25 minutes ago | parent | prev [-] | | You do have to set the kernel parameters once to set the max GPU allocation, I have it set to 110 GiB, and you have to set a BIOS setting to set the minimum GPU allocation, I have it set to 512 MiB. Once you've set those up, it's dynamic within those constraints, with no more reboots required. On Windows, I think you're right, it's max 96 GiB to the GPU and it requires a reboot to change it. |
|
|