| ▲ | tripleee 3 hours ago | ||||||||||||||||
Christ GPU prices have gotten crazy How do AMD cards perform with LLMs? A 9070 is sold for ~$600 and has 16GB VRAM | |||||||||||||||||
| ▲ | overgard 2 hours ago | parent | next [-] | ||||||||||||||||
In my personal experience, I wouldn't bother with 16GB cards for coding -- the useful models are _slightly_ too large to work at any reasonable speed | |||||||||||||||||
| ▲ | lambda 3 hours ago | parent | prev [-] | ||||||||||||||||
That should do pretty well. Memory bandwidth is the biggest bottleneck for token generation, at 644 GB/s you should be able to do pretty well on a 9070, while prompt proessing is more compute bound and Nvidia tends to have the edge there. 16 GiB won't fit you much, so you'd probably want at least 2x, and preferably 3x of those, and then you need a motherboard, power, etc. that can handle that. | |||||||||||||||||
| |||||||||||||||||