▲ | Sohcahtoa82 6 days ago | ||||||||||||||||||||||||||||||||||
I don't know what anybody would do with such a weak card. My RTX 5090 is about 10x faster (measured by FP32 TFLOPS) and I still don't find it to be fast enough. I can't imagine using something so slow for AI/ML. Only 2.2 tokens/sec on an 8B parameter Llama model? That's slower than someone typing. I get that it's a budget card, but budget cards are supposed to at least win on a pure price/performance ratio, even with a lower baseline performance. The 5090 is 10x faster but only 6-8x the price, depending on where in the $2-3,000 price range you can find one at. | |||||||||||||||||||||||||||||||||||
▲ | dragonwriter 6 days ago | parent | next [-] | ||||||||||||||||||||||||||||||||||
> My RTX 5090 is about 10x faster (measured by FP32 TFLOPS) and I still don't find it to be fast enough. I can't imagine using something so slow for AI/ML. Only 2.2 tokens/sec on an 8B parameter Llama model? That's slower than someone typing. Its also orders of magnitudr slower than what I normally see cited by people using 5090s; heck, its even much slower than I see on my own 3080Ti laptop card for 8B models, though usually won’t use more than an 8bpw quant for that size model. | |||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||
▲ | clifflocked 6 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||
I feel as though you are measuring tokens/s wrong, or have a serious bottleneck somewhere. On my i5-10210u (no dedicated graphics, at standard clock speeds), I get ~6 tokens/s on phi4-mini, a 4b model. That means my laptop CPU with a power draw of 15 watts, that was released 6 years ago, is performing better than a 5090. > The 5090 is 10x faster but only 6-8x the price I don't buy into this argument. A B580 can be bought at MSRP for 250$. A RTX 5090 from my local Microcenter is around 3250$. That puts it at around 1/13th the price. Power costs can also be a significant factor if you choose to self-host, and I wouldn't want to risk system integrity for 3x the power draw, 13x the price, a melting connector, and Nvidia's terrible driver support. EDIT: You can get an RTX 5090 for around 2500$. I doubt it will ever reach MSRP though. | |||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||
▲ | jpalawaga 6 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||
you have outlier needs if an rtx, the fastest consumer grade card, is not good enough for you. the intel card is great for 1080p gaming. especially if you're just playing counterstrike, indie games, etc, you don't need a beast. very few people are trying to play 4k tombraider on ultra with high refresh rate. | |||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||
▲ | adgjlsfhk1 6 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||
The B60 is ridiculously good for scientific workloads. it's 50% more fp64 flops than a 5090 and 3/4ths the VRAM for 1/4th the price. | |||||||||||||||||||||||||||||||||||
▲ | ohdeargodno 6 days ago | parent | prev [-] | ||||||||||||||||||||||||||||||||||
[dead] |