| ▲ | villgax 6 hours ago | ||||||||||||||||||||||||||||||||||||||||
100 times more chips for equivalent memory, sure. | |||||||||||||||||||||||||||||||||||||||||
| ▲ | m4r1k 5 hours ago | parent | next [-] | ||||||||||||||||||||||||||||||||||||||||
Check the specs again. Per chip, TPU 7x has 192GB of HBM3e, whereas the NVIDIA B200 has 186GB. While the B200 wins on raw FP8 throughput (~9000 vs 4614 TFLOPs), that makes sense given NVIDIA has optimized for the single-chip game for over 20 years. But the bottleneck here isn't the chip—it's the domain size. NVIDIA's top-tier NVL72 tops out at an NVLink domain of 72 Blackwell GPUs. Meanwhile, Google is connecting 9216 chips at 9.6Tbps to deliver nearly 43 ExaFlops. NVIDIA has the ecosystem (CUDA, community, etc.), but until they can match that interconnect scale, they simply don't compete in this weight class. | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
| ▲ | croon 6 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||
Ironwood is 192GB, Blackwell is 96GB, right? Or am i missing something? | |||||||||||||||||||||||||||||||||||||||||
| ▲ | NaomiLehman 6 hours ago | parent | prev [-] | ||||||||||||||||||||||||||||||||||||||||
I think it's not about the cost but the limits of quickly accessible RAM | |||||||||||||||||||||||||||||||||||||||||