bahmboo 18 hours ago
"In recent inference tests run on a 3-billion-parameter LLM developed from IBM’s Granite-8B-Code-Base model, NorthPole was 47 times faster than the next most energy-efficient GPU and was 73 times more energy efficient than the next lowest latency GPU." It's also fascinating that they are experimenting with analog memory, because it pairs so well with model weights.
anyfoo 16 hours ago
Yeah, analog memory fits incredibly well. Who cares if it's not "exact" and fuzzes around a bit, if it's only used for weights and has massive efficiency advantages? Weights are never "exact" themselves, and it doesn't matter if they don't always read back exactly the same. You basically just get some extra "temperature" for free! It's kind of beautiful that we might end up partially going back to analog computers, which were quickly replaced by digital ones.
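To make the "fuzzy weights" point concrete, here is a toy sketch in plain Python. Everything in it is an assumption for illustration: the `noisy_read` helper, the multiplicative Gaussian noise model, the 2% noise level, and the tiny 8x64 layer are all made up and say nothing about NorthPole or any real analog memory. It just reads the same weights back with a little noise each time and checks how far the softmax output drifts from the exact result.

```python
import math
import random


def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(weights, x):
    # Plain matrix-vector product: one logit per weight row.
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def noisy_read(weights, rel_sigma, rng):
    # Hypothetical noise model (an assumption, not a hardware spec):
    # every read returns the stored value scaled by (1 + Gaussian noise).
    return [[w * (1.0 + rng.gauss(0.0, rel_sigma)) for w in row]
            for row in weights]


rng = random.Random(0)  # fixed seed so repeated runs give the same numbers
weights = [[rng.uniform(-1.0, 1.0) for _ in range(64)] for _ in range(8)]
x = [rng.uniform(-1.0, 1.0) for _ in range(64)]

# Reference output computed from the exact stored weights.
exact = softmax(matvec(weights, x))

# 200 independent noisy reads of the same weights, 2% relative noise each.
samples = [softmax(matvec(noisy_read(weights, 0.02, rng), x))
           for _ in range(200)]

# Largest absolute change in any output probability across all reads.
max_drift = max(abs(p - q) for s in samples for p, q in zip(s, exact))
print("exact probabilities:", [round(p, 3) for p in exact])
print("max probability drift over 200 noisy reads:", round(max_drift, 4))
```

Strictly speaking this is not the same thing as sampling temperature, which rescales the logits rather than jittering them, but the effect on sampling is similar in spirit: the output distribution wobbles a little from one read to the next.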
imtringued 7 hours ago
Their NorthPole chip doesn't look much different from the Groq LPU, Tenstorrent's hardware, or even just AMD's NPU design. The Tenstorrent cards have a pretty large amount of SRAM considering their price.