Remix.run Logo
chrislattner 2 days ago

Modular/Mojo is faster than NVIDIA's libraries on their own chips, and open source instead of binary blob. See the 4 part series that culimates in https://www.modular.com/blog/matrix-multiplication-on-blackw... for Blackwell for example.

fooblaster 2 days ago | parent [-]

thanks