Remix.run Logo
BoorishBears 2 hours ago

If they really managed this from pre-training a 1.6 T parameter model through to post-training without NVIDIA, Dwarkesh Patel got what he wanted.

Jabrov 2 hours ago | parent [-]

Who? What did he want?

gardnr an hour ago | parent [-]

Dwarkesh Patel has AI/ML guests on his podcast. BoorishBears may have been referring to the Jensen Huang episode where they discuss TPUs: https://youtu.be/Hrbq66XqtCo?t=982