varispeed 9 days ago

I rented H100s for training a couple of times and found that they couldn't do training at all*. The same code worked fine on a Mac M1 and an RTX 5080, but on the H100 I got completely different results.

So I wonder what I could be doing wrong. In the end I just use the RTX 5080, since my models fit neatly in the available RAM.

* By "not working at all" I mean the scripts ran, but the results were wrong, as if the H100 couldn't do maths properly.