▲ | jszymborski a day ago | |||||||||||||||||||||||||||||||
People familiar with exotic RNNs and improvements to LSTMs know this problem all too well. The moment your lstm isnt a bog standard lstm, it loses all the speed-ups from cuDNN and it becomes borderline unusable for anything but toy models. | ||||||||||||||||||||||||||||||||
▲ | tpurves a day ago | parent [-] | |||||||||||||||||||||||||||||||
These would be inherently temporary problems though right? If it became eventually clear that alternate methods were the way forward, NVDIA would be highly motivated to do the optimization work wouldn't they? Any new step functions that can forestall the asymptotic plateauing of AI progress are things they desperately need. | ||||||||||||||||||||||||||||||||
|