| ▲ | embedding-shape 2 days ago | |||||||||||||
Why is cuda sub-optimal compared to SPIR-V? I don't think I know the internals enough to understand if it's supposed to be obvious why one is better than the other. I'm currently sitting and learning cuda for ML purposes, so happy to get more educated :) | ||||||||||||||
| ▲ | jb1991 2 days ago | parent [-] | |||||||||||||
Just depends on how the manufacturer of the GPU handles code written in different languages. For example, what level of API access, what level of abstraction, and how is the source compiled i.e. how optimized is it. For example, on an apple GPU, you’ll see benchmarks that openCL and metal can vary depending on the tasks. | ||||||||||||||
| ||||||||||||||