| ▲ | HarHarVeryFunny 6 hours ago | |
> All of the optimizations Deepseek have done are in software and it goes down to the PTX assembly level DeepSeek are still using NVIDIA (PTX) to train on, but for inference have already transitioned to Huawei Ascend chips, and inference speed is what this paper is addressing. | ||