Remix clone Hacker News

new | show | ask | jobs Github

	▲	HarHarVeryFunny 6 hours ago
		> All of the optimizations Deepseek have done are in software and it goes down to the PTX assembly level DeepSeek are still using NVIDIA (PTX) to train on, but for inference have already transitioned to Huawei Ascend chips, and inference speed is what this paper is addressing.