Remix clone Hacker News

new | show | ask | jobs Github

	▲	nirw4nna 7 hours ago
		I'm currently chipping away at DSC, a tensor library I wrote from scratch to play with large language models. Last week I re-wrote flash attention from scratch in CUDA and was able to get good perf. [1]: https://github.com/nirw4nna/dsc [2]: https://x.com/nirw4nna/status/1968812772944126329