yu3zhou4 4 hours ago

An open course on building a high-performance LLM inference engine! Hope to finish by the end of April.

https://github.com/jmaczan/tiny-vllm