An open course on building high performance LLM inference engine! Hope to finish by the end of April
https://github.com/jmaczan/tiny-vllm