TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B (github.com)
3 points by trykhlieb a day ago | 1 comment
a day ago
[deleted]