Remix.run Logo
datadrivenangel 2 hours ago

Yeah I've got the q4 gpt-oss-120b running at ~40-60 tokens per second on an M5 Pro.