Remix.run Logo
cat_plus_plus 2 hours ago

At least for transformers, it can be kind of fixed with MOE + NVFP4 for small working set despite large resident size.