Remix.run Logo
drra 4 days ago

Absolutely. Anyone working on inference token level knows how wasteful it all is especially in multimodal tokens.