| ▲ | kristianp 4 days ago | |
Openrouter has an "exacto" [1] option to favour higher quality providers for a given model. Have you found any benefits to using that? Edit: Kimi K2 uses int4 during its training as well as inference [2]. I wonder if that affects the quality if different gguf creators may not convert these correctly? [1] https://openrouter.ai/docs/guides/routing/model-variants/exa... [2] https://www.reddit.com/r/LocalLLaMA/comments/1pzfuqg/why_kim... | ||
| ▲ | gertlabs 4 days ago | parent [-] | |
I did not know about this! We've put a lot of effort into probing providers and their offerings and auto-selecting the best options. I wonder how well their exacto option works. Going to test it out, thanks! | ||