▲ | wanderingmind 6 days ago | |
Why should we start fine tuning gemma when it is so bad. Why not instead focus the fine-tuning efforts on Qwen, when it starts off with much, much better outputs? | ||
▲ | mdp2021 6 days ago | parent [-] | |
Speed critical applications, I suppose. Have you compared the speeds? (I did. I won't give you number (which I cannot remember precisely), but Gemma was much faster. So, it will depend on the application.) |