Remix.run Logo
justinhj 5 hours ago

We see the same with Google's Flash models. It's easier to make a small capable model when you have a large model to start from.

karmasimida 5 hours ago | parent [-]

Flash models are nowhere near Pro models in daily use. Much higher hallucinations, and easy to get into a death sprawl of failed tool uses and never come out

You should always take those claim that smaller models are as capable as larger models with a grain of salt.

justinhj 3 hours ago | parent [-]

Flash model n is generally a slightly better Pro model (n-1), in other words you get to use the previously premium model as a cheaper/faster version. That has value.

karmasimida 2 hours ago | parent [-]

They do have value, because they are much much cheaper.

But no, 3.0 flash is not as good as 2.5 pro, I use both of them extensively, especially in translation. 3.0 flash will confidently mistranslate some certain things, while 2.5 pro will not.