GaggiX 5 days ago

They went too far: now the Flash model is competing with their Pro version. It scores better on SWE-bench and ARC-AGI 2 than 3.0 Pro. I imagine they are going to improve 3.0 Pro before it leaves Preview.

Also, I don't see it written in the blog post, but Flash supports more granular settings for reasoning: minimal, low, medium, high (like OpenAI models), while Pro only has low and high.

minimaxir 5 days ago | parent | next [-]

"minimal" is a bit weird.

> Matches the “no thinking” setting for most queries. The model may think very minimally for complex coding tasks. Minimizes latency for chat or high throughput applications.

I'd prefer a hard "no thinking" rule to what this is.

GaggiX 5 days ago | parent [-]

It still supports the legacy mode of setting the budget: you can set it to 0, and that would be equivalent to the "none" reasoning effort in GPT-5.1/5.2.
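
For anyone wanting to try the two modes side by side, here is a rough sketch of what the request bodies might look like. It only builds the JSON and sends nothing. The field names (`generationConfig`, `thinkingConfig`, `thinkingBudget`, `thinkingLevel`) are my understanding of the Gemini REST API and should be checked against the current docs; the helper `build_request_body` and the prompt are made up for illustration, and whether a given model accepts a budget of 0 is the claim from this thread, not something the code verifies.

```python
import json


def build_request_body(prompt, thinking_budget=None, thinking_level=None):
    """Build a generateContent-style JSON body with at most one thinking setting.

    Hypothetical helper: the field names are assumed from the Gemini REST API.
    """
    if thinking_budget is not None and thinking_level is not None:
        raise ValueError("set either thinking_budget or thinking_level, not both")

    thinking_config = {}
    if thinking_budget is not None:
        # Legacy style: a token budget, where 0 is meant to disable thinking.
        thinking_config["thinkingBudget"] = thinking_budget
    if thinking_level is not None:
        # Newer style: a named level such as "minimal", "low", "medium", "high".
        thinking_config["thinkingLevel"] = thinking_level

    body = {"contents": [{"parts": [{"text": prompt}]}]}
    if thinking_config:
        body["generationConfig"] = {"thinkingConfig": thinking_config}
    return body


no_thinking = build_request_body("Summarize this thread.", thinking_budget=0)
minimal = build_request_body("Summarize this thread.", thinking_level="minimal")

print(json.dumps(no_thinking, indent=2))
print(json.dumps(minimal, indent=2))
```

The helper refuses to set both at once, since mixing the legacy budget with the newer level seems likely to be rejected or ambiguous; that restriction is an assumption on my part.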

minimaxir 5 days ago | parent [-]

I can confirm this is the case via the API, but annoyingly AI Studio doesn't let you do so.

skerit 5 days ago | parent | prev | next [-]

> They went too far, now the Flash model is competing with their Pro version

Wasn't this the case with the 2.5 Flash models too? I remember being very confused at that time.

JohnnyMarcone 5 days ago | parent [-]

This is similar to how Anthropic has treated Sonnet/Opus as well, at least pre-Opus 4.5.

To me it seems like the big model has been "look what we can do", and the smaller model is "actually use this one though".

jug 5 days ago | parent | prev [-]

I'm not sure how I'm going to live with this!