zurfer 5 days ago
It's a cool release, but if someone on the Google team reads this: Flash 2.5 is awesome in terms of latency and total response time without reasoning. In quick tests this model seems to be 2x slower. So for certain use cases, like quick one-token classification, Flash 2.5 is still the better model. Please don't stop optimizing for that!
edvinasbartkus 5 days ago
Did you try setting thinkingLevel to minimal?
  thinkingConfig: { thinkingLevel: "low", }
More about it here: https://ai.google.dev/gemini-api/docs/gemini-3#new_api_featu...
retropragma 5 days ago
That's more of a Flash-Lite thing now, I believe.
Tiberium 5 days ago
You can still set the thinking budget to 0 to completely disable reasoning, or set the thinking level to minimal or low.
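For anyone who wants to try both suggestions side by side, here is a minimal sketch of how the two knobs mentioned above might be placed in a request body. It only builds a dictionary and does not call any API. The helper name `build_request_body` is hypothetical, and the nesting under `generationConfig` as well as the rule that the two settings should not be combined are assumptions to verify against the linked documentation. Whether a given model honors a budget of 0 is the commenter's claim, not something this sketch checks.

```python
from typing import Optional


def build_request_body(prompt: str,
                       thinking_level: Optional[str] = None,
                       thinking_budget: Optional[int] = None) -> dict:
    """Hypothetical helper: build a generateContent-style JSON body.

    The field names thinkingConfig / thinkingLevel come from the thread;
    thinkingBudget and the generationConfig nesting are assumptions.
    """
    # Assumption: the two settings are alternatives, so reject both at once.
    if thinking_level is not None and thinking_budget is not None:
        raise ValueError("set thinking_level or thinking_budget, not both")

    thinking_config = {}
    if thinking_level is not None:
        # e.g. "minimal" or "low", as suggested in the thread
        thinking_config["thinkingLevel"] = thinking_level
    if thinking_budget is not None:
        if thinking_budget < 0:
            raise ValueError("thinking_budget must be >= 0")
        # 0 is the value suggested in the thread for disabling reasoning
        thinking_config["thinkingBudget"] = thinking_budget

    body = {"contents": [{"parts": [{"text": prompt}]}]}
    if thinking_config:
        body["generationConfig"] = {"thinkingConfig": thinking_config}
    return body


if __name__ == "__main__":
    import json
    # Two variants to compare for a one-token classification prompt.
    print(json.dumps(build_request_body("spam or ham: 'win a prize'",
                                        thinking_level="low"), indent=2))
    print(json.dumps(build_request_body("spam or ham: 'win a prize'",
                                        thinking_budget=0), indent=2))
```

Timing the same prompt with each variant would show whether either setting closes the latency gap described at the top of the thread.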
bobviolier 5 days ago
This might also have to do with it being a preview, and only being available in the global region?