XCSme 6 days ago
I tried 4.1-mini and 4.1-nano. The responses are a lot faster, but for my use case they seem to be a lot worse than 4o-mini (they fail to complete the task when 4o-mini could do it). Maybe I have to update my prompts...
XCSme 6 days ago
Even after updating my prompts, 4o-mini still seems to do better than 4.1-mini or 4.1-nano for a data-processing task.
jjani 5 days ago
That sounds incredibly disappointing given how high their benchmark scores are. It suggests they might be overtuned for those benchmarks, similar to Llama4.