▲ | anotherpaulg 6 days ago | |||||||
I just finished updating the aider polyglot leaderboard [0] with GPT-4.1, mini and nano. My results basically agree with OpenAI's published numbers. Results, with other models for comparison:
Aider v0.82.0 is also out with support for these new models [1]. Aider wrote 92% of the code in this release, a tie with v0.78.0 from 3 weeks ago. | ||||||||
▲ | pzo 6 days ago | parent | next [-] | |||||||
Did you benchmarked combo: DeepSeek R1 + DeepSeek V3 (0324)? There is combo on 3rd place : DeepSeek R1 + claude-3-5-sonnet-20241022 and also V3 new beating claude 3.5 so in theory R1 + V3 should be even on 2nd place. Just curious if that would be the case | ||||||||
▲ | purplerabbit 6 days ago | parent | prev [-] | |||||||
What model are you personally using in your aider coding? :) | ||||||||
|