jareds | a day ago
I'll look at it when this shows up on https://aider.chat/docs/leaderboards/. Keeping up with all the models feels like a full-time job, so I just use this instead and hopefully get 90% of the benefit I would get from manually testing every model.
evantbyrne | a day ago
Are these just LeetCode exercises? What I would like to see is an independent benchmark based on real tasks in codebases of varying size.