| ▲ | lacoolj 21 hours ago | ||||||||||||||||||||||
lol I love how OpenAI just straight up doesn't compare their model to others on these release pages. Basically telling us they know Gemini and Opus are better but they don't want to draw attention to it | |||||||||||||||||||||||
| ▲ | qwesr123 21 hours ago | parent | next [-] | ||||||||||||||||||||||
Not sure why they don't compare with others, but they are actually leading on the benchmarks they published. See here (bottom) for a chart comparing to other models: https://marginlab.ai/blog/swe-bench-deep-dive/ | |||||||||||||||||||||||
| |||||||||||||||||||||||
| ▲ | dbbk 20 hours ago | parent | prev [-] | ||||||||||||||||||||||
This was the one thing I scanned for. No comparison against Opus. See ya. | |||||||||||||||||||||||
| |||||||||||||||||||||||