Remix.run Logo
gordon_freeman 4 days ago

It seems like the progress from GPT-4 to GPT-5 has plateaued: for most prompts, I actually find GPT-4 more understandable than GPT-5 [1].

[1] Read the answers from GPT-4 and 5 for this math question: "Ugh I hate math, integration by parts doesn't make any sense"

energy123 4 days ago | parent [-]

Basic prose is a saturated bench. You can't go above 100% so by definition progress will stall on such benchmarks.

RugnirViking 4 days ago | parent [-]

You say that, but I can imagine a good maths textbook and a bad one, both technically correct and well written prose, but one is better at taking the student on a journey and understanding where people fall off and common misunderstandings without odiously re-explaining everything