Remix.run Logo
p1esk 8 days ago

Yes, I’d be curious about his experience with GPT-5 Thinking model. So far I haven’t seen any blunders from it.

eru 8 days ago | parent [-]

I've seen plenty of blunders, but in general it's better than their previous models.

Well, it depends a bit on what you mean by blunders. But eg I've seen it confidently assert mathematically wrong statements with nonsense proofs, instead of admitting that it doesn't know.

grey-area 8 days ago | parent [-]

In a very real sense it doesn’t even know that it doesn’t know.

eru 8 days ago | parent [-]

Maybe. But in math you can either produce the proof (with each step checkable) or you can't.