I've actually seen really good outputs from the regular Grok 4. The issue seemed to be that it didn't explain anything and just made some changes, which, like I said, were pretty good. I never wanted a faster version, I just wanted a bit more feedback and explanations for suggested changes.

What I recently found much more valuable, and why I now prefer GPT-5 over Sonnet 4, is that if I start asking it for different architectural choices, it's really quite good at summarizing trade-offs and offering step-by-step navigation towards solving the problem. I like this process a lot more than trying to "one-shot" something, or getting tons of code completely rewritten that's unrelated to what I'm really asking for. That seems to be a really bad problem with Opus 4.1 Thinking, or even Sonnet Thinking. I don't think it's accurate to rate models on "one-shotting" a problem. Rate them on how easy they are to work with as an assistant.

▲

Szpadel 5 days ago | parent | next [-]

I had that issue with GPT-5: when it wanted to do something in a way that was just plain wrong for this project, no matter what I said it just kept doing the same thing.

It was completely unsteerable. I get why people are often upset by the "you're right" of Claude models, but that's what I usually want from a model.

I guess expectations differ depending on the developer's experience level, but I want to have the final say on what the right way is.

▲

cft 5 days ago | parent | prev | next [-]

I have the same experience, except that while I agree GPT-5 is better than Sonnet 4 for architecture and deep thinking, Sonnet 4 still seems to be better for just banging out code when you have a well-defined and very detailed plan.

▲

Demiurge 5 days ago | parent | prev [-]

Sometimes it's obvious, but in this case, why are you downmodding my comment? I'm genuinely curious: what am I saying that is so offensive or wrong?

	▲	5 days ago | parent | next [-]
		[deleted]
	▲	oblio 4 days ago | parent | prev [-]
		I didn't downvote, but: 1. A lot of people are interested in maintaining AI hype. 2. People work differently.