jatora 3 hours ago

The claims of 4.6 or 4.7 being superior genuinely make me laugh. Adapt your workflow if needed and use the superior model instead of just knee-jerk believing they actually enshittified a model with zero evidence except vibes on nondeterministic model output. Jesus.

dandellion 3 hours ago | parent | next [-]

Your vibes are definitely better than his vibes.

dymk 3 hours ago | parent [-]

What about all the benchmarks that show improvements in each generation?

digitaltrees 3 hours ago | parent [-]

Many of the improvements are the result of agentic loops and an emphasis on autonomy. Some of us don’t like that because the models go rogue and ignore design patterns, architecture, coding guidelines or other things that are important.

My friends and colleagues that like the agentic autonomy don’t care about the code. They feel like if it works, it works, and if an AI system is the only intelligence able to understand it, that is ok.

I still want to be in the loop. They don’t.

8note 2 hours ago | parent | next [-]

the more agentic focused the better though?

sonnet 5 is very noticeably a much better model than any opus that i've touched

it actually does the things i want it to, and uses tools and triggers skills appropriately, vs trying to make stuff up

bakies 2 hours ago | parent | prev [-]

Agentic coding should absolutely care about all the things you listed.

sudosteph 3 hours ago | parent | prev | next [-]

4.6 was the last model that let you disable adaptive thinking and set max thinking token budget. I liked having that available, and still use it sometimes.

dijit 3 hours ago | parent | prev | next [-]

Bro, it's all vibes.

Models get dumber during the day and smarter during the night, I swear.

But I'm not willing to scientifically verify this, so I'm just going to go off of vibes, just like everyone seems to be doing with projects.

human305893 an hour ago | parent | next [-]

In my case it's that I'm tired and more likely to miss issues or mistakes. My idea of good enough is at a much lower level when it's 10pm and I'm about to knock off and go to bed in an hour.

rplnt 3 hours ago | parent | prev [-]

These vibes are pretty obvious even with casual use. Weekends are so much better.

solenoid0937 3 hours ago | parent | prev [-]

4.8 is much better than either of them as well.