| ▲ | pjerem 5 hours ago | |||||||||||||
What gp wanted to say is that models are now so smart and useful that even if they managed to be EVEN MORE smart and useful, you wouldn't even notice it. Honestly, there is nothing in my head that Claude cannot handle. Maybe it can be more this or that but I can already barely exploit Opus 4.7. And I'm using DeepSeek 4 Pro for my personal use and while it's a little behind, it's not that far. I think the situation can be very dangerous for US AI companies because if current models are already capable of doing mostly anything, nobodoy will want to get to the next model, even if it's 10x better. OTOH, open source models like DeepSeek are doing mostly the same work for 1/10 of the price. Also the more I play with Pi, the more I think LLMs are already not kept back by their own capabilities but by the lack of agency we allow them to have. There is more value today in a capable harness for current LLMs than in a better LLM. | ||||||||||||||
| ▲ | suttontom 4 hours ago | parent | next [-] | |||||||||||||
Are you joking? Is there literally "nothing" you can imagine that Claude can't do? | ||||||||||||||
| ||||||||||||||
| ▲ | czl 2 hours ago | parent | prev | next [-] | |||||||||||||
> What gp wanted to say is that models are now so smart and useful that even if they managed to be EVEN MORE smart and useful, you wouldn't even notice it. If benchmarks across the board keep trending up and you still don't notice a difference, that's not evidence the model stopped improving. More likely your tasks aren't hard enough to expose the gains, or the model has passed the point where you're able to judge it. You can only tell a good answer from a great one up to your own ceiling. Once the model clears that, both look the same to you, and the extra capability is real whether or not you can see it. | ||||||||||||||
| ||||||||||||||
| ▲ | coldtea 3 hours ago | parent | prev | next [-] | |||||||||||||
>What gp wanted to say is that models are now so smart and useful that even if they managed to be EVEN MORE smart and useful, you wouldn't even notice it. I think what gp said was the improvements are incremental, and we haven't seen a big revolutionary change since 2-3 years, and the pace is slowing down. | ||||||||||||||
| ▲ | claytongulick 3 hours ago | parent | prev [-] | |||||||||||||
> Honestly, there is nothing in my head that Claude cannot handle. One idea is that maybe it could figure out how many L's are in the word "google" [1] Or, maybe which days of the week have a "d" in their spelling [2]. | ||||||||||||||
| ||||||||||||||