| ▲ | ankit219 9 hours ago | ||||||||||||||||
Like it or not, it's a fundraising strategy. They have followed it mutliple times (eg: vague posts about how much their inhouse model is writing code, online RL, and lines of code etc. earlier) and it was less vague before. They released a model and did not give us the exact benchmarks or even tell us the base model for the same. This is not to imply there is no substance behind it, but they are not as public about their findings as one would like them to be. Not a criticism, just an observation. | |||||||||||||||||
| ▲ | themafia 8 hours ago | parent | next [-] | ||||||||||||||||
I don't like it. It's lying in order to capture more market value than they're entitled to. The ends do not justify the means. This is a criticism. | |||||||||||||||||
| |||||||||||||||||
| ▲ | Jcampuzano2 8 hours ago | parent | prev | next [-] | ||||||||||||||||
Never releasing the benchmarks or being openly benched unlike literally every other model provider always irked me. I think they know they're on the backfoot at the moment. Cursor was hot news for a long time but now it seems terminal based agents are the hot commodity and I rarely see cursor mentioned. Sure they already have enterprise contracts signed but even at my company we're about to swap from a contract with cursor to Claude code because everyone wants to use that instead now - especially since it doesn't tie you to one editor. So I think they're really trying to get "something" out there that sticks and puts them in the limelight. Long context/sessions are one of the hot things especially with Ralph being the hot topic so this lines up with that. Also I know cursor has its own cli but I rarely see mention of it. | |||||||||||||||||
| ▲ | alfalfasprout 9 hours ago | parent | prev | next [-] | ||||||||||||||||
Unfortunately all the major LLM companies have realized the truth doesn't really matter anymore. We even saw this with the GPT-5 launch with obviously vibe coded + nebulous metrics. Diminishing returns are starting to really set in and companies are desperate for any illusion to the contrary. | |||||||||||||||||
| ▲ | PlatoIsADisease 6 hours ago | parent | prev [-] | ||||||||||||||||
I used to hate this, I've seen Apple do it with claims of security and privacy, I've seen populist demagogues do this with every proposal they make. Now I realize this is just the reality of the world. Its just a reminder not to trust, instead verify. Its more expensive, but trust only leads to pain. | |||||||||||||||||
| |||||||||||||||||