| ▲ | BoiledCabbage 6 hours ago | ||||||||||||||||||||||
What is this nonsense? An AI generated article about single ai run test which in theory had many components and the AI judge declared deepseek "won"? How many runs were there on each test to account for some temperature variance? Only one. Did deepseek write better code? Did GPT's code have bugs when doing the regex? The AI "news" article doesn't actually say that. It says that grok thought that GPT's approach could have bugs so it declared deep seek the winner. This is absolute worthless methodology. And barely measurable methodology - nothing more than a prompt. No definition of what the scoring approach actually is. No definition of what "precision" actually means in this context. This is absolutely worthless and has no business being in the site, forget about on the front page. So why is it's on the front page? Because it aligns with the current "feels" of the community that deepseek will get better and it shows "bad things" about the en vogue to dislike closed models. I happen to agree with both of the views, but this site is utterly worthless. If you want HN to be astro-turfed to the max, just up vote content like this without any critical reading of the. I mean the past 6 months of "here is my chat gpt blog post of how to use a coding agent" are 1000x better than this "news article". Seriously the amount of respect I've lost recently for the HN community is incredible. A bit harsh, but very true. Maybe it's generational thing, maybe it's due to the state of politics, maybe it's a side effect of me getting older, but recently online has turned into nothing but people explicitly (or implicitly) writing about their "team". Comments on this post are nothing but people who clearly see themselves as being on "team deepseek" or "team open models" or some similar variant writing posts in support even though this is probably one of the worst "articles" to make it to the front page on ages. It clearly doesn't matter. It supports something on their "team" so they support it via comments. If kills any form of intellectual discussion. It's all just "this is my team". | |||||||||||||||||||||||
| ▲ | sourcecodeplz 6 hours ago | parent [-] | ||||||||||||||||||||||
Have you even used deepseek pro/flash? Yes, it is astroturfed to the maxx. There is a reason for that. The performance/price ratio beats anything available today. | |||||||||||||||||||||||
| |||||||||||||||||||||||