| ▲ | simplyluke 3 hours ago |
| I think most users of websites like reddit, x, and yes even HN don't realize how much traffic is inorganic. Marketing firms, government agencies, and many other interested parties with money to burn are absolutely aware that you search "best {product} reddit" I've commented on this before, but I strongly suspect much of the narrative around AI is being formed with strong inputs from these patterns. What's your basis for thinking that codex is best for planning, but opus is best for implementing? Is it based on extensive experimentation and first hand experience in a non-deterministic environment, or is it that you saw a large number of people on HN and X say that? Why was the dominant narrative on cursor coming within spitting distance of opus with a MUCH smaller team and less capital "LOL THEY USED KIMI!!" instead of "wow, open source models + a bit of RLHF training and some clever context management got within spitting distance of the industry giant and way cheaper"? The latter sentiment is a whole lot more damaging for a company eyeing an IPO with existing investors with very deep pockets. |
|
| ▲ | Aurornis an hour ago | parent | next [-] |
| > Why was the dominant narrative on cursor coming within spitting distance of opus with a MUCH smaller team and less capital "LOL THEY USED KIMI!!" instead of "wow, open source models + a bit of RLHF training and some clever context management got within spitting distance of the industry giant and way cheaper"? The latter sentiment is a whole lot more damaging for a company eyeing an IPO with existing investors with very deep pockets. This comment is interesting because you took a narrative that was being pushed and marketed (Cursor was close to Opus) and accepted it as the ground truth. The dominant narrative I saw around that, at least in my bubbles, was disappointment when they actually tried it and discovered it was not, in fact, close to Opus. |
| |
| ▲ | simplyluke 26 minutes ago | parent [-] | | Those were both narratives going around, but one was clearly winning in terms of volume, and that's what I'm speaking to here. The dismissal towards the open source models has always smelled more like marketing campaigns to me than the actual sentiments of any hackers I know. We all want the open source options to close the gap, but the labs are definitely staying ahead. My own experience was relatively similar, good, but with a notable gap that went beyond cherrypicked benchmarks. |
|
|
| ▲ | codezero 3 hours ago | parent | prev | next [-] |
| At a previous company our marketing team had a $50k/mo budget with an agency that got their basically verbatim posts posted by all the tech blogs like TechCrunch, venture beat, Huffington Post etc. I got really aware of the tech media and I read every story as intentional marketing. |
| |
| ▲ | genewitch 2 hours ago | parent [-] | | don't fall into the Gell-Mann Amnesia trap. Any media that has advertisements is already not in your interest. If a media has to weigh losing an advertiser or telling the truth, very few would choose truth. Scruples don't put food on the table, believe me, i know. This means that marketing budgets run everything, from the morning news talk to the evening nightly news, and everything between, is carefully crafted to keep you watching those commercials. On the internet, everything is trying to filter you into conversions or purchases, or steal your identity and cut out the middleman. PBS and NPR like to say they're advertiser free but they aren't, they just call it "underwriting", and it entails the same wariness over bucking the advertiser's wishes. sorry, underwriters wishes. edit to add a solution the solution is value for value. You publish, if people like your stuff, you tell them to contribute time, talent, or treasure to your product, be it a youtube channel, a podcast, or even an e-zine (remember those...) | | |
| ▲ | stavros 42 minutes ago | parent [-] | | Yep, banning advertising would be great here. You can't accept money unless it's for buying your product, and that's it. Then again, I'm sure some loopholes would be found. |
|
|
|
| ▲ | kylecazar 3 hours ago | parent | prev | next [-] |
| A good reason to find specific individuals with relevant knowledge and follow their writing directly. Think simonw and his pelicans... but there are lesser known trustworthy voices as well. It just takes some time to find them for a given area of interest. Also bring back blogrolls. |
| |
| ▲ | autoexec 2 hours ago | parent | next [-] | | > A good reason to find specific individuals with relevant knowledge and follow their writing directly. As soon as they get popular enough they'll be approached with offers to shill in exchange for huge piles of money. That's the entire point of "influencers". Trusted people being turned into secret advertisers and billboards. | | |
| ▲ | geerlingguy an hour ago | parent | next [-] | | Not all are swayed. The hard thing is finding which ones are, and which ones aren't. I rely on a web of trust. When I see another new hot AI trend, I check it against whether any of the people I've followed via RSS or manually curated on Twitter, Mastodon, etc (many of whom I met IRL) have said anything about it. There's still a an undercurrent of people blogging and posting and chatting who are trustworthy and haven't sold their soul to marketing. Or at least are clear when they say things that are marketing. But it is ever harder to find those voices, especially if you're new to an industry. | | |
| ▲ | cogman10 21 minutes ago | parent [-] | | It's hard to express, but it seems the best way to sus-out who is a shill and who's authentic is by comparing across reviews for a product. It's almost a bit like AI speak. The shills will all have very similar sounding content. They'll all hit on the same (ad copy) points. They might mix in a few negative tidbits, but generally speaking you'll catch them all praising the same wizbang features. Mkbhd is my favorite baseline shill. He practically just reads the product sheet. You know if he says it, it was probably given to him by the person paying for the review and, indeed, you can find the points he brings up echoed in other people's reviews. On the flip side, I generally trust Gamers Nexus to not shill. Primarily because their lack of playing ball has actually hurt their access. I've enjoyed your videos as well. They don't come off as a shill particularly because there's a number of products where the negative points you've put out have been strong enough to actually discourage a purchase. They haven't been weak "The colors could pop more". | | |
| ▲ | sph 2 minutes ago | parent [-] | | > It's hard to express, but it seems the best way to sus-out who is a shill and who's authentic is by comparing across reviews for a product. Brandolini’s law strikes again: you really have to pay attention to catch a shill. 99% of the time when you’re not paying attention and intentionally shopping for a particular product is when they get you. |
|
| |
| ▲ | 2 hours ago | parent | prev | next [-] | | [deleted] | |
| ▲ | boxedemp 26 minutes ago | parent | prev [-] | | Unless you're RMS |
| |
| ▲ | SyneRyder 3 hours ago | parent | prev [-] | | I agree, but I also think the point about "Is [your opinion] based on extensive experimentation and first hand experience" is really important. Relying on other bloggers is still delegating your thinking to others. Having your own objective measures and your own direct experience is useful, and sometimes it might contradict the prevailing wisdom. |
|
|
| ▲ | SkyPuncher 23 minutes ago | parent | prev | next [-] |
| > What's your basis for thinking that codex is best for planning, but opus is best for implementing? I for one work on an agentic product where we use all 3 of the major frontier models. The models absolutely have preferences and "personality" that lead to different characteristics. In my eyes: * Gemini - consistently the best at pure reasoning and tunability. Flash models are particularly good at latency sensitive small-scale reasoning. The tradeoff is they struggle with some basic behavior, like tool calling. * Claude - consistently good at long standing sessions. Opus may or may not be the best model, but it was the first model that crossed the "holy shit" threshold. I understand it's quirks/nuances and it's consistently solid. It's the best for me because I've learn how to be incredibly effective with it. * ChatGPT - Probably really good, but probably not worth switching from Claude. Last time I used their frontier model, it was a bit random. It would have moments of brilliance immediately followed by falling flat on it's face. |
|
| ▲ | emmelaich 3 hours ago | parent | prev | next [-] |
| https://paulgraham.com/submarine.html |
| |
| ▲ | Chaosvex 2 hours ago | parent [-] | | I like how that article claims PR firms don't lie and then proceeds to discuss how their best PR campaign was effectively a lie. > We estimated, based on some fairly informal math, that there were about 5000 stores on the Web. We got one paper to print this number, which seemed neutral enough. But once this "fact" was out there in print, we could quote it to other publications, and claim that with 1000 users we had 20% of the online store market. | | |
| ▲ | lmm 2 hours ago | parent [-] | | It sounds like they did good-faith estimate that there were 5000 stores out there and really believed they had 20% of the market? I wouldn't call that a lie as such. | | |
| ▲ | Chaosvex an hour ago | parent | next [-] | | To add to what the other poster said, it's a logical leap to go from 5000 stores to 20% share based on having 1000 users. What does the number of stores have to do with the number of users? It doesn't make any sense and that's because it's a lie. | |
| ▲ | GolfPopper an hour ago | parent | prev | next [-] | | They made up a number, and then quoted that number to other people (presumably with the intent to benefit themselves) without disclosing that they'd made up the number in the first place. That seems to jump right past 'lie' into 'fraud' or worse. I have this growing belief that what's wrong with America is that we've tossed a great deal of virtue (both personal and public) into the woodchipper, using a lot of euphemisms like "marketing" or "puffery". And the rot is not in any way confined to marketing - it's just that marketing is a very obvious example of it. The rot has made its way into education, relationships, entertainment, governance, infrastructure, what used to be called 'news', and on and on. We collectively gaslight ourselves to avoid dealing with the reality that we're constantly defecating in our own minds, contaminating ourselves with patterns of thought and action that are antithetical to our own continued well-being as individuals and collectives. To borrow a word from Pierre Teilhard de Chardin, we are poisoning the noosphere. | |
| ▲ | stavros an hour ago | parent | prev [-] | | Yeah but the "good faith" math had a big margin of error, and if I estimate 5k-20k shops and pick the lower number that just happens to make my company look great, that kind of changes things. |
|
|
|
|
| ▲ | raincole 3 hours ago | parent | prev | next [-] |
| > Why was the dominant narrative on cursor coming within spitting distance of opus with a MUCH smaller team and less capital And how do we know that? How do we know Cursor is "withing spitting distance of opus" (whatever it means)? Let me guess: > that you saw a large number of people on HN and X say that |
| |
| ▲ | gfody 2 hours ago | parent [-] | | I'm pretty sure this exact concern was the impetus for slashdot's friend:foe system, HN should implement something |
|
|
| ▲ | 2 hours ago | parent | prev | next [-] |
| [deleted] |
|
| ▲ | absoluteunit1 an hour ago | parent | prev | next [-] |
| > I think most users of websites like reddit, x, and yes even HN don't realize how much traffic is inorganic. Came here to say this - I have always been extremely cautious and assumed most things online were just marketing tactics. But I never realize how far and how strategic some of these campaigns are. I’ve recently started really getting my hands dirty with marketing for an app I’m building and the things I’ve learned in the past year have made me questions many of my views on things. At some point you realize that it’s all marketing or some form of effort to exert influence. A good book somewhat related to this is Attention Merchants |
| |
|
| ▲ | mumbisChungo 3 hours ago | parent | prev | next [-] |
| this is why oldschool chat > social media curating for trust and expertise and diversity of opinion |
| |
|
| ▲ | majormajor 2 hours ago | parent | prev | next [-] |
| Remember Quibi? All the money in the world can't actually turn a turd into a market leader. If you have a good product you have to play the marketing game to avoid getting left behind. If you have a bad product you try to play it and you still don't get picked up. (This last bit is where things usually turn into an argument about "no, obviously [this thing I don't like] is bad and is only popular because of the marketing", which assumes taste is more universal than it is.) |
| |
| ▲ | ericjmorey an hour ago | parent [-] | | Quibi doesn't seem like a good example. It wasn't marketed as the next big thing. It was a trial balloon that popped. |
|
|
| ▲ | apsurd 3 hours ago | parent | prev | next [-] |
| related: Cursor composer line of models is so good relative to cost. "auto" served me just fine until they recommended Composer and I've been continually happy with it. Then Claude Code with Opus dropped and everyone went bananas and I gotta say I just assumed I'm too casual to know how bad Cursor has been? But then I think maybe not really? Granted, I'm not orchestrating 100 Agents doing overnight work. But relating this to your point, if the CC-camp + HN hadn't proclaimed otherwise, I would have no idea what breakthrough CC+Opus made. (Cursor was first with plan mode right?) |
| |
| ▲ | operatingthetan 3 hours ago | parent | next [-] | | I don't think Cursor was _that bad_ in it's time. But the 'psyop' here is that anyone is using an AI-IDE going forward at all. I see people who say they are still using them and are so excited, but then I talk to engineers I actually know and it's all CLI tools. | | |
| ▲ | gen220 24 minutes ago | parent | next [-] | | FWIW, cursor (company) has a CLI tool/harness similar to Claude Code called agent. It’s existed for a long time, is quite good, and it is under-marketed (ironic for this thread). (Double-ironic disclosure… I work for Cursor. If you have ideas to make agent better hmu) | |
| ▲ | majormajor 2 hours ago | parent | prev | next [-] | | > I don't think Cursor was _that bad_ in it's time. But the 'psyop' here is that anyone is using an AI-IDE going forward at all. I see people who say they are still using them and are so excited, but then I talk to engineers I actually know and it's all CLI tools. This is just the old "surely nobody actually likes Lady Gaga, all the people I actually know think her stuff sucks, it's just all bought and paid for" reasoning trap all over again... | | |
| ▲ | operatingthetan 2 hours ago | parent [-] | | You couldn't even keep your analogy straight. I didn't say the people I know said anything at all about Cursor. If someone is clear about offering an anecdote, it's dishonest to pretend as if they were making a real and reasoned argument. |
| |
| ▲ | 2 hours ago | parent | prev [-] | | [deleted] |
| |
| ▲ | stavros 21 minutes ago | parent | prev [-] | | I haven't used Cursor much lately, but with Opus/Codex I can program with very few bugs without having to look at any code at all, over months of working on the same codebase. I don't think any other model can do that, no? |
|
|
| ▲ | cucumber3732842 2 hours ago | parent | prev [-] |
| >I've commented on this before, but I strongly suspect much of the narrative around AI is being formed with strong inputs from these patterns. "The AI talks down to me like Reddit because it's trained on Reddit" has been a running joke/quip/gripe on the "less refined" parts of the internet for awhile now. |