| ▲ | ConceitedCode 10 hours ago |
| I suspect we'll address this by just going back to older ranking algorithms for search. We'll go back to the primary signal of good content being links from trusted sources. People gaming the content based algorithms will eventually cause their own downfall. |
|
| ▲ | iuvcaw 10 hours ago | parent | next [-] |
| Ironically this post is doing wonders for its page rank, as people are linking to it in the comments |
| |
| ▲ | dang 9 hours ago | parent | next [-] | | <a href="https://oneuptime.com/blog" rel="nofollow">https://oneuptime.com/blog</a>
https://news.ycombinator.com/item?id=47641348(By coincidence, see also https://news.ycombinator.com/item?id=47641829) | |
| ▲ | Retr0id 10 hours ago | parent | prev | next [-] | | Now that we have better ML, maybe we could take "link sentiment" into account too. | | |
| ▲ | oliveroot 9 hours ago | parent | next [-] | | I think they have something better - “link rank” which essentially takes into account the quality of backlink. I believe it is nuanced enough to have different rank per “topic”, or “keyword” etc. but admittedly just kinda guessing from the outside. The last time I tried to build something like this I realized it’s useless without first having a gigantic amount of data already crawled. When I started crawling I realized I would never catch Google. I think without Wikipedia the LLMs might have taken 10 more years to surpass them. | | |
| ▲ | cyanydeez 6 hours ago | parent [-] | | Crawlers would need to use backlinks but also rank vector similarity to ensure the linked content matches the linked intent. Some kind of rainbow shades of how relevent the link is to the linkee and reverse. |
| |
| ▲ | zahlman 9 hours ago | parent | prev [-] | | I don't know how good it was, but sentiment analysis was definitely a thing pre-ChatGPT. | | |
| ▲ | Retr0id 9 hours ago | parent [-] | | It was pretty basic though, and even a frontier LLM might struggle to infer that OP is a negative-sentiment link, without sufficient context. |
|
| |
| ▲ | Aurornis 8 hours ago | parent | prev | next [-] | | rel=nofollow is used to signal that links should not be used by search crawlers for authority calculations on most sites with user-submitted content, including Hacker News. You basically have to use nofollow for comments otherwise your site becomes a big target for SEO link spam. | |
| ▲ | politelemon 9 hours ago | parent | prev [-] | | I wonder if we ought to be flagging it then? There's already so much uninteresting AI slop observations. |
|
|
| ▲ | whstl 6 hours ago | parent | prev | next [-] |
| This has been the status quo for more than a decade. In the past SEO blogspam was done by cheap freelancers, and there were several agencies selling the service. Experts identify blogspam quite easily, but laypeople eat it up and use as reference in conversations and to make decisions. Google has known about it, has been in contact with such agencies and companies, and has been refusing to do anything about it for the longest time. |
|
| ▲ | vohk 9 hours ago | parent | prev | next [-] |
| I don't have a ton of hope just yet because I think it's still an incentives problem rather than a technical one. I got tired of the increasing AI slop in my YouTube Music feed and switched to Deezer a few months ago. Since then, not a single AI artist I've been able to spot. If a relatively marginal player like that can manage it, why can't Spotify or YTM? My suspicion is simply that Deezer actually actually tries. It's the same problem with Google and search. Kagi and others have demonstrated that you can produce better results with an infinitesimal fraction of the budget, and Google is still plenty competent where they care to be. This won't start to get fixed until they see a financial incentive to do so. |
| |
| ▲ | VladVladikoff 9 hours ago | parent | next [-] | | Maybe it’s that AI music isn’t being spammed as hard at ‘platform I’ve never heard of before’? | | | |
| ▲ | conception 9 hours ago | parent | prev | next [-] | | Spotify 100% rather buy/produce AI music than pay artists. Also they demonetized most of their artists so if they can pump AI songs that sound enough like what you listen to and then stop promoting them they don’t have to pay anyone. | |
| ▲ | cyanydeez 6 hours ago | parent | prev [-] | | Its not a technical problem. Its a public good we refuse to turn into a government service for nebulous reasons. |
|
|
| ▲ | bakugo 9 hours ago | parent | prev | next [-] |
| > I suspect we'll address this Who is "we"? Definitely not Google or any other major tech company, they're all actively encouraging this. > trusted sources. What trusted sources are there that haven't yet been taken over by AI? |
| |
| ▲ | 8 hours ago | parent | next [-] | | [deleted] | |
| ▲ | dvfjsdhgfv 8 hours ago | parent | prev [-] | | > Who is "we"? Definitely not Google or any other major tech company, they're all actively encouraging this. Google has been fighting aggressively to replace its search results with snippets, now generated by LLMs, to avoid sending traffic to other websites. If they continue, they will basically lead Google Search to a tipping point where a good competitor can take this market by storm. Microsoft also believed Windows is indestructible and now they have a rude awakening. | | |
| ▲ | onion2k 8 hours ago | parent [-] | | The fact is what people really want from a search engine is a single perfect result that answers their query exactly. An LLM does the 'single result' bit, but it's dubious whether or not it's a perfect answer. Most of the time that's probably not very important so long as the answer satisfies the search enough that the user is happy. Google is trying to turn Search into that product e.g. the single answer to a given search. They could do that now with Gemini, but the ads in the results are what makes them money, and the backlash to embedding adverts into the output of Gemini would drive millions of people to OpenAI overnight. They have to do it slowly. Give it 5 years though, and search engine results pages will be a thing of the past. | | |
| ▲ | dvfjsdhgfv 8 hours ago | parent [-] | | > Most of the time that's probably not very important Well... Maybe, but what's the point of an answer if you can't trust it? For ultra-fast answers for unimportant stuff I keep Cerebras tab open. |
|
|
|
|
| ▲ | ctoth 10 hours ago | parent | prev | next [-] |
| [dead] |
|
| ▲ | 9 hours ago | parent | prev [-] |
| [deleted] |