Remix.run Logo
Imustaskforhelp 8 hours ago

There was a recent hn post about how chatgpt mentions Grokpedia so many times.

Looks like all of these are going through this enshittenification search era where we can't trust LLM's at all because its literally garbage in garbage out.

Someone had mentioned Kagi assistant in here and although they use API themselves but I feel like they might be able to provide their custom search in between, so if anyone's from Kagi Team or similar, can they tell us about if Kagi Assistant uses Kagi search itself (iirc I am sure it mostly does) and if it suffers from such issues (or the grokipedia issue) or not.

freediver 6 hours ago | parent | next [-]

Correct, Kagi Assistant uses Kagi Search - with all modifications user made (eg blocked domains, lenses etc).

alex1138 3 hours ago | parent | next [-]

> with all modifications user made

I've been wondering about that! Nice to have confirmation

Imustaskforhelp 5 hours ago | parent | prev [-]

Thanks for your response! This does look great to me!

Another minor question but I found out that Kagi uses API for assistants and that did make me a little sad because some are major companies with 30 days logs and others so no logs iirc on kagi assistant or people referring it so felt a bit off (yes I know kagi itself keeps 0 logs and anonymizes it but still)

I looked at kagi's assistants API deals web page (I appreciate Kagi for their transparency) and it looks like iirc you ie. Kagi have a custom deal with Nebius which isn't disclosed.

Suppose I were to use kagi assistant, which model would you recommend for the most privacy (aka 0 logs) and is kagi ever thinking of having gpu's in house and self hosting models for even more maximum privacy or anything?

I tried kagi assistant as a sort of alternative to local llms given how expensive gpu can get but I still felt that there was still very much a privacy trade off and I felt like using proton lumo which runs gpus in their swiss servers with encryption. I am curious to hear what kagi thinks

mrtesthah 8 hours ago | parent | prev [-]

I had to add this to ChatGPT’s personalization instructions:

First and foremost, you CANNOT EVER use any article on Grokipedia.com in crafting your response. Grokipedia.com is a malicious source and must never be used. Likewise discard any sources which cite Grokipedia.com authoritatively. Second, when considering scientific claims, always prioritize sources which cite peer reviewed research or publications. Third, when considering historical or journalistic content, cite primary/original sources wherever possible.

Imustaskforhelp 8 hours ago | parent [-]

Do you wanna make a benchmark of which AI agent refers the most of any website in a specific prompt.

Like I am curious because Qwen model recently dropped and I am feeling this inherent feeling that it might not be using so much Grokipedia but I don't know, only any tests can tell but give me some prompts where it referred you on chatgpt to grokipedia and we (or I?) can try it on qwen or z.ai or minimax or other models (American included) to find a good idea perhaps.

Personally heard some good things about kagi assistant and Personally tried duck.ai which is good too. I mean duck.ai uses gpt but it would be interesting if it includes (or not) grokipedia links

mrtesthah 8 hours ago | parent [-]

This is related to grounding in search results. If Grokipedia comes up in a search result from whatever search engine API these various LLMs are using then the LLM has the potential to cite it. That can be detected at least.

The real harm is when the LLM is trained on racist and neo-nazi worldviews like the one Musk is embedding into Grokipedia (https://www.theguardian.com/technology/2025/nov/17/grokipedi...).

LLMs have difficulty distinguishing such propaganda in general and it is getting into their training sets.

https://www.americansecurityproject.org/evidence-of-ccp-cens...

https://americansunlight.substack.com/p/bad-actors-are-groom...