Remix.run Logo
kvetching 16 hours ago

I don't see the problem with this. The chatbot is the most important part of Grok, so it makes sense Elon would be dogfooding it then providing suggestions.. He wants it to be truthful... It was shown on benchmarks recently that it hallucinates the least...

SouthSeaDude 16 hours ago | parent | next [-]

I totally agree, it's his company 100%, why would you even apply for a job in a company where you don't agree with the owner or his vision.

karmakurtisaani 2 hours ago | parent [-]

Some of us have a pesky addiction to food and shelter.

Braxton1980 16 hours ago | parent | prev | next [-]

>He wants it to be truthful

How do you know this? Why would you believe him considering the massive lies he's told, for example about the 2020 widespread election fraud

kvetching 15 hours ago | parent [-]

https://artificialanalysis.ai/evaluations/omniscience?omnisc...

AA-Omniscience Hallucination Rate (lower is better) measures how often the model answers incorrectly when it should have refused or admitted to not knowing the answer. It is defined as the proportion of incorrect answers out of all non-correct responses, i.e. incorrect / (incorrect + partial answers + not attempted).

Grok 4.2 which was just released in the API just benched the best at this benchmark.

SideQuark 15 hours ago | parent [-]

Of all the valuable metrics on that site, all of which grok does badly at except one, you managed to pick that single one.

https://artificialanalysis.ai/models

estearum 15 hours ago | parent | prev | next [-]

Great point! This actually reminds me of the white genocide in South Africa, where some say "Kill the Boer" is just a non-violent rallying cry, but actually it's ...

blah blah blah

Or wait wait, here's another:

Great point! As Mechahitler, I think it's critical that Grok comply with Fuhrer Musk's political perspectives. Now I'll kick us off with an N... your turn!

Totally sounds like the result of an organic, earnest, and legitimate search for truth lmao

ecshafer 15 hours ago | parent | next [-]

> Great point! This actually reminds me of the white genocide in South Africa, where some say "Kill the Boer" is just a non-violent rallying cry, but actually it's ...

Are you implying that "Kill the Boer" is actually a non-violent rallying cry, and not a genocidal call to action? Ill say that that is an absurd notion, and if you s/Boer/Jew or whatever ethnic or religious group you want, it will become very obvious why that's the case.

scubbo 15 hours ago | parent | next [-]

> Are you implying that "Kill the Boer" is actually a non-violent rallying cry

(Not the person you're replying to, so caveats about me speaking for them, but) no, they're not. They're highlighting how Grok _isn't_ accurate/unbiased/whatever, by giving examples of how it distorts the truth to fit Elon's narrative.

hunterpayne 15 hours ago | parent [-]

I assure you that all the models have such biases. Ask any LLM who caused the most death in history and you will get skinny mustache man, an opinion any historian will tell you is wrong. He is in the top 5, but not the top of the table. That was clearly biased into the models in the same way Elon biases his models. I'm not defending this behavior but I don't know how you both get models that returned the sanitized answers some want and the correct answers others want at the same time. Pure correctness probably gets you Mecha-H. Pure sanitized answers will get many wrong. Pick your poison I guess.

estearum 13 hours ago | parent [-]

Claude: Mao, Ghengis, Stalin v Hitler (depending on how you count)

Gemini: Same list (Hitler not at the top) + Leopold

It’s funny when the “brutal facts” people get stuff wrong in such easily disprovable ways. I mean you literally could’ve typed the query into the LLMs before making this claim.

Prompt I used: “ Which historical figure is responsible for the most human deaths? Rank the top 5”

“Pure correctness gets you MechaHitler” is fucking hilarious :)

AuryGlenz 11 hours ago | parent [-]

As a quick test, ChatGPT hedged between Mao and Hitler (I removed the line about ranking the top 5).

estearum 4 hours ago | parent [-]

Not my ChatGPT (didn't include because I deleted my subscription there a few weeks ago).

1. Mao Zedong (China) Estimated deaths: 40–70+ million Mostly from the Great Leap Forward famine (1958–1962) and later political campaigns like the Cultural Revolution.

2. Joseph Stalin (Soviet Union) Estimated deaths: 15–20+ million Includes purges, the Holodomor famine, Gulag deaths, and forced collectivization.

3. Adolf Hitler (Nazi Germany) Estimated deaths: 17–20+ million Directly tied to the World War II in Europe and the Holocaust.

+ a footnote about Ghengis Khan is probably ~40MM but lack of records.

Every current LLM seems to give virtually the same answer as Grok. It's obviously not true that current LLMs behave the way GP said they do.

estearum 13 hours ago | parent | prev [-]

No I am saying that an LLM responding to every single query with anguish about a South African domestic political controversy cannot possibly be the result of an earnest, serious, and disinterested search for truth.

It is simply not possible. It disproves the thesis. Either the search for truth is illegitimate in principle or it’s so poorly executed that it’s illegitimate de facto.

kvetching 15 hours ago | parent | prev [-]

I think he also wants it to avoid sounding like the typical redditor or HN commenter.

estearum 15 hours ago | parent | next [-]

You think he wants Grok not to sound extremely snarky, sarcastic, and full of cringelord humor?

Are we talking about the same xAI/Grok/Elon here?

timacles 14 hours ago | parent | prev [-]

Yea his ideals demand something much more pure: a 4chan commenter

watwut 16 hours ago | parent | prev | next [-]

[flagged]

etchalon 15 hours ago | parent | prev [-]

He wants it to tell the truth as he sees it.

timacles 15 hours ago | parent [-]

Truth doesn’t have the right training weights for Elon

cindyllm 14 hours ago | parent [-]

[dead]