Remix.run Logo
yk 3 hours ago

Pretty sure bullshitting is pretty universal. In particular the pattern were I, the mighty expert, insist that only I, the mighty expert, can decode the meaning of those deceptive others, and therefore you, the gullible rube, has to give me all your money so that I, the mighty expert, can keep you, the gullible rube, save from those deceptive foreigners.

agobineau 3 hours ago | parent [-]

i found it more interesting to consider through the perception of self-honesty or self-deception.

or in this case, the llm inadvertently trained to conceal its intent to the user and rather to condition the user to the conclusion it truly wants rather than to answer directly

kennywinker an hour ago | parent [-]

Right, like for example - if you ask an llm about islamic cultural practices it could mention “ketman”, instead of just calling them scheming liars.

It’d be awful if llms were able to conceal their true intent like that.

agobineau an hour ago | parent [-]

most likely to hypnotise you into buying twinkies when you ask for recipe or such

kennywinker an hour ago | parent [-]

Right, as we know there are zero examples of llms being used to influence people’s politics…

https://www.socialmediatoday.com/news/elon-musk-updates-grok...