Remix.run Logo
LLMs do not merely reflect the bias of their training, they police it(twitter.com)
26 points by nailer 3 hours ago | 19 comments
jacques_morin an hour ago | parent | next [-]

brian roemmele is an authority in nothing, I don't understand why this was published here. This dude has literally no expertise : https://www.reddit.com/r/DecodingTheGurus/comments/1cumj6w/h...

ramon156 an hour ago | parent | next [-]

I'm a bit confused why the OP in that reddit post is so mean about the app. Seems fine. Not something I need, but not the worst choice of an app idea.

I did not care for the "X article" (is that what it's called?), but I don't get the rage that is in that reddit thread.

SirFatty an hour ago | parent | prev | next [-]

And you seem pretty upset about it. You should send him an email letting him know about your dissatisfaction!

N_Lens an hour ago | parent | prev | next [-]

Amusing read, thanks!

Since we are in the golden age of grifting, this guy will probably go pretty far.

an hour ago | parent | prev | next [-]
[deleted]
JohnKemeny an hour ago | parent | prev [-]

yeah, only authorities are considered here at HN.

KaiserPro an hour ago | parent [-]

> Brian Roemmele is the recognized world authority on how voice AI will impact computing and commerce.

I dunno, I think thats pretty convincing (http://voicefirst.expert/about/)

veltas an hour ago | parent | prev | next [-]

Much the same as 'arguments' I can have with LLM's about things where I'm the expert and I know it's wrong, but it will justify its position to the end because it's trained on common misconceptions that exist among less-expert people.

msandford an hour ago | parent [-]

The idea that's been floating around in my head for the last few years is something like "it's being trained by the data produced by people, it's going to have many human flaws as a result"

JohnKemeny an hour ago | parent | prev | next [-]

The paper under discussion:

https://zenodo.org/records/17720178

Note that Zenodo is a DOI-provider, not a (scientific) journal. Anyone can upload anything to Zenodo. It's less strict than arXiv.

Edit: The "paper" is written by one Hiroko Konishi, an independent researcher (she is a voice actress).

JohnKemeny 27 minutes ago | parent [-]

Follow-up. I in fact suspect that either she is a bot, or she is using an LLM to spew out papers. She uploads about one paper per month to Zenodo, and they all seem very AI generated.

KaiserPro an hour ago | parent | prev | next [-]

wait, is this news?

Of course they reflect the bias in the training, thats been known since the 90s if not longer (see apocryphal story about training to detect tanks, but only detecting either trees or clouds)

but like this is expected, the whole point of RLHF (or any other feedback) is to condition the model to respond in a certain way. Thats what makes them useable for a bunch of situations.

GL26 2 hours ago | parent | prev | next [-]

We are not yet at misalignment, but this shows the existence of a slope that derivates into misaligned adversarial ai models. Must this be fixed at training time (at which step ?) ? Thinking about this report : https://ai-2027.com/

rwmj 2 hours ago | parent | next [-]

That was a nonsensical work of fiction, not a report.

2 hours ago | parent | prev [-]
[deleted]
harrouet 2 hours ago | parent | prev | next [-]

This will be very useful to call out replicants, thx.

cucumber3732842 an hour ago | parent | prev | next [-]

Why wouldn't an LLM whose training content is dominated by, or at least severely clouded by, the contribution habitual rule follower/peddler/enforcer types go on to mimic that behavior?

You feed it reddit and wikipeidia it's gonna turn into a conformist npc.

You feed it the contents of professional content and it's gonna spew vapid corporate nothingness.

You feed every text message ever sent over Boost Mobile, actually wait that sounds hilarious someone should do that.

sdoering an hour ago | parent | prev | next [-]

[dead]

spacebacon 2 hours ago | parent | prev [-]

[dead]