ossa-ma 3 hours ago

These tropes emerge from the distribution of the LLM itself, and in my experiments it's actually very difficult to get an LLM to change its language, especially when you consider they've been RLHFed to the max to speak the way they do.

vidarh 3 hours ago

Changing the style is easy: Just feed it a writing sample, and tell it to review its own writing against the style of the writing sample.

That won't entirely weed out these tropes, but it will massively change the style.

Then add a few specific rules and make it review its writing, instead of expecting it to get it right while writing.

Weeding out the tropes is largely a question of enforcing good writing through rules.
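
A rough sketch of that draft-then-review loop, in case it helps. Everything here is an assumption for illustration: `generate` is a hypothetical callable wrapping whatever model you use, and the entries in `RULES` are made-up example patterns, not a vetted list.

```python
import re
from typing import Callable, Dict, List

# Illustrative rules only: the names and patterns are assumptions.
RULES: Dict[str, re.Pattern] = {
    "not-just-x-but-y": re.compile(r"\bnot (?:just|only)\b[^.!?]*\bbut\b", re.IGNORECASE),
    "em-dash": re.compile("\u2014"),
    "delve": re.compile(r"\bdelv(?:e|es|ing)\b", re.IGNORECASE),
}


def find_violations(text: str) -> List[str]:
    """Return the names of all rules whose pattern occurs in the text."""
    return [name for name, pattern in RULES.items() if pattern.search(text)]


def build_review_prompt(draft: str, sample: str, violations: List[str]) -> str:
    """Ask the model to review its own draft against a writing sample."""
    lines = [
        "Review the draft below against the style of the writing sample.",
        "Rewrite the draft so it matches the sample's style.",
    ]
    if violations:
        lines.append("Also fix these rule violations: " + ", ".join(violations))
    lines += ["", "WRITING SAMPLE:", sample, "", "DRAFT:", draft]
    return "\n".join(lines)


def draft_then_review(
    task: str,
    sample: str,
    generate: Callable[[str], str],  # hypothetical wrapper around your model
    max_rounds: int = 3,
) -> str:
    """Draft first, then run review passes until no rule fires or rounds run out."""
    draft = generate(task)
    for _ in range(max_rounds):
        violations = find_violations(draft)
        draft = generate(build_review_prompt(draft, sample, violations))
        if not find_violations(draft):
            break
    return draft
```

The point of the structure is that rules are checked after writing, in a separate pass, rather than stuffed into the original prompt. Simple patterns like these will also flag plenty of ordinary human writing, so treat the hits as prompts for revision, not as proof of anything.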

A whole lot of the tropes are present because a lot of people write that way. RLHF etc. may have amplified them, but if so, that's because people judged those responses to be better. After all, that is what RLHF is.

vidarh 2 hours ago

Just as long as you're aware you'll get a shitload of false positives. E.g. see: https://news.ycombinator.com/item?id=47135703

fooker 3 hours ago

I just gave it a try, and all the state-of-the-art models successfully avoided the tropes when told to.