mountainriver an hour ago

What is this comment? It’s an RL paper; these are standard RL terms.

greesil an hour ago

It's a comment. On Hacker News. Not the RL subreddit, or whatever. I'm just amazed at the jargon. I'm sure it's useful, but one could just call it model output.

antonvs 2 minutes ago

> one could just call it model output.

That would be incorrect. My other reply attempts to address this.