| ▲ | jamesblonde 3 hours ago | |
I got turned off in the first paragraph with the misuse of the term "back pressure". "back pressure" is a term from data engineering to specifically indicate a feedback signal that indicates a service is overloaded and that clients should adapt their behavior. Backpressure != feedback (the more general term). And in the agentic world, we use the term 'context' to describe information used to help LLMs make decisions, where the context data is not part of the LLM's training data. Then, we have verifiable tasks (what he is really talking about), where RL is used in post-training in a harness environment to use feedback signals to learn about type systems, programming language syntax/semantics, etc. | ||
| ▲ | patates 3 hours ago | parent | next [-] | |
We all live in our own various small circles, in which many terms get misused. Isomorphic in front end circle means something completely different than any other use, for example. This is how languages evolve. I'm not trying to discount any attempt to correct people, especially when it gets confusing (like here, I was also confused honestly), but we could formulate it nicer IMHO. | ||
| ▲ | paradite 3 hours ago | parent | prev | next [-] | |
If you want to pedantic: Context is also a misnomer, where in fact it's just a part of prompt. Prompt itself is also a misnomer, where in fact it's just part of model input. Model input is also a misnomer, in fact it's just first input token + prefill for model output to generate more output. Harness is also a misnomer, where it's just scaffold / tools around the model input/output. | ||
| ▲ | ghuntley 2 hours ago | parent | prev | next [-] | |
the back pressure terminology comes from me. essentially it’s the wheel - you need to add backpressure to the agentic flywheel. see https://ghuntley.com/pressure i have the pleasure to work with moss and he came up with a way to explain what is in my head with ease. | ||
| ▲ | SubiculumCode 3 hours ago | parent | prev | next [-] | |
It is perhaps more generally known in the plumbing sense of pressure causing resistance to the desired direction of flow, but yeah, a poor word choice...at least it isn't AI written though. | ||
| ▲ | oofbey 3 hours ago | parent | prev | next [-] | |
Ironic that you nitpick the author’s word choice of “back pressure” and then completely misuse the term RL in your complaint. | ||
| ▲ | atoav 3 hours ago | parent | prev [-] | |
Well it does sound technical. | ||