| ▲ | rabf 2 days ago | |||||||||||||||||||||||||||||||
Positive reinforcement works better that negative reinforcement. If you the read prompt guidance from the companies themselves in their developer documentation it often makes this point. It is more effective to tell them what to do rather than what not to do. | ||||||||||||||||||||||||||||||||
| ▲ | sally_glance 2 days ago | parent | next [-] | |||||||||||||||||||||||||||||||
This matches my experience. You mostly want to not even mention negative things because if you write something like "don't duplicate existing functionality" you now have "duplicate" in the context... What works for me is having a second agent or session to review the changes with the reversed constraint, i.e. "check if any of these changes duplicate existing functionality". Not ideal because now everything needs multiple steps or subagents, but I have a hunch that this is one of the deeper technical limitations of current LLM architecture. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | nomel 2 days ago | parent | prev [-] | |||||||||||||||||||||||||||||||
Could you describe what this looks like in practice? Say I don't want it to use a certain concept or function. What would "positive reinforcement" look like to exclude something? | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||