Backprompting: Leveraging synthetic production data for health advice guardrails (arxiv.org)
27 points by PaulHoule 3 days ago | 1 comment
mentalgear 3 days ago
> We test our technique in one of the most difficult and nuanced guardrails: the identification of health advice in LLM output, and demonstrate improvement versus other solutions. Our detector is able to outperform GPT-4o by up to 3.73%, despite having 400x less parameters.