▲ | iskhare 7 days ago | |||||||
You say your rubric approach is “better than llm as a judge.” Can you please elaborate on what makes you say that? | ||||||||
▲ | AbhinavX 7 days ago | parent [-] | |||||||
LLM as a judge for agent usually has context overload and even if you have a really good prompt for your evaluation, LLMs hallucinate because there is just too much information to ingest. So we created an agentic pipeline to basically do evaluations on rubrics which have better results and dont miss intricacies due to the overloaded context. | ||||||||
|