▲ | stereobit a day ago | |
DAG sounds interesting. Might help me to solve my biggest challenge with evals right now, which is testing subjective metrics e.g. “is this a good email” | ||
▲ | jeffreyip 15 hours ago | parent [-] | |
Do check it out, the early feedback has been great: https://docs.confident-ai.com/docs/metrics-dag |