▲ | jinusunil a day ago | |
How do you evaluate the output of your trace tool? Are some benchmarks for tracing tools? | ||
▲ | xinweihe 6 hours ago | parent [-] | |
Yep, we're working on a golden test set with known root causes to benchmark and track agent performance over time. It's taking a bit of work to get right, but we're on it and definitely open to contributions! |