Interesting. So you mean, say an agent is automatically doing a PR review: how many of those agent calls are failing, how much time they're taking, etc.?
A lot of this you can do today with traces that capture AI-specific calls.
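For what it's worth, here is a rough sketch of what getting those numbers out of traces could look like once the spans are exported. Everything in it is made up for illustration: the `Span` shape, the `pr_review.agent_call` span name, and the `summarize` helper are assumptions, not any particular tracing tool's API.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Span:
    """Hypothetical, simplified view of one exported trace span."""
    name: str           # assumed span name, e.g. "pr_review.agent_call"
    duration_ms: float  # wall-clock duration of the call
    ok: bool            # False if the call ended in an error


def summarize(spans, name):
    """Count calls, failures, failure rate, and average latency for one span name."""
    calls = [s for s in spans if s.name == name]
    if not calls:
        return {"calls": 0, "failures": 0, "failure_rate": 0.0, "avg_ms": 0.0}
    failures = sum(1 for s in calls if not s.ok)
    return {
        "calls": len(calls),
        "failures": failures,
        "failure_rate": failures / len(calls),
        "avg_ms": mean(s.duration_ms for s in calls),
    }


# Made-up sample data standing in for spans pulled from a trace backend.
spans = [
    Span("pr_review.agent_call", 1200.0, True),
    Span("pr_review.agent_call", 3400.0, False),
    Span("pr_review.agent_call", 800.0, True),
    Span("other.call", 50.0, True),
]

summary = summarize(spans, "pr_review.agent_call")
print(summary)
```

In practice the tracing backend usually does this aggregation for you; the point is just that failure counts and latency fall out of the span data directly.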