I'm trying to wrap my mind around this. Anything you explore and share is awesome. Thanks for the blog post.
If you want to test it across coding tasks, have a look at https://github.com/adam-s/testing-claude-agent