I’m considering trying the API directly for a bit with Claude Code to compare, but I need a test suite first so I can compare all 3.