multi-model arbitration, synthesis, parallel reasoning etc. Judging large models with small models is quite effective.