I've done this with Cursor because I have similar issues with inconsistent allowance consumption there. I mostly use Claude models but I've had to disable Opus 4.6 because it just EATS tokens in it's thinking steps.