Do you ever replace ChatGPT models with cheaper, distilled, quantized, etc ones to save cost?
He literally said no to this in his GP post