this prevents you from using any commercial model then, because commercial models need to hide thoughts to prevent distillation