| ▲ | kovek 3 hours ago | |
Does thinking about how to offload matter? | ||
| ▲ | locknitpicker 2 hours ago | parent [-] | |
A discussion on how to avoid paying the price of running an expensive model is not about the expensive model. You can triage things running a cheap model with Ollama. Heck, throw in gpt4.1 which is free. | ||