lostmsu 2 hours ago
Qwen recommends setting preserve_thinking: true for agentic/coding workloads.
rayboy1995 12 minutes ago
Thanks!! I had disabled that previously while debugging. I can confirm it is helping accuracy, from what I can tell so far. (And speed, since the cache is preserved more often!)