horsawlarway 4 hours ago
Possibly, but we've also seen that spending more tokens on a task can improve the quality of the output (reasoning, CoT, etc.). So it's not impossible for things that seem orthogonal, like generation speed or context length, to have an impact on the quality of the result.