| ▲ | mg 6 hours ago | |||||||
It is the other way round. In an interactive session, adding "Fine, but make the button red" after the model generated a first solution more than doubles the tokens used. As the model now not only gets the original code and the feature request but also the updated code plus the change request as input tokens. Sending a feature request to an LLM and then sending the feature request again with "The button shall be red" only doubles the tokens used. | ||||||||
| ▲ | jgilias 5 hours ago | parent | next [-] | |||||||
The cost is far from linear though. Because of prompt caching and the fact that generally output tokens are a lot more expensive than input tokens. | ||||||||
| ||||||||
| ▲ | ryan_glass 5 hours ago | parent | prev | next [-] | |||||||
"Make the button red" probably doesn't need an LLM at all. | ||||||||
| ||||||||
| ▲ | Chirono 5 hours ago | parent | prev [-] | |||||||
That’s usually not true due to caching. It may be true if you leave a large gap in between, but if you send “make it red” right after, then it’s purely incremental | ||||||||