| ▲ | sbinnee 2 days ago | |
Price is appealing to me. I have been using gemini 3 flash mainly for chat. I may give it a try. input: $0.14/$0.28 (whereas gemini $0.5/$3) Does anyone know why output prices have such a big gap? | ||
| ▲ | girvo 2 days ago | parent | next [-] | |
Output is what the compute is used for above all else; costs more hardware time basically than prompt processing (input) which is a lot faster | ||
| ▲ | tokenmaxxinej 2 days ago | parent | prev [-] | |
input tokens are processed at 10-50 times the speed of output tokens since you can process then in batches and not one at a time like output tokens | ||