| ▲ | toomuchtodo 6 hours ago | ||||||||||||||||
You decouple the workloads from human interaction (ie when you submit the job to the queue vs when it is scheduled to execute) so when they run is not a consideration, if possible. The economic incentives encourage solving this, and if it can’t be solved, it buckets customer cohort by willingness (or unwillingness) to pay for access during peak times. | |||||||||||||||||
| ▲ | stavros 6 hours ago | parent [-] | ||||||||||||||||
Sure, but if I ask the LLM a question, I'd like it to respond now, instead of tonight. | |||||||||||||||||
| |||||||||||||||||