▲ | vlovich123 4 days ago | |||||||
The value of figuring out how to make their LLM serving deterministic might help them track this down. There was a recent paper about how the received wisdom that kept assigning it to floating point associativity actually overlooked the real reasons for non-determinism [1]. [1] https://thinkingmachines.ai/blog/defeating-nondeterminism-in... | ||||||||
▲ | ants_everywhere 4 days ago | parent | next [-] | |||||||
network traffic and machine load aren't deterministic. I think for the near term, getting full determinism (e.g. for auditing) is going to only be feasible for batch jobs that are not cost sensitive. A google search isn't deterministic. Neither is loading upvote count on social media. It's common advice in distributed systems to have a graceful degradation state instead of becoming unavailable. That wouldn't be possible in a system that's completely deterministic. | ||||||||
| ||||||||
▲ | mmaunder 4 days ago | parent | prev [-] | |||||||
It has a big impact on performance to do determinism. Which leaves using another model to essentially IQ test their models with reporting and alerting. |