Remix.run Logo
cjtrowbridge 11 hours ago

This brings up a point many will not be aware of. If you know the random seed and the prompt, and the hash of the model's binary file; the output is completely deterministic. You can use this information to check whether they are in fact swapping your requests out to cheaper models than what you're paying for. This level of auditability is a strong argument for using open-source, commodified models, because you can easily check if the vendor is ripping you off.

TZubiri 9 hours ago | parent [-]

Pretty sure this is wrong, requests are batched and size can affect the output, also gpus are highly parallel, there can be many race conditions.

TeMPOraL 2 hours ago | parent [-]

Yup. Floating point math turns race conditions into numerical errors, reintroducing non-determinism regardless of inputs used.