| ▲ | ls612 2 hours ago | |
The main reason the Chinese labs are releasing models as open weights is because they don't have the compute necessary to provide all of the inference. For the US frontier models something like 80-90% of the lifetime compute required for the model is inference rather than training. China wants to shepherd as much of their limited compute as possible towards training to keep up in the race. | ||
| ▲ | londons_explore an hour ago | parent [-] | |
With nearly everyone using inference accelerators, the pool of hardware is no longer shared between training and use. | ||