| ▲ | nsingh2 3 hours ago | |
It's going to be expensive to serve (and it isn't generally available), considering they said it's the largest model they've ever trained. I suspect it's going to be used to train/distill lighter models. The exciting part for me is the improvement in those lighter models. | ||
| ▲ | azan_ an hour ago | |
What's interesting is that scaling appears to continue to pay off. Gwern was right - as always. | ||
| ▲ | AstroBen 2 hours ago | |
It seems inevitable that costs will come down over time. Expensive models today will be cheap models in a few years. | ||