Remix.run Logo
nsingh2 3 hours ago

It's going to be expensive to serve (also not generally available), considering they said it's the largest model they've ever trained.

I suspect it's going to be used to train/distill lighter models. The exciting part for me is the improvement in those lighter models.

azan_ an hour ago | parent | next [-]

What's interesting is that scaling appears to continue to pay off. Gwern was right - as always.

AstroBen 2 hours ago | parent | prev [-]

It seems inevitable that costs will come down over time. Expensive models today will be cheap models in a few years.