zozbot234 · 10 hours ago:
I don't think anyone knows for sure how much mileage/scalability LLMs have left. Given what we do know, I suspect that if you can afford to spend more compute on even longer training runs, you can still get results much better than SOTA, even in "simple" domains like text/language.
airstrike · 9 hours ago:
I think we're pretty much out of "spend more compute on even longer training runs" at this point.