zozbot234 5 hours ago
> Mythos is a much bigger pre train

Do we have data to substantiate that claim?
solenoid0937 5 hours ago
It's pretty common knowledge. Spud is the only other PT comparable to Mythos. Both Spud and Mythos can also scale via inference-time compute. Meta simply did not have enough compute online, long enough ago, to have a similar PT.