| ▲ | knbknb 2 days ago | |
Does "major number release" mean that it is actually an order of magnitude more compute effort that went into creating this model? Or is this fundamentally a different model architecture, or a completely new tech stack on top of which this model was created (and the computing effort was actually less than before, in the v3 major relase? | ||