kyboren 4 hours ago
Yes, but bigger models are still more capable. Models shrinking (iso-performance) just means that people will train and use more capable models with a longer context.
sipjca 2 hours ago
Of course they are! Both are important, and both will stick around and be used for different reasons.