colechristensen 3 hours ago
No, they're actually training weights based on the context before compaction. Context is context; this splits the model into persistent weights and malleable ones that are periodically updated.
delis-thumbs-7e 3 hours ago
Wouldn’t that be extremely computationally expensive, considering how resource-intensive training is?