| ▲ | AstroBen 2 hours ago | |
Our DNA does contain our pre-training, though. It's not true that we're an entirely blank slate. | ||
| ▲ | davebren 2 hours ago | |
Pre-training is not a good term if you are trying to compare it to LLM pre-training. A closer analogy would be the model's architecture and learning algorithms, which have been designed through decades of PhD research, and my point there is that the differences are still much greater than the similarities. | ||