malaya_zemlya 3 hours ago
You can download a base model (aka foundation, aka pretrain-only) from Hugging Face and test it out. These were produced without any RL. However, most modern LLMs, even base models, are not trained on raw internet text alone. Most of them were also fed a huge amount of synthetic data. You often can see the exact details in their model cards. As a result, if you sample from them, you will notice that they love to output text that looks like:
This is not your typical internet page.
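For anyone who wants to try the "download a base model and sample from it" step, here is a minimal sketch. It assumes the Hugging Face `transformers` library with PyTorch installed. The model name `gpt2` is only a placeholder picked because it is small; it is an older pretrain-only model, so it may not show the synthetic-data flavor described above. The helper name `sample_continuations` and its defaults are made up for illustration.

```python
def sample_continuations(prompt, model_name="gpt2", n=3,
                         max_new_tokens=60, temperature=1.0):
    """Return n sampled continuations of `prompt` from a causal LM.

    `model_name` is an assumption: swap in whichever base (pretrain-only)
    checkpoint you actually want to inspect.
    """
    # Imported lazily so this file can be loaded without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    model.eval()

    # No chat template: a base model simply continues the raw text.
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(
        **inputs,
        do_sample=True,                # sample instead of greedy decoding
        temperature=temperature,
        max_new_tokens=max_new_tokens,
        num_return_sequences=n,
        pad_token_id=tokenizer.eos_token_id,  # gpt2 has no pad token
    )
    # Each decoded string contains the prompt followed by its continuation.
    return tokenizer.batch_decode(outputs, skip_special_tokens=True)


if __name__ == "__main__":
    for i, text in enumerate(sample_continuations("The following is")):
        print(f"--- sample {i} ---")
        print(text)
```

Running it with a short, neutral prompt and reading a handful of samples is usually enough to get a feel for what kind of text the model tends to produce.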
octoberfranklin 3 hours ago | parent
> You often can see the exact details in their model cards.
Bwahahahaaha. Lol. /me falls off of chair laughing
Come on, I've never found "exact details" about anything in a model card, except maybe the number of weights.