BariumBlue an hour ago

Hah, I was just thinking that Python likely has a vast ocean of training data, but it's probably of lower quality, since much of it is written by beginners and people who aren't primarily programmers.

topham 42 minutes ago | parent [-]

There's a broken idea that AI models know Python because they're written in Python.

Not how any of it works.

gertlabs 19 minutes ago | parent [-]

While recent models can generalize to any language at this point, I do think biases from their pretraining corpus still leak through into how they write their responses. We observed similar language preference patterns across models from different providers, btw.