▲ | tucnak 6 days ago | ||||||||||||||||
Unfortunately, it doesn't quite work like that. Google this: transfer learning. | |||||||||||||||||
▲ | bigmadshoe 6 days ago | parent [-] | ||||||||||||||||
I’m work in ML and I don’t understand your point. Transfer learning usually refers to leveraging data for a different task to help with a task for which you have limited data. You’re saying that the knowledge gained from the other languages transfers to English? I don’t think for a 270M parameter model the bottleneck is the availability of enough English language training data. | |||||||||||||||||
|