I'm LLM training right now!
I'm using a well established distributed data pipeline for my LLM training.