Because you provide them with both the "problem" and the "solution", and once you have both, you can scale your RL pipeline.
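To make that concrete, here is a minimal sketch of why having both halves matters: with a reference solution attached to each problem, a reward can be computed automatically, with no human in the loop. Everything here is an assumption for illustration. The `Task` type, the `normalize`, `reward`, and `score_batch` helpers, and the exact-match scoring rule are all hypothetical; a real pipeline would likely use a more robust checker (unit tests, a symbolic verifier, etc.).

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Task:
    """Hypothetical container pairing a problem with its reference solution."""
    problem: str
    solution: str


def normalize(text: str) -> str:
    # Lowercase and collapse whitespace so trivial formatting differences
    # do not affect the comparison.
    return " ".join(text.strip().lower().split())


def reward(task: Task, model_answer: str) -> float:
    # Assumed scoring rule: 1.0 on an exact match after normalization, else 0.0.
    return 1.0 if normalize(model_answer) == normalize(task.solution) else 0.0


def score_batch(tasks: List[Task], answers: List[str]) -> List[float]:
    # Score every (task, answer) pair; this is the part that scales,
    # because it needs nothing beyond the stored solutions.
    if len(tasks) != len(answers):
        raise ValueError("tasks and answers must have the same length")
    return [reward(t, a) for t, a in zip(tasks, answers)]


if __name__ == "__main__":
    tasks = [Task("2 + 2", "4"), Task("Capital of France?", "Paris")]
    print(score_batch(tasks, [" 4 ", "London"]))  # prints [1.0, 0.0]
```

The point of the sketch is only that the reward comes from the stored solution, so adding more (problem, solution) pairs adds more training signal without extra labeling work.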