▲ | magicalhippo 4 days ago | |
> but when you write a technical paper you must try to minimise the number of different possible readings of each sentence Fair point. It would indeed have been much more clear had they written something like this instead: a fully autonomous framework that generates its own fine-tuning/RL training data from scratch. |