| ▲ | ClaireBookworm 2 days ago | |
What sort of fine tuning data was needed to allow the model to self-drive? One hour of video of someone driving, or extra labeling? | ||
| ▲ | nee1r 2 days ago | parent | next [-] | |
i actually drove the car (with arrow keys) around south park for around ~45 minutes as finetuning data, no extra labelling other than that. think the car line graph is super cool because you actually see the videegame prior working | ||
| ▲ | g413n 2 days ago | parent | prev [-] | |
relevant note is that we finetuned by having the human also use arrow keys which keeps it in-distribution but also slower to collect | ||