| ▲ | littlestymaar 9 hours ago | |
If what you refer to by “on demand training ” is fine tuning, it's going to be much more efficient on a small model than a big one. | ||
| ▲ | red75prime 8 hours ago | parent [-] | |
LoRA can work with big models. But I mean sample-efficient RL. | ||