lostmsu 3 days ago

Why do you need 50k? Can't you tune using LoRA?

Danau5tin 3 days ago

Exactly my first thought when I realised the cost! LoRA is not currently supported by rLLM (the team told me they aim to support it in the next release), but it is certainly possible to port to verl directly or to another RL framework. I just did not have the time to port again (I had already done it twice, as other RL frameworks had issues).