Remix.run Logo
stefanwebb 5 days ago

There’s a similar library that also includes data synth and LLM-as-a-Judge: https://github.com/oumi-ai/oumi

BoorishBears 4 days ago | parent [-]

Yet another framework lying about Deepseek support.

I've been trying to actually finetune Deepseek (not distills) and there are few options

3abiton 4 days ago | parent [-]

Which version were you trying? Doesn't unsloth already support finetuning?

BoorishBears 4 days ago | parent [-]

Previous V3 base

Unsloth doesn't have an official multi-GPU story: there's hacked together solutions but they're finicky as it is for smaller models

In general Deepseek has very few resources on finetuning, that get even further muddied by people referring to the distills when they claim to be finetuning it.