▲ | v3ss0n 4 days ago | |
Those LLM influencers don't know what is a distill. Deepseek R1 8B IS A DISTILLED Qwen2 .you should be using qwen3 8b-14b instead a lot better | ||
▲ | zer0tonin 4 days ago | parent | next [-] | |
That's literally what I ended up doing in the article tho? | ||
▲ | Mars008 4 days ago | parent | prev [-] | |
"Deepseek R1" sounds cooler, everybody heard about it. |