| ▲ | kamranjon 8 hours ago | |||||||
The hugging face models are already up and seem to be the original models with the speculative decoding module built in which is very cool: Flash: https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash-DSpark Pro: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro-DSpark Excited to see if this makes it into DwarfStar for local inference, have been using the flash model extensively since the 2-bit quants were made available by antirez. | ||||||||
| ▲ | ilaksh 5 hours ago | parent [-] | |||||||
Any chance they will have this for Qwen 27 b also? | ||||||||
| ||||||||