| ▲ | madiator 5 hours ago | |
Check out OpenThoughts. It has a widely used dataset, a model that beats DeepSeek's smaller reasoning models, and a paper that talks in detail about the data curation methodology. | ||
| ▲ | lambda 2 hours ago | parent | next [-] | |
Oh, neat, I hadn't heard of that. From the blog, it looks like there hasn't been much progress for a few months, but if you check their HF, it looks like they have a series of 32B models trained on top of Qwen3 32B with different numbers of training examples, uploaded a few days ago: https://huggingface.co/collections/open-thoughts/openthinker... So it looks a bit more research-oriented than intended for production use, but still neat to see this effort. | ||
| ▲ | yogthos 4 hours ago | parent | prev [-] | |
neat | ||