| ▲ | another_twist an hour ago | |
Thats great. I think we need to start researching how to get cheaper models to do math. I have a hunch it should be possible to get leaner models to achieve these results with the right sort of reinforcement learning. | ||
| ▲ | alansaber 25 minutes ago | parent [-] | |
Deepseek wrote a decent paper on this https://github.com/deepseek-ai/DeepSeek-Math-V2/blob/main/De... | ||