DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf](github.com)
18 points by fspeech an hour ago | 5 comments
agentultra 6 minutes ago | parent | next [-]

So it's designed for informal proofs and it "verifies" based on a rubric fitting function and human interaction, is that right?

What's the use case for a system like this?

awei 18 minutes ago | parent | prev | next [-]

Something seems odd here. Why is it so hard to write a deterministic program that can check a proof, or anything math-related? Isn't math fully deterministic in a way that natural language is not? From first principles, it should be possible to do this without an LLM verifier.

zaxioms 23 minutes ago | parent | prev | next [-]

It's cool, but I genuinely cannot fathom why they are targeting natural language proofs instead of a proof assistant.

mamami a minute ago | parent [-]

Natural language is a lot more, well, readable than, say, Lean. With a formal proof you get a lot less intuition and understanding of what the model is attempting to do in the first place.
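
As a rough illustration of the trade-off (a minimal sketch, not taken from the paper): in prose one would simply write "addition of natural numbers is commutative." A Lean 4 version of the same statement might look like the block below. The theorem name `add_comm_example` is made up for this example, while `Nat.add_comm` is the existing core library lemma. The upside is that Lean's kernel checks this deterministically; the downside is that longer proofs in this style say much less about the underlying idea.

```lean
-- Hypothetical example: the statement "a + b = b + a" for natural numbers,
-- proved by appealing to the core lemma Nat.add_comm.
theorem add_comm_example (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b
```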

photon_lines 37 minutes ago | parent | prev [-]

Exciting stuff from a fantastic team.