DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf]

agentultra 6 minutes ago | parent | next [-]

So it's designed for informal proofs and it "verifies" based on a rubric fitting function and human interaction, is that right?

What's the use case for a system like this?

▲

awei 18 minutes ago | parent | prev | next [-]

Something weird here, why is it so hard to have a deterministic program capable of checking a proof or anything math related, aren't maths super deterministic when natural language is not. From first principles, it should be possible to do this without a llm verifier.

▲

zaxioms 23 minutes ago | parent | prev | next [-]

It's cool, but I genuinely cannot fathom why they are targeting natural language proofs instead of a proof assistant.

	▲	mamami a minute ago \| parent [-]
		Natural language is a lot more, well, readable than say lean. You get a lot less intuition and understanding of what the model is attempting to do in the first place.

▲

photon_lines 37 minutes ago | parent | prev [-]

Exciting stuff from a fantastic team.