martincsweiss 2 days ago
This is a super interesting claim - can you point to these benchmarks?
cubefox 2 days ago
https://deepmind.google/models/gemini-diffusion/#benchmarks

> Gemini Diffusion’s external benchmark performance is comparable to much larger models, whilst also being faster.

That doesn't necessarily mean that they scale as well as autoregressive models.

mdp2021 2 days ago
Try this one: "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
mountainriver 2 days ago