▲ | cs702 3 days ago | |
I would read the blog post by the lead author instead of watching this video: https://alexiglad.github.io/blog/2025/ebt/ Also, see: https://www.reddit.com/r/MachineLearning/comments/1lu1ia0/r_... | ||
▲ | programjames 2 days ago | parent [-] | |
TLDR; Train an "energy" model that checks if the output is correct (rather than just outputting something), and gradient descent to find good outputs. Using transformers. |