| ▲ | 7777777phil 3 days ago | |
AlphaZero worked because chess and Go have terminal rewards and positions you can prove are right or wrong. General intelligence has neither, and the leap from self-play in a well-defined game to self-play in arbitrary environments is the hard part Silver isn't really demoing. Sara Hooker's stuff on scaling laws lines up here (1) (1) https://philippdubach.com/posts/the-most-expensive-assumptio... | ||
| ▲ | naveen99 2 days ago | parent [-] | |
Math, physics, markets, computation… | ||