| ▲ | j2kun 13 hours ago |
| They claim the algorithm "discovered" the new techniques, but the methods described in section 5 do not seem all that novel to me. It smells like it could be "laundering" the literature [1] and reshuffling existing techniques. This is not inherently a bad thing, but I would hope that if it is borrowing existing techniques, the appropriate citation would eventually make it into this paper. [1]: https://www.argmin.net/p/lore-laundering-machines |
|
| ▲ | Q6T46nT668w6i3m 12 hours ago | parent | next [-] |
| You’re not kidding. I just looked. There isn’t anything novel in that section. I assumed from the description they found novel methods but this is standard GPU Gems advice. |
|
| ▲ | AlexCoventry 13 hours ago | parent | prev | next [-] |
| In the future, we will all be Jürgen Schmidhuber. :-) |
| |
| ▲ | hedgehog 8 hours ago | parent [-] | | I hate to break it to you but the original work on that topic was by Schmidhuber & Schmidhuber back in 1963. |
|
|
| ▲ | alyxya 13 hours ago | parent | prev [-] |
| There generally aren't new techniques when optimizing something ubiquitous. Instead, there are a lot of ways to apply existing techniques to create new and better results. Most ideas are built on top of the same foundational principles. |
| |
| ▲ | slashdave 11 hours ago | parent [-] | | I am not sure about that. However, what is clear is that if there is a new technique, it will not be found by this LLM. | | |
| ▲ | CapsAdmin 9 hours ago | parent [-] | | It's generally true, isn't it? Otherwise we'd have ground breaking discoveries every day about some new and fastest way to do X. The way I see it, mathematicians have been trying (and somewhat succeeding every 5~ years) to prove faster ways to do matrix multiplications since the 1970s. But this is only in theory. If you want to implement the theory, you suddenly have many variables you need to take care of such as memory speed, cpu instructions, bit precision, etc. So in practice, an actual implementation of some theory likely have more room to improve. It is also likely that LLM's can help figure out how to write a more optimal implementation. |
|
|