| ▲ | nasvay_factory a day ago | |||||||||||||
I wrote about that a while ago: https://paxamans.github.io/blog/titans/ | ||||||||||||||
| ▲ | moffkalast a day ago | parent [-] | |||||||||||||
Are there any pretrained models with this architecture yet or is it all still completely theoretical beyond Google's unverifiable claims? They published the original Titans paper last year and nobody seems to have built on the idea. | ||||||||||||||
| ||||||||||||||