| ▲ | hyperhello 4 hours ago | |||||||||||||
From context then, I infer that a transformer is not comprised of matrix multiplications, because it would simply be one that adds two 10-digit numbers. | ||||||||||||||
| ▲ | medi8r 4 hours ago | parent [-] | |||||||||||||
A transformer tokenizes input, does a bunch of matmul and relu set up in a certain way. It doesn't get to see the raw number (just like you don't when you look at 1+1 you need visual cortex etc. first.) | ||||||||||||||
| ||||||||||||||