| ▲ | UncleOxidant 41 minutes ago |
| That sounds kind of amazing. But you're not actually doing the machine learning in Racket, are you? Is your Racket code generating other code like PyTorch? |
|
| ▲ | noosphr 22 minutes ago | parent [-] |
| I'm doing the learning in Racket because the bottleneck is human understanding. That MNIST takes 30 minutes per epoch isn't a worry when I don't even know what vector addition should look like. |
| |
| ▲ | UncleOxidant 10 minutes ago | parent [-] |
| This is a complete tangent, but since you mentioned MNIST: I accidentally discovered Tsetlin machines this week when someone on r/Julia asked if anyone with an AMD GPU could run the benchmark in their package, Tsetlin.jl. I've got an AMD GPU, so I was happy to oblige. Then I looked at what the benchmark was doing: it was training an MNIST classifier to 98% accuracy in 9 seconds, which seemed like a couple of orders of magnitude too fast. I was flabbergasted and wondered what the heck this thing was, and that's when I learned about Tsetlin machines. I went on (with the help of Claude) to implement one on an FPGA and was flabbergasted again when it took only 2k LUTs to implement a Tsetlin machine for MNIST classification in hardware. |
|