| ▲ | Soul Player C64 – A real transformer running on a 1 MHz Commodore 64(github.com) | |||||||||||||||||||||||||||||||
| 125 points by adunk 13 hours ago | 33 comments | ||||||||||||||||||||||||||||||||
| ▲ | wk_end 10 hours ago | parent | next [-] | |||||||||||||||||||||||||||||||
> 25K parameters is about 70 million times smaller than GPT-4. It will produce broken sentences. That's the point - the architecture works at this scale. Since it seems to just produce broken and nonsensical sentences (at least based on the one example given) I'm not sure if it does work at this scale. Anyway, as written this passage doesn't really make a whole lot of sense (the point is that it produces broken sentences?), and given that it was almost certainly written by an AI, it demonstrates that the architecture doesn't work especially well at any scale (I kid, I kid). | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | mixmastamyk 9 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
Just reminded me of the random sentence generator program on my Vic-20. I had changed most of the words to all the bad words a preteen could think up. So many laughs with the neighborhood kids. | ||||||||||||||||||||||||||||||||
| ▲ | daemonologist 9 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
You can chat with the model on the project page: https://indiepixel.de/meful/index.html It (v3) mostly only says hello and bye, but I guess for 25k parameters you can't complain. (I think the rather exuberant copy is probably the product of Claude et al.) | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | arketyp 2 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
I love these counterfactual creations on old hardware. It highlights the magical freedom of creativity of software. | ||||||||||||||||||||||||||||||||
| ▲ | borsch_not_soup 7 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
Interesting, I’ve always thought neural network progress was primarily bottlenecked by compute. If it turns out that LLM-like models can produce genuinely useful outputs on something as constrained as a Commodore 64—or even more convincingly, if someone manages to train a capable model within the limits of hardware from that era—it would suggest we may have left a lot of progress on the table. Not just in terms of efficiency, but in how we framed the problem space for decades. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | classichasclass 10 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
If you're running this in VICE, run it under the SuperCPU with warp mode on. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | rahen an hour ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
A little disappointed to see PyTorch + Claude here. I was hoping for some "demo-scene" hand-crafted 6502 assembly, and hopefully training on the C64. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | anyfoo 10 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
This would have blown me away back in the late 80s/early 90s. (Or maybe not, if it doesn't perform better than random, I haven't actually tried it out yet. Some more examples would have been nice!) I wonder how far you could push this while still staying period correct, e.g. by adding a REU (RAM Expansion Unit), or even a GeoRAM (basically a REU on steroids). SuperCPU would also be an option, but for me it's always blurring the line of "what is a C64" a bit too much, and it likely just makes it faster anyway. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | djmips 5 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
Dissapointed - there was no 6502 code in the GitHub repo. | ||||||||||||||||||||||||||||||||
| ▲ | Vaslo 7 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
Load”*”,8,1 Brings back memories | ||||||||||||||||||||||||||||||||
| ▲ | brcmthrowaway 9 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
How does this compare to ELIZA? | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | harel 11 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
Eliza called, and asked if we saw her grand kids... | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | Lerc 10 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
Ok now we need 1541 flash attention. I'm not sure what the venn diagram of knowledge to understand what that sentence is suggesting looks like, it's probably more crowded in the intersection than one might think. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | bighead1 11 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
i hate ai, and i love the c64, but i'll allow it. | ||||||||||||||||||||||||||||||||
| ▲ | ghstinda 10 hours ago | parent | prev [-] | |||||||||||||||||||||||||||||||
but can you make mac keyboards feel like a c64c? | ||||||||||||||||||||||||||||||||