It makes sense that a next-token predictor could execute assembly code. This is fascinating work, especially the memory implementation.