I don't know what kind of code sysbench is using, but I get far better with a very simple `memcpy()` loop:
See https://news.ycombinator.com/item?id=48523343