Apple also implemented x86 memory-ordering semantics (total store ordering, TSO) for AArch64, which allows simpler translation and faster execution of translated x86 code.
In HW?
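For anyone wondering why the memory model matters to a translator: x86 guarantees that stores from one thread become visible to other threads in program order, while the default AArch64 model allows them to be reordered unless the code asks for ordering. Here is a rough sketch of the classic message-passing pattern in Rust. The names `message_passing`, `data`, and `ready` are made up for illustration, and this says nothing about how Apple's implementation actually works.

```rust
use std::sync::atomic::{AtomicBool, AtomicU32, Ordering};
use std::thread;

/// Message-passing litmus test: the producer writes a payload and then
/// raises a flag; the consumer waits for the flag and reads the payload.
/// Returns the payload value seen by the consumer.
fn message_passing() -> u32 {
    let data = AtomicU32::new(0);
    let ready = AtomicBool::new(false);

    thread::scope(|s| {
        // Producer: payload first, then the flag.
        s.spawn(|| {
            data.store(42, Ordering::Relaxed);
            // Release keeps the payload store ordered before the flag store.
            // On x86 this is an ordinary store; on AArch64 the compiler
            // has to emit a store-release (or a barrier) to get the same effect.
            ready.store(true, Ordering::Release);
        });

        // Consumer: spin until the flag is set, then read the payload.
        let consumer = s.spawn(|| {
            // Acquire pairs with the Release store above.
            while !ready.load(Ordering::Acquire) {
                std::hint::spin_loop();
            }
            data.load(Ordering::Relaxed)
        });

        consumer.join().unwrap()
    })
}
```

Calling `message_passing()` returns 42 because of the Release/Acquire pair. With both flag accesses downgraded to `Relaxed`, the language no longer guarantees that result, and weakly ordered hardware can in principle show the stale 0. A binary translator only sees plain x86 loads and stores and cannot tell which ones act as flags, so on a weakly ordered target it would have to order nearly every memory access conservatively. A core that can run with TSO ordering lets it translate them as plain accesses instead.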