ARM has THREE instruction sets (four?): AArch32, AArch64, and various incarnations of Thumb. (A Pi 5 supports all three.)
Okay? x86-64 has like twenty extensions. What's your point?
The point was that ARM has no advantage over x86 when it comes to the effects of implementing multiple decode stages for multiple instruction sets.
(M1 does, because it doesn't implement AArch32 or Thumb.)