| ▲ | jxors 5 hours ago | |||||||
This flowchart hides the most awful parts (IMO) of x86 prefixes: some combinations of prefixes are invalid but still parsed and executed, like combining two segment overrides, or placing a legacy prefix after a REX prefix. The CPU also doesn't care if you use prefixes that aren't valid for a specific instruction, for example a REP on a non-repeatable instruction. The LOCK prefix is the only prefix that makes the sane choice to reject invalid combinations, rather than silently accept them. Also, the (E)VEX prefix doesn't behave like the other prefixes: it must be placed last, and can therefore only appear once. All other prefixes can be repeated. | ||||||||
| ▲ | bonzini 32 minutes ago | parent | next [-] | |||||||
Yes, I wish this was this simple. :) There are many other complications: * Some instructions require VEX.L or VEX.W to be 0 or 1, and some encodings result in completely different instructions if you change VEX.L. * Different bits of the EVEX prefix are valid depending on the opcode byte. * Some encodings (called groups) produce different instructions depending on bits 3-5 of the modrm byte (the second byte after all prefixes). Some encodings further produce different groups depending on whether bits 6-7 (mod) of the modrm byte identifies a register or not. * Some instructions read a whole vector register but only a scalar if the same instruction has a memory operand. Sometimes this is clear in the manual, sometimes it is not, sometimes the manual is downright wrong. * Some instructions do not allow using the legacy high-8-bits registers even though they don't do anything with bits 8 and above of the operand: they only want a 32- or 64-bit register as their operand. * APX (EVEX map 4) looks a lot like legacy map 0, but actually a few instructions were moved there from other maps for good reasons, a few more were moved there for no apparent reason (SHLD/SHRD iirc), and a few more are new. * REX2 does not extend SSE and AVX instructions to 32 registers even though REX does extend them to 16. * Intel defines a thing called VEX instruction classes, which makes sense except for a dozen or two instructions where it doesn't. For these, sometimes AMD uses a different class, sometimes doesn't; sometimes AMD's choice makes sense, sometimes it doesn't. And many more that I found out while writing QEMU's current x86 decoder (which tries to be table based but sometimes that's just impossible). | ||||||||
| ▲ | peterfirefly 4 hours ago | parent | prev | next [-] | |||||||
> The CPU also doesn't care if you use prefixes that aren't valid for a specific instruction, for example a REP on a non-repeatable instruction. This is one of the reasons why the x86 could be extended so much. PAUSE is just REP NOP, for example. Segment prefixes in front of conditional branches were used as static branch prediction hints (which I believe have returned in some newer Intel CPUs). Useful if you want to make a hint on newer CPUs that is harmless on older CPUs. Some prefixes have become part of the encoding for certain SIMD instructions, but that is a different case because those prefixes aren't hints. | ||||||||
| ||||||||
| ▲ | vardump 5 hours ago | parent | prev [-] | |||||||
I wonder whether there are some prefixes that cause (some) CPUs to execute the instruction a lot slower. | ||||||||