| ▲ | camel-cdr 3 days ago | |||||||
> It's a shame though that Zcmp extension didn't get into RVA23 even as an optional extension Zcmp is only for embedded applications without D support. You wouldn't want an instruction with up to 13 destinations in high performance designs anyways. If you want load/store pair, we already have that, you can just interpret two adjacent 16-bit load or stores as a single 32-bit instruction. | ||||||||
| ▲ | Joker_vD a day ago | parent [-] | |||||||
> You wouldn't want an instruction with up to 13 destinations in high performance designs anyways. Why not? Code density matters even in high-performance designs although I guess the "millicode routines" can help with that somewhat. Still, the ordering of stores/loads is undefined, and they are allowed to be re-done however many times, so... it shouldn't be onerous to implement? Expanding it into μops during the decoding stages seems straightforward. | ||||||||
| ||||||||