| ▲ | Remnant44 5 hours ago | |||||||
which honestly, shouldn't be neccessary today with avx512. There's essentially no reason to prefer the aligned load/store commands over the unaligned ones - if the actual pointer is unaligned it will function correctly at half the throughput, while if it_is_ aligned you will get the same performance as the aligned-only load. No reason for the compiler to balk at vectorizing unaligned data these days. | ||||||||
| ▲ | dmpk2k 4 hours ago | parent [-] | |||||||
> There's essentially no reason to prefer the aligned load/store commands over the unaligned ones - if the actual pointer is unaligned it will function correctly at half the throughput Getting a fault instead of half the performance is actually a really good reason to prefer aligned load/store. To be fair, you're talking about a compiler here, but I never understood why people use the unaligned intrinsics... | ||||||||
| ||||||||