| ▲ | aseipp 2 hours ago | |
Knights Landing is a major outlier; the cores there were extremely small and had very few resources dedicated to them (e.g. 2-wide decode) relative to the vector units, so of course that will dominate. You aren't going to see 40% of the die dedicated to vector register files on anything looking like a modern, wide core. The entire vector unit (with SRAM) will be in the ballpark of like, cumulative L1/L2; a 512-bit register is only a single 64 byte cache line, after all. | ||
| ▲ | dlcarrier an hour ago | parent | next [-] | |
Also, the Knights Landing/Mill implementation is completely different from modern AVX-512. It's Ice Lake and Zen 4 that introduced modern AVX-512. | ||
| ▲ | Aurornis an hour ago | parent | prev [-] | |
True! But even if only 20% of the die area goes to AVX-512 in larger cores, that makes a big difference for high core count CPUs. That would be like having a 50-core CPU instead of a 64-core CPU in the same space. For these cloud native CPU designs everything that takes significant die area translates to reduced core count. | ||