There are some solutions in HPC that try to tackle this. For example, mpibind (https://github.com/LLNL/mpibind) is deployed on El Capitan.
Would be interesting to see if something similar appears for cloud workloads.