stonogo 3 days ago
Probably something more like this? https://github.com/aws-samples/amazon-fsx-tutorial/tree/mast...
foota 2 days ago | parent
So... The issue is that I'm not using Lustre. As far as I can tell, NVMe over Fabrics (NVMe-oF) for RDMA is implemented by the kernel modules nvmet-rdma and nvme-rdma (the first being for the target). These kernel modules support InfiniBand and, I think, Fibre Channel, but _not_ EFA, and EFA itself is not an implementation of InfiniBand. There are user space libraries that paper over these differences when you use them purely for network transport (e.g., libfabric), and EFA sorta pretends to be IB, but afaict this is just meant to ease integration at the user space level.

Unfortunately, since the NVMe-oF kernel modules have no support for EFA, it doesn't seem possible to use without Lustre. I don't know exactly how they're doing it for Lustre clients. There seems to be a Lustre client kernel module though, so my guess is that it's in there? The Lustre networking module, LNet, does have an EFA integration, but it seems to serve only as a network transport. I don't see anything in Lustre about NVMe-oF though, so I'm not sure.

Maybe there's something I'm missing, and it'll just work if I give it a try :)
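
For anyone who wants to check this on their own instance, here is a minimal sketch (not an official tool) that reports which NVMe-oF transport modules are loaded and whether the EFA driver is present. It assumes the usual `/proc/modules` format, where hyphens in module names show up as underscores, and that the EFA driver module is named `efa`. The `report` helper and the module table are just names made up for this example.

```python
from pathlib import Path

# NVMe-oF transport modules as they would appear in /proc/modules
# (assumption: the kernel lists nvme-rdma as nvme_rdma, and so on).
NVMEOF_MODULES = {
    "nvme_rdma": "host side, RDMA transport",
    "nvmet_rdma": "target side, RDMA transport",
    "nvme_tcp": "host side, TCP transport",
    "nvmet_tcp": "target side, TCP transport",
    "nvme_fc": "host side, Fibre Channel transport",
    "nvmet_fc": "target side, Fibre Channel transport",
}


def loaded_modules(proc_modules_text):
    """Return the set of module names from /proc/modules-style text."""
    return {
        line.split()[0]
        for line in proc_modules_text.splitlines()
        if line.strip()
    }


def report(proc_modules_text):
    """Summarize loaded NVMe-oF transports and whether the efa driver is loaded."""
    loaded = loaded_modules(proc_modules_text)
    return {
        "nvmeof": sorted(name for name in NVMEOF_MODULES if name in loaded),
        "efa": "efa" in loaded,
    }


if __name__ == "__main__":
    path = Path("/proc/modules")
    if path.exists():  # only present on Linux
        print(report(path.read_text()))
```

Seeing both `efa` and `nvme_rdma` loaded would not by itself mean they can talk to each other; it only tells you which pieces are present.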