
Support ibv_reg_dmabuf_mr for buffer allocated by cuMemMalloc #513


Open
wants to merge 11 commits into
base: main

Conversation

seagater (Contributor)

Fix issue #496
For buffers allocated by cuMemMalloc, use ibv_reg_dmabuf_mr to register a dma-buf based memory region.

@seagater seagater requested review from Binyang2014 and chhwang April 24, 2025 19:06
src/ib.cc Outdated
if (cuMemAlloc) {
int fd;
cuMemGetHandleForAddressRange(&fd, dptr, pages * pageSize, CU_MEM_RANGE_HANDLE_TYPE_DMA_BUF_FD, 0);
this->mr = IBVerbs::ibv_reg_dmabuf_mr2(pd, 0, pages * pageSize, addr, fd,
Contributor


Please double-check the size here. I think we need to use another size instead of pages * pageSize.

src/ib.cc Outdated
bool cuMemAlloc = mscclpp::isCuMemMapAllocated((void*)dptr);
if (cuMemAlloc) {
int fd;
cuMemGetHandleForAddressRange(&fd, dptr, pages * pageSize, CU_MEM_RANGE_HANDLE_TYPE_DMA_BUF_FD, 0);
Contributor


Maybe we need to use buff directly instead of dptr; please double-check the API doc.

Contributor Author


The second parameter should be CUdeviceptr dptr.
CUresult cuMemGetHandleForAddressRange ( void* handle, CUdeviceptr dptr, size_t size, CUmemRangeHandleType handleType, unsigned long long flags )
https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__MEM.html#group__CUDA__MEM_1g51e719462c04ee90a6b0f8b2a75fe031

#if !defined(__HIP_PLATFORM_AMD__)
MSCCLPP_CUTHROW(cuCtxGetDevice(&dev));
MSCCLPP_CUTHROW(cuDeviceGetAttribute(&dmaBufSupported, CU_DEVICE_ATTRIBUTE_DMA_BUF_SUPPORTED, dev));
#endif // !defined(__HIP_PLATFORM_AMD__)
Contributor


Do we need this macro? We map HIP functions to CUDA driver functions; maybe you can refer to this file: https://github.com/microsoft/mscclpp/blob/main/include/mscclpp/gpu.hpp
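The aliasing approach the reviewer is pointing at could look roughly like this (a hypothetical header fragment illustrating the idea, not the actual contents of gpu.hpp; whether every symbol here has a HIP equivalent, e.g. CU_DEVICE_ATTRIBUTE_DMA_BUF_SUPPORTED, is exactly what the #if guard in the quoted code works around):

```cpp
// Hypothetical sketch: on the AMD platform, alias the CUDA driver-API
// symbols to their HIP counterparts so call sites need no #if guards.
#if defined(__HIP_PLATFORM_AMD__)
#define cuCtxGetDevice hipCtxGetDevice
#define cuDeviceGetAttribute hipDeviceGetAttribute
// ...one alias per driver-API symbol the code base uses
#endif
```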

Contributor Author



int fd;
MSCCLPP_CUTHROW(cuMemGetHandleForAddressRange(&fd, base + alignedOffset, alignedUserBufferSize,
CU_MEM_RANGE_HANDLE_TYPE_DMA_BUF_FD, 0));
Contributor


Seems we can use cuMemGetHandleForAddressRange(&fd, addr, pages, CU_MEM_RANGE_HANDLE_TYPE_DMA_BUF_FD, 0) directly. Do we still need to call cuMemGetAddressRange?
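Taken together, the registration path being discussed in this review is roughly the following (pseudocode sketch; the exact size argument for both calls and the need for cuMemGetAddressRange are the open questions above, and error handling is omitted):

```
// Pseudocode sketch of the dma-buf registration path.
// 1. Detect that the buffer came from the CUDA VMM (cuMemMap) allocator.
if (mscclpp::isCuMemMapAllocated(addr)) {
    // 2. Ask the driver for a dma-buf fd covering the page-aligned range.
    cuMemGetHandleForAddressRange(&fd, dptr, alignedSize,
                                  CU_MEM_RANGE_HANDLE_TYPE_DMA_BUF_FD, 0);
    // 3. Register the dma-buf with the HCA instead of pinning host-visible
    //    memory through ibv_reg_mr.
    mr = ibv_reg_dmabuf_mr(pd, /*offset=*/0, alignedSize,
                           /*iova=*/(uint64_t)addr, fd, accessFlags);
} else {
    // Fallback: ordinary memory registration.
    mr = ibv_reg_mr(pd, addr, size, accessFlags);
}
```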


3 participants