
Conversation

JigaoLuo (Contributor)

Description

This is an initial draft for Issue #1959 that adds a host memory resource for a pinned bounce buffer into rmm::device_scalar.

It’s an early attempt and still incomplete, particularly in its constructor coverage and unit tests. I expect this PR draft will spark some discussion, and I subscribe to the idea that “perfect is the enemy of good.” I hope it’s okay to use this draft as a starting point for conversation; if any major changes are needed, I’m happy to revisit it or even drop the current approach for a better one.
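
The core idea, very roughly, is the following minimal sketch (hypothetical names, not this PR’s actual code): stage the device value in a pinned host buffer so the device-to-host copy stays asynchronous and only the owning stream needs to be synchronized.

#include <cuda_runtime_api.h>

// Minimal sketch of the idea (hypothetical names; not the PR's actual code).
// A pinned (page-locked) host allocation lets cudaMemcpyAsync remain truly
// asynchronous; with pageable host memory the driver may fall back to a
// synchronizing copy.
template <typename T>
class pinned_scalar_sketch {
 public:
  pinned_scalar_sketch()
  {
    cudaMalloc(reinterpret_cast<void**>(&device_ptr_), sizeof(T));
    cudaMallocHost(reinterpret_cast<void**>(&bounce_), sizeof(T));  // pinned bounce buffer
  }
  ~pinned_scalar_sketch()
  {
    cudaFreeHost(bounce_);
    cudaFree(device_ptr_);
  }

  T value(cudaStream_t stream) const
  {
    // Device -> pinned host: stays asynchronous, so only this stream needs
    // to be synchronized before the value can be read.
    cudaMemcpyAsync(bounce_, device_ptr_, sizeof(T), cudaMemcpyDeviceToHost, stream);
    cudaStreamSynchronize(stream);
    return *bounce_;
  }

 private:
  T* device_ptr_{};
  T* bounce_{};
};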

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Signed-off-by: Jigao Luo <[email protected]>
@JigaoLuo JigaoLuo requested a review from a team as a code owner July 12, 2025 11:18
@JigaoLuo JigaoLuo requested review from davidwendt and lamarrr July 12, 2025 11:18
copy-pr-bot bot commented Jul 12, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.


@JigaoLuo changed the title to [🚧 Draft]: Adding host-mr for pinned bounce buffer to rmm::device_scalar Jul 12, 2025
@JigaoLuo JigaoLuo marked this pull request as draft July 12, 2025 11:20
}
}

// Disallow passing literals to set_value to avoid race conditions where the memory holding the
@devavret (Contributor)
I believe you can do this too now? Copy to the bounce buffer immediately and then do the copy async.
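
For concreteness, that suggestion amounts to roughly this sketch (an illustrative free function with hypothetical names, rather than the PR’s actual member function):

// Sketch of the suggested fix (illustrative names): stage the literal in the
// pinned bounce buffer with a synchronous host-side copy; the temporary then
// no longer needs to outlive the asynchronous device copy.
template <typename T>
void set_value_sketch(T const& value, T* bounce, T* device_ptr, cudaStream_t stream)
{
  *bounce = value;  // host copy completes immediately, so a temporary is safe
  cudaMemcpyAsync(device_ptr, bounce, sizeof(T), cudaMemcpyHostToDevice, stream);
  // Caveat: a second set_value must not overwrite `bounce` before this copy
  // finishes, so back-to-back calls still need a stream sync or a second buffer.
}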

@JigaoLuo (Contributor Author) Jul 17, 2025

Hi @devavret, thanks!
Sorry, I wasn’t sure I understood you correctly.
These three lines refer to the else case, where no host-pinned bounce buffer is allocated; that’s actually the only behavior we currently have in RMM.
Do you mean we should just allocate a buffer on every call of this function?

Member

I think @devavret is referring to the deleted function below.

@JigaoLuo (Contributor Author)

I get it now. Let’s first discuss where the buffer should be placed, as mentioned above. Once that’s settled, I can address this detail.

// Case: the value was already staged in the host-pinned bounce buffer.
return **_host_bounce_buffer;
} else {
// Case: Copying with pageable host memory, which may trigger an implicit synchronization.
return _storage.front_element(stream);
@harrism (Member)

This makes me wonder if the bounce-buffer copying support should be in device_buffer instead.

@JigaoLuo (Contributor Author) Jul 18, 2025

Hi @harrism, thanks! Yes, that’s also my main concern, and it’s why I prepared only a draft rather than a polished pull request: there’s a chance it will need to be rewritten again, just as I already did in cuDF.

We can discuss where the buffer should live. cuDF places the buffer in its scalar header, and that’s the behavior I’d like to mimic.
Alternatively, storing it in the RMM device_uvector header is also possible, since the element() call results in a copy as well. The twin issue is here: #1955
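
For reference, device_uvector already returns elements by value, so every element() call performs a device-to-host copy that could route through the same bounce buffer:

#include <rmm/cuda_stream_view.hpp>
#include <rmm/device_uvector.hpp>

// Each element() call copies one value device -> host into a local variable
// (pageable stack memory today), which is exactly where a pinned bounce
// buffer could help.
void element_copy_example(rmm::cuda_stream_view stream)
{
  rmm::device_uvector<int> vec(16, stream);  // 16 uninitialized device ints
  int first = vec.element(0, stream);        // returns by value: one D2H copy
  (void)first;
}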


@bdice (Contributor) left a comment

This is a nice starting point. I think I agree with the suggestion that this should be implemented in the device buffer, and then exposed in the device scalar. Small vectors and other containers based on the device buffer would benefit from this, too.
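
A rough sketch of that layering (illustrative names only; not RMM’s or the PR’s actual API) could look like this, with the buffer owning the pinned staging area and the scalar simply forwarding to it:

#include <cuda_runtime_api.h>
#include <cstddef>
#include <cstring>

// Hypothetical layering: the buffer owns a pinned staging area of the same
// size as the device allocation, and every container built on top of it
// reuses the same fast device-to-host path.
class buffer_sketch {
 public:
  explicit buffer_sketch(std::size_t bytes)
  {
    cudaMalloc(&data_, bytes);
    cudaMallocHost(&bounce_, bytes);  // pinned, same size as the device buffer
  }
  ~buffer_sketch()
  {
    cudaFreeHost(bounce_);
    cudaFree(data_);
  }

  // Stage device bytes in pinned memory, sync only this stream, copy out.
  void copy_to_host(void* dst, std::size_t offset, std::size_t bytes, cudaStream_t stream)
  {
    cudaMemcpyAsync(bounce_, static_cast<char*>(data_) + offset, bytes,
                    cudaMemcpyDeviceToHost, stream);
    cudaStreamSynchronize(stream);
    std::memcpy(dst, bounce_, bytes);
  }

 private:
  void* data_{};
  void* bounce_{};
};

// The scalar then forwards instead of owning its own bounce buffer.
template <typename T>
T scalar_value_sketch(buffer_sketch& storage, cudaStream_t stream)
{
  T out{};
  storage.copy_to_host(&out, 0, sizeof(T), stream);
  return out;
}

Sizing the pinned area to match the device allocation, as in this sketch, is one plausible answer to the equal-size follow-up question below, since it mirrors cuDF’s design.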

@github-actions github-actions bot added the CMake label Jul 18, 2025
@JigaoLuo (Contributor Author) commented Jul 18, 2025

> This is a nice starting point. I think I agree with the suggestion that this should be implemented in the device buffer, and then exposed in the device scalar. Small vectors and other containers based on the device buffer would benefit from this, too.

Hi @bdice, thanks for the code review! I’m happy to rewrite it again if needed. One follow-up question: should the device buffer follow cuDF’s design, where the host and device buffers have the same size?

@JigaoLuo JigaoLuo force-pushed the adding-buffer-device_scalar branch from 9e10849 to ad09350 Compare July 19, 2025 06:59
@github-actions github-actions bot removed the CMake label Jul 19, 2025
@JigaoLuo (Contributor Author) commented Jul 23, 2025

Hi all reviewers,

I’d like to close this draft and move to a new one, #1996, where I’ll be rewriting device_buffer. If you think this draft should be reopened, feel free to do so or just ping me. Thanks, and let’s continue the discussion in the new PR draft!
