Adding nvtx memory regions to pool MR #1952

nirandaperera · 2025-06-10T22:09:05Z

Description

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

Signed-off-by: niranda perera <[email protected]>

harrism · 2025-06-11T00:16:09Z

cpp/include/rmm/mr/device/detail/stream_ordered_memory_resource.hpp

                  rmm::detail::format_bytes(size) + ")",
                rmm::out_of_memory);
-    auto const block = this->underlying().get_block(size, stream_event);
+    auto const block = get_block(size, stream_event);


❓ question: ‏ Why drop the CRTP indirection here? This doesn't seem related to this PR.

@harrism that's right. But when I was reading the code, what I gathered was, get_block is not implemented by the derived class. It's not mentioned here as well. https://github.com/nirandaperera/rmm/blob/adding_nvtx_pool/cpp/include/rmm/mr/device/detail/stream_ordered_memory_resource.hpp#L70-L76
So, IINM, we can simply call the method, without the indirection.

OK, I see. Good catch.

harrism · 2025-06-11T00:20:57Z

cpp/include/rmm/mr/device/pool_memory_resource.hpp

 #endif

+#ifdef RMM_NVTX
+    void* heap_key;


So this adds some overhead on every suballocation. And the insertion into the nvtx_heaps map is a small overhead on upstream allocations.

Can you please benchmark this cost with the random allocations benchmark with NVTX on and off and report it in the PR? Is NVTX enabled by default? Depending on these costs, we may want it off by default.

@harrism Yes, there is an overhead here. In particular

Inserting and querying from the nvtx_heaps_ unordered map.

calling lower_bound on unstream_blocks_ set (which is logarithmic)

I think we can alleviate 2, if we add a void* upstream_ member to the block class, rather than the bool head. Then IINM, is_head() will be upstream_ == ptr_. But then, we are adding additional 3-bytes to the block class.

Do you think its a worthwhile change?

I would like to see benchmarks, if you don't mind. :)

Signed-off-by: niranda perera <[email protected]>

copy-pr-bot · 2025-06-12T00:03:10Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

wence- · 2025-07-30T15:15:22Z

@nirandaperera Have you had a chance to run the benchmarks Mark was looking for to see any perf differences?

nirandaperera added 2 commits June 9, 2025 17:31

init

4f63732

Signed-off-by: niranda perera <[email protected]>

dummy test

bb11510

Signed-off-by: niranda perera <[email protected]>

nirandaperera requested a review from a team as a code owner June 10, 2025 22:09

nirandaperera requested review from bdice and vyasr June 10, 2025 22:09

github-project-automation bot added this to RMM Project Board Jun 10, 2025

harrism reviewed Jun 11, 2025

View reviewed changes

harrism added non-breaking Non-breaking change feature request New feature or request labels Jun 11, 2025

nirandaperera added 4 commits June 11, 2025 12:23

adding debug logs

fa8227f

Signed-off-by: niranda perera <[email protected]>

Merge branch 'main' of github.com:rapidsai/rmm into adding_nvtx_pool

6ffbc2d

precommit

a6317d4

Signed-off-by: niranda perera <[email protected]>

adding example

f1f453c

Signed-off-by: niranda perera <[email protected]>

nirandaperera requested a review from a team as a code owner June 12, 2025 00:02

github-actions bot added the CMake label Jun 12, 2025

nirandaperera marked this pull request as draft June 12, 2025 00:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adding nvtx memory regions to pool MR #1952

Adding nvtx memory regions to pool MR #1952

Uh oh!

nirandaperera commented Jun 10, 2025

Uh oh!

harrism Jun 11, 2025

Uh oh!

nirandaperera Jun 11, 2025

Uh oh!

harrism Jun 11, 2025

Uh oh!

harrism Jun 11, 2025

Uh oh!

nirandaperera Jun 11, 2025

Uh oh!

nirandaperera Jun 11, 2025

Uh oh!

harrism Jun 11, 2025

Uh oh!

copy-pr-bot bot commented Jun 12, 2025

Uh oh!

wence- commented Jul 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Adding nvtx memory regions to pool MR #1952

Are you sure you want to change the base?

Adding nvtx memory regions to pool MR #1952

Uh oh!

Conversation

nirandaperera commented Jun 10, 2025

Description

Checklist

Uh oh!

harrism Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

nirandaperera Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

harrism Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

harrism Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

nirandaperera Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

nirandaperera Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

harrism Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

copy-pr-bot bot commented Jun 12, 2025

Uh oh!

wence- commented Jul 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants