Adding nvtx memory regions to pool MR by nirandaperera · Pull Request #1952 · rapidsai/rmm

nirandaperera · 2025-06-10T22:09:05Z

Description

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

Signed-off-by: niranda perera <niranda.perera@gmail.com>

harrism · 2025-06-11T00:16:09Z

                  rmm::detail::format_bytes(size) + ")",
                rmm::out_of_memory);
-    auto const block = this->underlying().get_block(size, stream_event);
+    auto const block = get_block(size, stream_event);


❓ question: ‏ Why drop the CRTP indirection here? This doesn't seem related to this PR.

@harrism that's right. But when I was reading the code, what I gathered was, get_block is not implemented by the derived class. It's not mentioned here as well. https://github.com/nirandaperera/rmm/blob/adding_nvtx_pool/cpp/include/rmm/mr/device/detail/stream_ordered_memory_resource.hpp#L70-L76
So, IINM, we can simply call the method, without the indirection.

OK, I see. Good catch.

harrism · 2025-06-11T00:20:57Z

 #endif

+#ifdef RMM_NVTX
+    void* heap_key;


So this adds some overhead on every suballocation. And the insertion into the nvtx_heaps map is a small overhead on upstream allocations.

Can you please benchmark this cost with the random allocations benchmark with NVTX on and off and report it in the PR? Is NVTX enabled by default? Depending on these costs, we may want it off by default.

@harrism Yes, there is an overhead here. In particular

Inserting and querying from the nvtx_heaps_ unordered map.

calling lower_bound on unstream_blocks_ set (which is logarithmic)

I think we can alleviate 2, if we add a void* upstream_ member to the block class, rather than the bool head. Then IINM, is_head() will be upstream_ == ptr_. But then, we are adding additional 3-bytes to the block class.

Do you think its a worthwhile change?

I would like to see benchmarks, if you don't mind. :)

Signed-off-by: niranda perera <niranda.perera@gmail.com>

copy-pr-bot · 2025-06-12T00:03:10Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

wence- · 2025-07-30T15:15:22Z

@nirandaperera Have you had a chance to run the benchmarks Mark was looking for to see any perf differences?

nirandaperera added 2 commits June 9, 2025 17:31

init

4f63732

Signed-off-by: niranda perera <niranda.perera@gmail.com>

dummy test

bb11510

Signed-off-by: niranda perera <niranda.perera@gmail.com>

nirandaperera requested a review from a team as a code owner June 10, 2025 22:09

nirandaperera requested review from bdice and vyasr June 10, 2025 22:09

github-project-automation Bot added this to RMM Project Board Jun 10, 2025

harrism reviewed Jun 11, 2025

View reviewed changes

harrism added non-breaking Non-breaking change feature request New feature or request labels Jun 11, 2025

nirandaperera added 4 commits June 11, 2025 12:23

adding debug logs

fa8227f

Signed-off-by: niranda perera <niranda.perera@gmail.com>

Merge branch 'main' of github.com:rapidsai/rmm into adding_nvtx_pool

6ffbc2d

precommit

a6317d4

Signed-off-by: niranda perera <niranda.perera@gmail.com>

adding example

f1f453c

Signed-off-by: niranda perera <niranda.perera@gmail.com>

nirandaperera requested a review from a team as a code owner June 12, 2025 00:02

github-actions Bot added the CMake label Jun 12, 2025

nirandaperera marked this pull request as draft June 12, 2025 00:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding nvtx memory regions to pool MR#1952

Adding nvtx memory regions to pool MR#1952
nirandaperera wants to merge 6 commits into
rapidsai:branch-25.08from
nirandaperera:adding_nvtx_pool

nirandaperera commented Jun 10, 2025

Uh oh!

harrism Jun 11, 2025

Uh oh!

nirandaperera Jun 11, 2025

Uh oh!

harrism Jun 11, 2025

Uh oh!

harrism Jun 11, 2025

Uh oh!

nirandaperera Jun 11, 2025

Uh oh!

nirandaperera Jun 11, 2025

Uh oh!

harrism Jun 11, 2025

Uh oh!

copy-pr-bot Bot commented Jun 12, 2025

Uh oh!

wence- commented Jul 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

nirandaperera commented Jun 10, 2025

Description

Checklist

Uh oh!

harrism Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

nirandaperera Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

harrism Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

harrism Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

nirandaperera Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

nirandaperera Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

harrism Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

copy-pr-bot Bot commented Jun 12, 2025

Uh oh!

wence- commented Jul 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants