Share simulated memory resource state by bdice · Pull Request #2399 · rapidsai/rmm

bdice · 2026-05-17T23:04:16Z

Description

Makes copied simulated_memory_resource instances share their simulated address cursor and allocation range state. This preserves replay benchmark behavior when the resource is copied into factories or type-erased wrappers.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

copy-pr-bot · 2026-05-17T23:04:19Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

coderabbitai · 2026-05-17T23:07:37Z

📝 Walkthrough

Walkthrough

The simulated_memory_resource class transitions from direct raw pointer members (begin_, end_) to storing its allocation range inside a std::shared_ptr-managed state struct. The public method signatures remain unchanged, but copy and move semantics now exhibit aliasing: instances sharing the state object will observe allocations performed by any co-owner.

Changes

Shared State Refactoring

Layer / File(s)	Summary
Allocation range shared state refactoring `cpp/benchmarks/utilities/simulated_memory_resource.hpp`	`<memory>` header added for `std::shared_ptr`. Private nested `state` struct (lines 120–131) replaces raw `begin_`/`end_` members with `std::shared_ptr<state> state_`. Constructor (lines 35–37) initializes state via `std::make_shared<state>`. The `allocate` method (lines 62–67) accesses range pointers through `state_->begin` and `state_->end` with unchanged bounds-check logic.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 50.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title 'Share simulated memory resource state' directly and clearly summarizes the main change: making copied instances share state via shared_ptr instead of owning independent state.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Description check	✅ Passed	The pull request description clearly explains the purpose of the changes: making copied simulated_memory_resource instances share state to preserve replay benchmark behavior.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

cpp/benchmarks/utilities/simulated_memory_resource.hpp (2)

57-68: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Synchronize the shared allocation cursor.

Now that copies alias state_, concurrent allocate() calls from different instances can read the same begin value and hand out overlapping simulated ranges. Guard the cursor update inside state with a mutex or atomic reservation.

🔒 Proposed fix

+#include <mutex>
+
   void* allocate([[maybe_unused]] cuda::stream_ref stream,
                  std::size_t bytes,
                  [[maybe_unused]] std::size_t alignment = rmm::CUDA_ALLOCATION_ALIGNMENT)
   {
+    std::lock_guard<std::mutex> lock{state_->mutex};
+
     // NOLINTNEXTLINE(cppcoreguidelines-pro-bounds-pointer-arithmetic)
     RMM_EXPECTS(state_->begin + bytes <= state_->end,
                 "Simulated memory size exceeded (failed to allocate " +
                   rmm::detail::format_bytes(bytes) + ")",
                 rmm::bad_alloc);
     auto* ptr = static_cast<void*>(state_->begin);
     state_->begin += bytes;  // NOLINT(cppcoreguidelines-pro-bounds-pointer-arithmetic)
     return ptr;
   }
@@
   struct state {
     state(char* begin, std::size_t memory_size_bytes)
       : begin{begin},
         end{begin + memory_size_bytes}  // NOLINT(cppcoreguidelines-pro-bounds-pointer-arithmetic)
     {
     }
 
+    std::mutex mutex{};
     char* begin{};
     char* end{};
   };

As per coding guidelines, "Ensure thread safety in memory resources: use synchronization primitives (mutex, atomic) when accessing shared data structures like free lists from multiple threads".

Also applies to: 120-131

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@cpp/benchmarks/utilities/simulated_memory_resource.hpp` around lines 57 - 68,
The allocate() implementations are not thread-safe because multiple instances
can alias state_ and concurrently read/advance state_->begin, causing
overlapping allocations; modify both allocate(cuda::stream_ref, std::size_t,
std::size_t) and the other allocate overload (the one around lines 120-131) to
synchronize access to state_->begin by either (a) adding a mutex inside the
shared state object and locking it around the check (state_->begin + bytes <=
state_->end), the pointer capture (ptr = state_->begin) and the advance
(state_->begin += bytes), or (b) implement an atomic reservation loop that uses
an atomic<size_t> for state_->begin and does compare_exchange to reserve bytes;
ensure the chosen sync primitive is part of the shared state structure so copies
of the resource share the same lock/atomic and prevent overlapping ranges.

42-45: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Fix move semantics to prevent dereferencing null state_ after move.

When simulated_memory_resource is moved using the default move operations, the source object's state_ becomes null (as guaranteed for std::shared_ptr). If the moved-from instance is subsequently used, calling allocate() will dereference the null pointer at line 62 or 66, causing undefined behavior.

Implement custom move constructor and assignment operator to share the state instead, ensuring moved-from instances remain valid if accidentally used:

↔️ Proposed fix

-  simulated_memory_resource(simulated_memory_resource&&)                 = default;
-  simulated_memory_resource& operator=(simulated_memory_resource&&)      = default;
+  simulated_memory_resource(simulated_memory_resource&& other) noexcept : state_{other.state_} {}
+  simulated_memory_resource& operator=(simulated_memory_resource&& other) noexcept
+  {
+    state_ = other.state_;
+    return *this;
+  }

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@cpp/benchmarks/utilities/simulated_memory_resource.hpp` around lines 42 - 45,
The defaulted move ctor/operator for simulated_memory_resource lets the source
object's std::shared_ptr state_ become null which causes allocate() (lines that
dereference state_) to UB; replace the defaulted
simulated_memory_resource(simulated_memory_resource&&) and
operator=(simulated_memory_resource&&) with custom implementations that copy
(share) the shared_ptr state_ instead of moving it, so both moved-from and
moved-to instances keep a valid state_; ensure the move-assignment also handles
self-assignment and preserves any existing state_ cleanup semantics used by the
class.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Outside diff comments:
In `@cpp/benchmarks/utilities/simulated_memory_resource.hpp`:
- Around line 57-68: The allocate() implementations are not thread-safe because
multiple instances can alias state_ and concurrently read/advance state_->begin,
causing overlapping allocations; modify both allocate(cuda::stream_ref,
std::size_t, std::size_t) and the other allocate overload (the one around lines
120-131) to synchronize access to state_->begin by either (a) adding a mutex
inside the shared state object and locking it around the check (state_->begin +
bytes <= state_->end), the pointer capture (ptr = state_->begin) and the advance
(state_->begin += bytes), or (b) implement an atomic reservation loop that uses
an atomic<size_t> for state_->begin and does compare_exchange to reserve bytes;
ensure the chosen sync primitive is part of the shared state structure so copies
of the resource share the same lock/atomic and prevent overlapping ranges.
- Around line 42-45: The defaulted move ctor/operator for
simulated_memory_resource lets the source object's std::shared_ptr state_ become
null which causes allocate() (lines that dereference state_) to UB; replace the
defaulted simulated_memory_resource(simulated_memory_resource&&) and
operator=(simulated_memory_resource&&) with custom implementations that copy
(share) the shared_ptr state_ instead of moving it, so both moved-from and
moved-to instances keep a valid state_; ensure the move-assignment also handles
self-assignment and preserves any existing state_ cleanup semantics used by the
class.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 4b8ca723-9e73-4194-bf84-da79170079b0

📥 Commits

Reviewing files that changed from the base of the PR and between b187020 and 12ebc67.

📒 Files selected for processing (1)

cpp/benchmarks/utilities/simulated_memory_resource.hpp

Share simulated memory resource state

12ebc67

bdice added bug Something isn't working non-breaking Non-breaking change labels May 17, 2026

github-project-automation Bot added this to RMM Project Board May 17, 2026

coderabbitai Bot reviewed May 17, 2026

View reviewed changes

bdice mentioned this pull request May 17, 2026

[FEA] Support memory resources from CCCL 3.2 #2011

Open

51 tasks

bdice self-assigned this May 21, 2026

bdice moved this to In Progress in RMM Project Board May 21, 2026

Merge branch 'main' into pr-share-simulated-mr-state

7c01b93

bdice marked this pull request as ready for review May 21, 2026 21:54

bdice requested a review from a team as a code owner May 21, 2026 21:54

bdice requested review from harrism and miscco May 21, 2026 21:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Share simulated memory resource state#2399

Share simulated memory resource state#2399
bdice wants to merge 2 commits into
rapidsai:mainfrom
bdice:pr-share-simulated-mr-state

bdice commented May 17, 2026 •

edited

Loading

Uh oh!

copy-pr-bot Bot commented May 17, 2026

Uh oh!

coderabbitai Bot commented May 17, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bdice commented May 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

copy-pr-bot Bot commented May 17, 2026

Uh oh!

coderabbitai Bot commented May 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

bdice commented May 17, 2026 •

edited

Loading

coderabbitai Bot commented May 17, 2026 •

edited

Loading