add support for multigraph to disjoint sampling by ChuckHastings · Pull Request #5520 · rapidsai/cugraph

ChuckHastings · 2026-05-15T16:40:04Z

Adds multigraph support to the disjoint sampling implementation. Changes include:

- Use of partial results from the first sampling attempt rather than discarding all sampled edges; this should result in faster convergence in edge cases
- Removal of the fast-failure checks for multigraphs
- A couple of additional tests that verify that it works

Closes #5500

copy-pr-bot · 2026-05-15T16:40:08Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

alexbarghi-nv

Tests are passing for disjoint sampling now - approved.

seunghwak

Review the de-duplicate function first.

If I understood the problem correctly, I think we can simplify this code quite a bit. Let me know if I misunderstood the problem.

seunghwak · 2026-05-15T22:35:15Z

  std::optional<rmm::device_uvector<int>> keep_ranks{std::nullopt};
  rmm::device_uvector<vertex_t> keep_majors(result_minors.size(), handle.get_stream());
  std::optional<rmm::device_uvector<int32_t>> keep_labels{std::nullopt};
  rmm::device_uvector<vertex_t> keep_minors(result_minors.size(), handle.get_stream());


A very minor nit, but why define keep_labels before keep_minors? This breaks the (majors, minors, labels) ordering used up to this point.

seunghwak · 2026-05-16T00:51:40Z

+  //    index so it tie-breaks rows that share the same (label, minor).
  rmm::device_uvector<vertex_t> local_positions(keep_minors.size(), handle.get_stream());
  thrust::sequence(
    handle.get_thrust_policy(), local_positions.begin(), local_positions.end(), size_t{0});


In the single-GPU case, local_positions == keep_positions, so isn't this just a duplicate?

seunghwak · 2026-05-16T00:55:41Z

+                     local_positions.end(),
+                     keep_positions.data(),
+                     tmp.begin());
+      keep_positions = std::move(tmp);


In SG, this is just the same as keep_positions = std::move(local_positions), right?

seunghwak · 2026-05-16T01:21:17Z

-                           std::optional<rmm::device_uvector<int32_t>>&& result_labels,
-                           bool call_from_sampling)
+                           std::optional<rmm::device_uvector<int32_t>>&& result_labels)
 {


If I understood correctly, we want to keep just one edge per (label, minor) or minor (if no label).

I think this code is overly complicated for this purpose.

In SG, what we need is

auto key_pair_first = thrust::make_zip_iterator(result_labels->begin(), result_minors.begin());
thrust::sort_by_key(
  handle.get_thrust_policy(),
  key_pair_first,
  key_pair_first + result_labels->size(),
  thrust::make_zip_iterator(result_majors.begin(), tmp_edge_indices->begin()));
auto [keep_count, keep_flags] = detail::mark_entries(
  handle,
  result_labels->size(),
  detail::is_first_in_run_t<decltype(key_pair_first)>{key_pair_first});
thrust::partition(...);  // use keep_flags as stencil
// copy the second half to discard vectors, then the first half becomes keep vectors.
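To make the suggested SG flow concrete, here is a minimal host-side sketch in standard C++ (no Thrust, no cuGraph). The `edge_t` and `split_t` structs and the function name are hypothetical and used only for illustration; the real code works on separate device vectors. It follows the same three steps: sort by (label, minor), flag the first entry of each run, then split into keep and discard using the flags.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <tuple>
#include <vector>

// Hypothetical host-side record; the PR keeps majors/minors/labels in separate vectors.
struct edge_t {
  int32_t label;
  int64_t major;
  int64_t minor;
};

// Hypothetical result type: the "keep" and "discard" halves of the partition.
struct split_t {
  std::vector<edge_t> keep;
  std::vector<edge_t> discard;
};

// Keep exactly one edge per (label, minor); every other edge goes to discard.
inline split_t keep_one_per_label_minor(std::vector<edge_t> edges)
{
  // 1. Sort by the (label, minor) key so duplicates form adjacent runs.
  std::stable_sort(edges.begin(), edges.end(), [](edge_t const& a, edge_t const& b) {
    return std::tie(a.label, a.minor) < std::tie(b.label, b.minor);
  });

  // 2. Flag the first entry of every run (the role mark_entries plays in the comment).
  std::vector<bool> keep_flags(edges.size());
  for (std::size_t i = 0; i < edges.size(); ++i) {
    keep_flags[i] = (i == 0) || (edges[i].label != edges[i - 1].label) ||
                    (edges[i].minor != edges[i - 1].minor);
  }

  // 3. Split using the flags as a stencil.
  split_t out;
  for (std::size_t i = 0; i < edges.size(); ++i) {
    (keep_flags[i] ? out.keep : out.discard).push_back(edges[i]);
  }
  return out;
}
```

Because the sort here is stable, the edge that survives in each run is the one that appeared first in the input; the Thrust version in the comment makes no such promise.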

We need an additional step to discard duplicate (label, minor) edges across GPUs.

If we want to minimize the communication volume, we can sort the keep (label, minor) pairs by the owning GPU ID.

We then call shuffle_values, which returns rx_counts as well. On the receiving side we update keep_flags and send the flags back, using rx_counts as tx_counts. Using the returned flags, we discard the original (label, minor) pairs that did not survive. Finally, go back to the first half: keep an edge there only if its pair is among the surviving (label, minor) pairs, and move it to the discard partition otherwise.
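The flag round trip can be simulated on the host to check the bookkeeping. The sketch below is only an illustration under stated assumptions: ranks are modeled as entries of a vector, the owner of a pair is taken to be `minor % num_ranks` (a hypothetical stand-in for the real vertex partitioning), and the "shuffle" is a plain copy rather than a call to shuffle_values. It stops at the surviving pairs and does not show the final move of edges into the discard partition.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <set>
#include <utility>
#include <vector>

using pair_t = std::pair<int32_t, int64_t>;  // (label, minor)

// keep_pairs[r] holds rank r's locally de-duplicated (label, minor) pairs.
// Returns, per rank, the pairs that survive the cross-rank de-duplication.
inline std::vector<std::vector<pair_t>> resolve_with_flags(
  std::vector<std::vector<pair_t>> keep_pairs)
{
  std::size_t const num_ranks = keep_pairs.size();
  // Hypothetical ownership rule; assumes non-negative minors.
  auto owner = [num_ranks](pair_t const& p) {
    return static_cast<std::size_t>(p.second) % num_ranks;
  };

  // 1. Each rank sorts its pairs by owning rank.
  for (auto& pairs : keep_pairs) {
    std::stable_sort(pairs.begin(), pairs.end(), [&](pair_t const& a, pair_t const& b) {
      return owner(a) < owner(b);
    });
  }

  // 2. Simulated shuffle: owner o receives one block per sending rank;
  //    rx_counts[o][r] is the size of the block that came from rank r.
  std::vector<std::vector<pair_t>> rx(num_ranks);
  std::vector<std::vector<std::size_t>> rx_counts(num_ranks,
                                                  std::vector<std::size_t>(num_ranks, 0));
  for (std::size_t r = 0; r < num_ranks; ++r) {
    for (auto const& p : keep_pairs[r]) {
      rx[owner(p)].push_back(p);
      ++rx_counts[owner(p)][r];
    }
  }

  // 3. Each owner flags the first arrival of every pair, then returns the flags
  //    block by block, reusing rx_counts as the send counts.
  std::vector<std::vector<bool>> returned_flags(num_ranks);
  for (std::size_t o = 0; o < num_ranks; ++o) {
    std::set<pair_t> seen;
    std::size_t offset = 0;
    for (std::size_t r = 0; r < num_ranks; ++r) {
      for (std::size_t i = 0; i < rx_counts[o][r]; ++i) {
        returned_flags[r].push_back(seen.insert(rx[o][offset + i]).second);
      }
      offset += rx_counts[o][r];
    }
  }

  // 4. Each rank keeps only the pairs whose returned flag is set.
  std::vector<std::vector<pair_t>> survivors(num_ranks);
  for (std::size_t r = 0; r < num_ranks; ++r) {
    for (std::size_t i = 0; i < keep_pairs[r].size(); ++i) {
      if (returned_flags[r][i]) { survivors[r].push_back(keep_pairs[r][i]); }
    }
  }
  return survivors;
}
```

The returned flags line up with each rank's owner-sorted pairs because the blocks come back in the same owner order in which they were sent, which is why only flags, not pairs, need to travel on the return trip.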

Or if you want something simpler,

shuffle (label, minor, rank) triplets based on the minor's vertex partition ID. Run sort and unique to keep only one rank value per (label, minor). Shuffle back based on the rank. Now you have the surviving (label, minor) pairs. This involves additional rank values and needs to shuffle back triplets instead of just flags, but it might be simpler.
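The triplet variant is easy to prototype on the host as well. This is a rough sketch, not the PR's code: ranks are vector entries, ownership is assumed to be `minor % num_ranks` (hypothetical), and both shuffles are plain copies. After sorting, the first triplet of each (label, minor) run carries the lowest rank, so that rank wins.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <tuple>
#include <utility>
#include <vector>

using label_minor_t = std::pair<int32_t, int64_t>;             // (label, minor)
using triplet_t     = std::tuple<int32_t, int64_t, std::size_t>;  // (label, minor, rank)

// per_rank_pairs[r] holds rank r's locally de-duplicated (label, minor) pairs.
inline std::vector<std::vector<label_minor_t>> resolve_with_triplets(
  std::vector<std::vector<label_minor_t>> const& per_rank_pairs)
{
  std::size_t const num_ranks = per_rank_pairs.size();

  // 1. Simulated shuffle of triplets to the rank that owns the minor
  //    (hypothetical rule: minor % num_ranks, non-negative minors assumed).
  std::vector<std::vector<triplet_t>> owned(num_ranks);
  for (std::size_t r = 0; r < num_ranks; ++r) {
    for (auto const& p : per_rank_pairs[r]) {
      owned[static_cast<std::size_t>(p.second) % num_ranks].emplace_back(p.first, p.second, r);
    }
  }

  // 2. Sort and unique on each owner so one rank remains per (label, minor),
  // 3. then "shuffle back" each surviving pair to the rank recorded in it.
  std::vector<std::vector<label_minor_t>> survivors(num_ranks);
  for (auto& triplets : owned) {
    std::sort(triplets.begin(), triplets.end());
    auto last = std::unique(triplets.begin(), triplets.end(),
                            [](triplet_t const& a, triplet_t const& b) {
                              return std::get<0>(a) == std::get<0>(b) &&
                                     std::get<1>(a) == std::get<1>(b);
                            });
    for (auto it = triplets.begin(); it != last; ++it) {
      survivors[std::get<2>(*it)].emplace_back(std::get<0>(*it), std::get<1>(*it));
    }
  }
  return survivors;
}
```

Compared with the flag-based variant, every message carries a full triplet in both directions, which is the extra communication the comment mentions.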

seunghwak · 2026-05-16T01:38:38Z

+  rmm::device_uvector<size_t> carryover_frontier_capacity(0, handle.get_stream());

  cugraph::key_bucket_view_t<vertex_t, tag_t, multi_gpu, false> active_bucket_view =
    key_bucket_view;


auto active_key_bucket_view = key_bucket_view;

seunghwak · 2026-05-16T01:41:14Z

            Ks,
            active_major_labels,
            with_replacement);
      }


The else {} branch here can't happen (as bias should be float or double), right?
Better to document this:

else { CUGRAPH_FAIL("should not be reached."); }
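As a small illustration of the pattern, the sketch below makes the "impossible" branch fail loudly instead of falling through silently. Everything in it is a stand-in: `fail_unreachable` plays the role of CUGRAPH_FAIL by throwing, and `bias_kind_t` is a hypothetical runtime tag, not how the PR represents the bias type.

```cpp
#include <cassert>
#include <stdexcept>

// Hypothetical stand-in for CUGRAPH_FAIL: throws rather than using cuGraph's macro.
[[noreturn]] inline void fail_unreachable(char const* msg) { throw std::logic_error(msg); }

// Hypothetical runtime tag for the bias type.
enum class bias_kind_t { float32, float64, unsupported };

// Dispatch on the bias type; the final else documents that no other type is expected.
inline int bias_width_bytes(bias_kind_t kind)
{
  if (kind == bias_kind_t::float32) {
    return 4;
  } else if (kind == bias_kind_t::float64) {
    return 8;
  } else {
    fail_unreachable("should not be reached.");
  }
}
```

If a new bias type is ever added without updating the dispatch, the failure shows up at the first call rather than as silently missing samples.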

seunghwak · 2026-05-16T01:54:23Z

+
+    if (discarded_majors.size() != 0) {
+      size_t const num_types = Ks.size();
+      CUGRAPH_EXPECTS(num_types >= 1, "Ks must be non-empty.");


Shouldn't we check this at the beginning of this function?

seunghwak · 2026-05-16T02:40:24Z

+        carryover_frontier_capacity = std::move(agg_counts);
+      } else {
+        CUGRAPH_EXPECTS(
+          std::holds_alternative<rmm::device_uvector<int32_t>>(discarded_tmp_indices),


We defined edge_type_t at the beginning of this function; shouldn't we use it here?

seunghwak · 2026-05-16T02:41:46Z

+        if (agg_labels) {
+          thrust::sort(handle.get_thrust_policy(),
+                       thrust::make_zip_iterator(agg_labels->begin(), agg_majors.begin()),
+                       thrust::make_zip_iterator(agg_labels->end(), agg_majors.end()));
+        } else {
+          cugraph::detail::sort_ints(
+            handle, raft::device_span<vertex_t>{agg_majors.data(), agg_majors.size()});
+        }


This code is common regardless of num_types; shouldn't we sort before the if/else block?
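A host-side sketch of the hoisting idea, under loud assumptions: `sort_aggregated` mirrors the labeled and unlabeled sort from the diff using standard C++, while `frontier_slots` and the work done in its two branches are purely hypothetical placeholders for whatever the real num_types branches compute. The point is only that the sort runs once, before the branch.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <numeric>
#include <optional>
#include <utility>
#include <vector>

// Sort by (label, major) when labels are present, otherwise by major alone.
inline void sort_aggregated(std::vector<int64_t>& majors,
                            std::optional<std::vector<int32_t>>& labels)
{
  if (labels) {
    // Sort a permutation, then apply it to both vectors so they stay aligned.
    std::vector<std::size_t> order(majors.size());
    std::iota(order.begin(), order.end(), std::size_t{0});
    std::sort(order.begin(), order.end(), [&](std::size_t a, std::size_t b) {
      return std::make_pair((*labels)[a], majors[a]) < std::make_pair((*labels)[b], majors[b]);
    });
    std::vector<int64_t> sorted_majors;
    std::vector<int32_t> sorted_labels;
    for (std::size_t i : order) {
      sorted_majors.push_back(majors[i]);
      sorted_labels.push_back((*labels)[i]);
    }
    majors  = std::move(sorted_majors);
    *labels = std::move(sorted_labels);
  } else {
    std::sort(majors.begin(), majors.end());
  }
}

// Hypothetical caller: the sort is hoisted above the num_types branch.
inline std::size_t frontier_slots(std::vector<int64_t> majors,
                                  std::optional<std::vector<int32_t>> labels,
                                  std::size_t num_types)
{
  sort_aggregated(majors, labels);  // runs once, whichever branch follows

  // Count distinct keys in the sorted data (shared by both placeholder branches).
  std::size_t distinct = 0;
  for (std::size_t i = 0; i < majors.size(); ++i) {
    bool const new_key = (i == 0) || (majors[i] != majors[i - 1]) ||
                         (labels && (*labels)[i] != (*labels)[i - 1]);
    if (new_key) { ++distinct; }
  }

  if (num_types == 1) {
    return distinct;              // placeholder for the single-type path
  } else {
    return distinct * num_types;  // placeholder for the multi-type path
  }
}
```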

add support for multigraph to disjoint sampling

919866f

Merge branch 'release/26.06' into support_multigraph_in_disjoint_samp…

d0818a4

…ling

ChuckHastings self-assigned this May 15, 2026

ChuckHastings added the improvement (Improvement / enhancement to an existing function) and non-breaking (Non-breaking change) labels May 15, 2026

ChuckHastings marked this pull request as ready for review May 15, 2026 16:44

ChuckHastings requested a review from a team as a code owner May 15, 2026 16:44

alexbarghi-nv approved these changes May 15, 2026

seunghwak reviewed May 16, 2026

ChuckHastings wants to merge 2 commits into rapidsai:release/26.06 from ChuckHastings:support_multigraph_in_disjoint_sampling

3 participants
