Skip to content

Shrink high-RSS tests in testsuite.#711

Merged
maleadt merged 1 commit into
masterfrom
tb/rss
Apr 18, 2026
Merged

Shrink high-RSS tests in testsuite.#711
maleadt merged 1 commit into
masterfrom
tb/rss

Conversation

@maleadt
Copy link
Copy Markdown
Member

@maleadt maleadt commented Apr 18, 2026

When run against CuArray in CUDA.jl's test suite, two tests dominated GPU RSS without exercising anything the smaller sizes wouldn't:

  • linalg/kron: 32x64 * 128x16 produces a 4096x1024 result; halving each operand to 16x32 * 64x8 still covers all opa/opb transpose variants at ~16x less memory.
  • reductions/mapreducedim! large: the 5000x500 / 500x5000 / 1000000 sizes were picked to exceed the multi-element-reading threshold (~86k on typical GPUs). 1000x500 / 500x1000 / 500000 still clears that threshold on realistic devices.

When run against CuArray in CUDA.jl's test suite, two tests dominated
GPU RSS without exercising anything the smaller sizes wouldn't:

- linalg/kron: 32x64 * 128x16 produces a 4096x1024 result; halving each
  operand to 16x32 * 64x8 still covers all opa/opb transpose variants
  at ~16x less memory.
- reductions/mapreducedim! large: the 5000x500 / 500x5000 / 1000000
  sizes were picked to exceed the multi-element-reading threshold
  (~86k on typical GPUs). 1000x500 / 500x1000 / 500000 still clears
  that threshold on realistic devices.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@maleadt maleadt merged commit 2344458 into master Apr 18, 2026
17 of 18 checks passed
@maleadt maleadt deleted the tb/rss branch April 18, 2026 10:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant