Skip to content

Fix CUDA stream reuse in clue filter#20

Open
mikemcqueen wants to merge 4 commits into
masterfrom
non-deterministic
Open

Fix CUDA stream reuse in clue filter#20
mikemcqueen wants to merge 4 commits into
masterfrom
non-deterministic

Conversation

@mikemcqueen

Copy link
Copy Markdown
Owner

Summary

  • fix CUDA stream reuse so per-stream source counts are refreshed when later sums shrink
  • initialize stream stride before device setup and zero stream device pointers
  • size filter scratch buffers to active source count instead of full occupancy

Verification

  • node-gyp rebuild
  • node clues -pf.72 -c2,5 -m3 --xor constitution > /dev/null
  • node clues -pf.72 -c2,5 -m3 --xor constitution -v > /dev/null

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant