nonzero, where fixes from #938 by ClaudiaComito · Pull Request #2332 · helmholtz-analytics/heat

ClaudiaComito · 2026-06-02T04:22:00Z

Due Diligence

General:
- title of the PR is suitable to appear in the Release Notes
Implementation:
- unit tests: all split configurations tested
- unit tests: multiple dtypes tested
- NEW unit tests: MPS tested (1 MPI process, 1 GPU)
- benchmarks: created for new functionality
- benchmarks: performance improved or maintained
- documentation updated where needed

Description

Issue/s resolved: #

Changes proposed:

Type of change

Memory requirements

Performance

Does this change modify the behaviour of other functions? If so, which?

yes / no

brownbaerchen · 2026-06-02T04:42:06Z

+        )
+        # vectorized sorting of nz indices along axis 0
+        global_nonzero.balance_()
+        global_nonzero = manipulations.unique(global_nonzero, axis=0)


Why is this needed? Seems like duplicate entries would be a bug at this point. Or are there some side effects of unique here?
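
One possible side effect, sketched with NumPy rather than Heat: `np.unique` along axis 0 also returns the rows in lexicographic order, so it can act as a sort even when there are no duplicates to remove. The data below is hypothetical, and whether `manipulations.unique` is relied on for exactly this here is an assumption, not something the diff confirms.

```python
import numpy as np

# Hypothetical index rows as they might look after gathering per-process
# results: already free of duplicates, but not globally ordered.
gathered = np.array([[2, 0], [0, 1], [1, 3], [0, 0]])

deduplicated = np.unique(gathered, axis=0)

# No rows were dropped, so there were no duplicates to remove ...
no_rows_dropped = deduplicated.shape == gathered.shape
# ... but the rows now come back in lexicographic order.
print(no_rows_dropped)
print(deduplicated.tolist())
```

If ordering is the only goal, an explicit sort would state the intent more directly than `unique`.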

brownbaerchen · 2026-06-02T04:56:14Z

+        self.assertEqual(len(wh), 2)
+        self.assertEqual(wh[0].gshape[0], 6)
+        self.assertEqual(wh[0].dtype, ht.int64)
+        self.assertEqual(wh[0].split, None)


Maybe this could be compared to numpy and torch too?
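
A minimal sketch of what such a comparison could look like, using NumPy only as the reference. The helper name `assert_matches_numpy_nonzero` and the sample data are hypothetical; in the Heat test suite the candidate would presumably be built from `wh`, for example via `.numpy()` on each entry.

```python
import numpy as np


def assert_matches_numpy_nonzero(index_arrays, data):
    """Check a tuple of per-dimension index arrays against np.nonzero(data)."""
    expected = np.nonzero(data)
    assert len(index_arrays) == len(expected)
    for got, want in zip(index_arrays, expected):
        np.testing.assert_array_equal(np.asarray(got), want)


data = np.array([[0, 3, 0], [4, 0, 5]])

# Stand-in for the result under test (assumption: a tuple with one index
# array per dimension, as on features/nonzero-updates).
candidate = tuple(np.argwhere(data).T)

assert_matches_numpy_nonzero(candidate, data)
```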

brownbaerchen · 2026-06-02T09:54:25Z

The issues arise because this PR completely changes the API of nonzero. See the following example:

import heat as ht
import torch
import numpy as np

a = ht.arange(7)

print(torch.nonzero(a.larray > 3))
# tensor([[4],
#         [5],
#         [6]])

print(np.nonzero(a.numpy() > 3))
# (array([4, 5, 6]),)

print(ht.nonzero(a > 3))
# on main: DNDarray([4, 5, 6], dtype=ht.int64, device=cpu:0, split=None)
# on features/nonzero-updates: (DNDarray(MPI-rank: 0, Shape: (3,), Split: None, Local Shape: (3,), Device: cpu:0, Dtype: int64, Data:
#                                        [4, 5, 6]),)

# new on features/nonzero-updates:
print(ht.nonzero(a > 3, as_tuple=False))
# DNDarray([[4],
#           [5],
#           [6]], dtype=ht.int64, device=cpu:0, split=None)

That is to say, the previous API matched neither numpy nor torch. This PR changes the default to the numpy behavior (a tuple of index arrays), with the option to get the torch behavior via as_tuple=False.
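
The two conventions carry the same information and convert into each other. A small sketch in plain NumPy (not Heat), with hypothetical variable names:

```python
import numpy as np

mask = np.array([[False, True], [True, True]])

# NumPy convention: a tuple with one 1-D index array per dimension.
as_tuple = np.nonzero(mask)

# torch.nonzero-style layout: one row per nonzero element, one column per dimension.
as_matrix = np.stack(as_tuple, axis=1)

# And back again: the columns of the matrix are the per-dimension arrays.
back_to_tuple = tuple(as_matrix.T)

print(as_matrix.tolist())  # [[0, 1], [1, 0], [1, 1]]
```

The tuple form can be used directly for indexing (`mask[as_tuple]`), which is the usual argument for making it the default.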

This is a good thing in principle, but I don't like silently changing the API. Yes, it's annoying that the current main relies on the previous API in a bunch of places and doesn't work with only these changes, but we should still clearly flag this as a breaking API change.

We have two options:

- Adapt all uses of nonzero in the current main to use the new and improved API, then merge this PR
- Don't merge this PR, and instead merge the changes together with the huge advanced indexing PR "Expand distributed indexing, match numpy indexing scheme" (#938)
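
For the first option, call sites that expect the old 1-D return value from the example above would need to unpack the tuple. A hypothetical shim, sketched with NumPy as a stand-in for Heat (the name `nonzero_legacy_1d` is made up for illustration and covers 1-D input only):

```python
import numpy as np


def nonzero_legacy_1d(mask):
    """Return the single index array that old 1-D call sites expect."""
    indices = np.nonzero(mask)  # new-style result: one array per dimension
    if len(indices) != 1:
        raise ValueError("this shim only covers 1-D input")
    return indices[0]


print(nonzero_legacy_1d(np.arange(7) > 3).tolist())  # [4, 5, 6]
```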

Either way, we need to mention the API changes in the release notes. What do you think, @JuanPedroGHM, @mtar, @ClaudiaComito?

nonzero, where changes from #938

0293069

ClaudiaComito added this to the 1.9.0 milestone Jun 2, 2026

github-project-automation Bot added this to Roadmap Jun 2, 2026

ClaudiaComito added the bug and indexing labels Jun 2, 2026

github-project-automation Bot moved this to Todo in Roadmap Jun 2, 2026

ClaudiaComito mentioned this pull request Jun 2, 2026

Expand distributed indexing, match numpy indexing scheme #938

Open

4 tasks

brownbaerchen requested changes Jun 2, 2026


github-project-automation Bot moved this from Todo to In Progress in Roadmap Jun 2, 2026

brownbaerchen added 3 commits June 2, 2026 08:23

Fixed tests

4ea407e

Small refactoring

4075890

Disabling fail-fast

ae15914


ClaudiaComito wants to merge 4 commits into main from features/nonzero-updates


2 participants
