[ENH] Fix binary IG calculation to respect identical split values by musaqlain · Pull Request #3206 · aeon-toolkit/aeon

musaqlain · 2025-12-26T21:31:40Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

This PR fixes a logic error in ShapeletTransform where the binary Information Gain (IG) calculation was incorrectly evaluating split points between identical distance values.

Problem:
The _calc_binary_ig function iterated through every index in the sorted orderline to test split points. It failed to check if the current distance was equal to the next distance (orderline[i].distance == orderline[i+1].distance). This caused the algorithm to calculate splits inside groups of identical values, which is mathematically invalid and resulted in inflated IG scores (e.g., returning ~0.97 instead of ~0.42 for the reproduction case).

Solution:
Added a check to strictly skip invalid split points where the distance values are identical.

if orderline[split][0] == orderline[split+1][0]:
    continue
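
To illustrate the effect, here is a minimal standalone sketch, not the aeon implementation. It assumes the orderline is a list of `(distance, label)` pairs sorted by distance, matching the `orderline[split][0]` indexing above. The names `entropy`, `best_binary_ig`, and `skip_ties`, and the toy data, are hypothetical. The loop stops one short of the last index because a split after the final element would leave an empty right-hand side.

```python
import math
from collections import Counter


def entropy(counts):
    """Shannon entropy (in bits) of a mapping from class label to count."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum(
        (c / total) * math.log2(c / total) for c in counts.values() if c > 0
    )


def best_binary_ig(orderline, skip_ties=True):
    """Best information gain over the split points of a sorted orderline.

    orderline: list of (distance, label) pairs sorted by distance.
    skip_ties: if True, never split between two equal distance values.
    """
    total = Counter(label for _, label in orderline)
    parent_entropy = entropy(total)
    n = len(orderline)
    left = Counter()
    best = 0.0
    for split in range(n - 1):
        # Element `split` always moves to the left side, even when the
        # split itself is skipped, so the running counts stay correct.
        left[orderline[split][1]] += 1
        if skip_ties and orderline[split][0] == orderline[split + 1][0]:
            continue
        right = total - left  # Counter subtraction keeps positive counts only
        n_left = split + 1
        ig = (
            parent_entropy
            - (n_left / n) * entropy(left)
            - ((n - n_left) / n) * entropy(right)
        )
        best = max(best, ig)
    return best


# Hypothetical toy data: the two middle cases share the distance 1.0
# but belong to different classes.
toy_orderline = [(0.0, 0), (1.0, 0), (1.0, 1), (2.0, 1)]
ig_with_tie_splits = best_binary_ig(toy_orderline, skip_ties=False)
ig_without_tie_splits = best_binary_ig(toy_orderline, skip_ties=True)
print(ig_with_tie_splits, ig_without_tie_splits)
```

On this toy orderline, allowing the split between the two tied distances gives a perfect gain of 1.0, even though no distance threshold can separate two cases at the same distance. Skipping tied split points gives roughly 0.311. These figures are for the toy data only and are unrelated to the ~0.97 and ~0.42 values from the reproduction case.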

aeon-actions-bot · 2025-12-26T21:32:00Z

Thank you for contributing to `aeon`

I have added the following labels to this PR based on the title: [ enhancement ].
I have added the following labels to this PR based on the changes made: [ transformations ]. Feel free to change these if they do not properly represent the PR.

The Checks tab will show the status of our automated tests. You can click on individual test runs in the tab or "Details" in the panel below to see more information if there is a failure.

If our pre-commit code quality check fails, any trivial fixes will automatically be pushed to your PR unless it is a draft.

Don't hesitate to ask questions on the aeon Slack channel if you have any.

PR CI actions

These checkboxes will add labels to enable/disable CI functionality for this PR. This may not take effect immediately, and a new commit may be required to run the new configuration.

Run pre-commit checks for all files
Run mypy typecheck tests
Run all pytest tests and configurations
Run all notebook example tests
Run numba-disabled codecov tests
Stop automatic pre-commit fixes (always disabled for drafts)
Disable numba cache loading
Regenerate expected results for testing
Push an empty commit to re-run CI checks

musaqlain · 2026-02-04T02:03:43Z

PTAL @MatthewMiddlehurst @TonyBagnall

MatthewMiddlehurst · 2026-02-07T20:39:04Z

This is less of a fix and more of a change to the algorithm itself. I would want to see it evaluated on benchmark datasets to see how it changes results before going forward. Either way, skipping the last IG check does not seem correct.

TonyBagnall · 2026-02-28T22:24:17Z

Hi, I looked at this and I don't really think the change is necessary. It's a design decision, and it had no impact on performance when I tested it, so for now I'll go by "if it ain't broke, don't fix it". I'll convert to draft, but if you can show it adds value, I'm happy to switch back.

[ENH] Fix binary IG calculation to skip identical split points (Issue a…

0eda90d

…eon-toolkit#1322)

musaqlain requested review from MatthewMiddlehurst and TonyBagnall as code owners December 26, 2025 21:31

aeon-actions-bot added the enhancement and transformations labels Dec 26, 2025

TonyBagnall marked this pull request as draft February 28, 2026 22:24

MatthewMiddlehurst mentioned this pull request Mar 23, 2026

[ENH] ShapeletTransform: binary ig calculation problem #1322

Open

musaqlain wants to merge 1 commit into aeon-toolkit:main from musaqlain:fix/1322-shapelet-ig-calculation
