
enh - add Y transformation keyword argument in NormativeModel to enforce positivity #372

Merged
contsili merged 26 commits into dev from log_positivity
Mar 5, 2026

Conversation

@contsili
Collaborator

@contsili contsili commented Feb 12, 2026

Closes #356

If Y < -1, then log(Y+1) can't be computed (ValueError), but I expect Y to always be >= -1, since the functionality I added in this PR is only for data that are always positive. If we want to be conservative, we could add a check like:

# Validate that values are > -1 (required for log1p)
min_val = float(np.min(data[var].values))
if min_val <= -1.0:
    raise ValueError(
        f"y_transform='log1p' requires all {var} values > -1 but your {var} are not"
    )

We can still consider adding a test and/or an example.
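As a sketch of how the 'log1p' option and the conservative check above could fit together (the function names here are mine for illustration, not the actual implementation):

```python
import numpy as np

def log1p_transform(y):
    """Forward transform log(1 + y); requires all y > -1 (hypothetical helper)."""
    y = np.asarray(y, dtype=float)
    min_val = float(np.min(y))
    if min_val <= -1.0:
        raise ValueError(
            f"y_transform='log1p' requires all values > -1, but the minimum is {min_val}"
        )
    return np.log1p(y)

def log1p_inverse(z):
    """Inverse transform exp(z) - 1; its range is (-1, inf) by construction."""
    return np.expm1(z)

y = np.array([0.5, 2.0, 10.0])
z = log1p_transform(y)
assert np.allclose(log1p_inverse(z), y)  # round trip recovers the data
```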

@AuguB
Collaborator

AuguB commented Feb 12, 2026

Will this solve the problem addressed in issue #365? As I see it, since the standardization happens after the log transform, this can and probably will still yield negative values, resulting in invalid input to WarpLog.

Additionally, why would we apply a log transform as a preparation for a log transform?
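The standardization concern can be seen with a tiny numpy sketch (the data values here are made up for illustration): z-scoring log-transformed positive data subtracts the mean, so roughly half the values come out negative.

```python
import numpy as np

# Strictly positive data, log-transformed and then z-scored ("standardize").
y = np.array([1.0, 2.0, 5.0, 10.0, 50.0])
z = np.log(y)
z_std = (z - z.mean()) / z.std()

# Values below the mean become negative after standardization, which
# would be invalid input for a subsequent log warp such as WarpLog.
print(z_std.min())
```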

Some ideas:

  • Explicitly disallow standardize + WarpLog; a min-max scaler can be used instead. Informative messages to the user would be helpful here.
  • Add a new scaler that only divides the data by its standard deviation, which amounts to scaling around y=0; positive data stays positive. Such a scaler could be applied automatically when all of the following hold:
    1. Y is strictly positive
    2. BLR with WarpLog as the regression model
    3. The selected outscaler is standardize
    Then we can safely overwrite the outscaler with the new scaler (e.g. scale_only_standardize).

I do agree that the option to add a log transform on Y, as it is implemented now, is a good idea, and it should be kept somehow. I just don't think it really solves the problem of using WarpLog + standardize. EDIT: I realize now that this PR is not intended to solve #365.
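The scale-only scaler from the second idea could be sketched as follows (the class name ScaleOnlyStandardizer is hypothetical, not existing PCNtoolkit API):

```python
import numpy as np

class ScaleOnlyStandardizer:
    """Hypothetical scaler: divide by the standard deviation only.

    Unlike z-scoring, it does not subtract the mean, so strictly
    positive data stays strictly positive after scaling.
    """

    def fit(self, y):
        self.std_ = float(np.std(y))
        return self

    def transform(self, y):
        return np.asarray(y, dtype=float) / self.std_

    def inverse_transform(self, z):
        return np.asarray(z, dtype=float) * self.std_

y = np.array([0.5, 1.0, 3.0, 7.0])
scaler = ScaleOnlyStandardizer().fit(y)
z = scaler.transform(y)
assert (z > 0).all()                          # positivity preserved
assert np.allclose(scaler.inverse_transform(z), y)
```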

@contsili
Collaborator Author

contsili commented Feb 13, 2026

As you mentioned at the end of your previous comment, this PR is not meant to solve #365, but #356. #356 was proposed by @amarquand. It is meant to prevent strictly positive response variables (like brain volumes or WM hypointensities) from having negative centiles (see, for example, fig. 6(e) in [1]).

For #365, for now we implemented something close to your first idea: see #366. However, your second idea is also a good one. We can open an issue to implement it.

@amarquand
Owner

Indeed. We have a user that wants to apply the shash model to strictly positive data, and get the centiles back in the original space. But @AuguB is right that the interaction with the inscaler and outscaler needs to be carefully considered.

Has this been tested?

@contsili
Collaborator Author

contsili commented Feb 13, 2026

I wrote an example script which I have locally.

I used inscaler and outscaler = 'standardize', dataset fcon1000:

1. BLR(heteroskedastic=True)
   Not enforced positivity: [image]
   Enforced positivity: [image]

2. HBR() (default parameters)
   Not enforced positivity: [image]
   Enforced positivity: [image]
@amarquand how do these results look?

@contsili
Collaborator Author

contsili commented Feb 24, 2026

I will add a test script for this feature, and then we are ready to merge it.

contsili added 8 commits March 2, 2026 16:49

  • That way it is clear what is tested:
    main: tests the main normative model functions
    helper: tests the helper functions
  • Add BLR model fixtures (instead of test model fixtures)
  • Adjust assertions: centiles and yhat should be > -1
  • test_normative_model is the same as test_normative_model_helper
@contsili
Collaborator Author

contsili commented Mar 4, 2026

After discussion with @amarquand: I will implement two keyword arguments

  1. y_transform = 'log'
  2. y_transform = 'log1p'

In the case of 'log' we expect all centiles >= 0, whereas with 'log1p' we expect all centiles >= -1.
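These bounds follow directly from the inverse transforms: exp() is strictly positive, and expm1() is strictly greater than -1. A minimal numpy check of that claim:

```python
import numpy as np

# Predictions anywhere in transformed space...
z = np.linspace(-10.0, 10.0, 101)

# 'log':   back-transformed centiles exp(z)   are always > 0
# 'log1p': back-transformed centiles expm1(z) are always > -1
assert (np.exp(z) > 0).all()
assert (np.expm1(z) > -1).all()
```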

@contsili
Collaborator Author

contsili commented Mar 5, 2026

From the tests, the centile plots below were created (model parameters: BLR with linear basis, homoskedastic noise, no warping):

y_transform = None: [image: centiles_response_var_0_from_arrays_harmonized]

y_transform = 'log': [image: centiles_response_var_0_from_arrays_harmonized]

y_transform = 'log1p': [image: centiles_response_var_0_from_arrays_harmonized]

We see that the fit is not good, as the data are non-linear and negatively skewed and we fit a very simple BLR.

An interesting finding is that with y_transform = 'log' all the centiles are positive, but the 95th is quite far from the others. I believe that happens because the model's predictions are made in log space and then converted back to the original scale using exp(), and exp() "inflates" the differences between centiles.

The reason why the 95th centile is more inflated in the log case vs. the log1p case is:
log1p(Y) maps this to [0, ...]
log(Y) maps this to [-inf, ...]

This wider range of log(Y) makes the std of log(Y) a lot higher than that of log1p(Y), leading to an inflated centile = exp(Z * std + mean).
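A quick numeric sketch of that inflation (the std values below are made up for illustration; 1.645 is the z-score of the 95th centile):

```python
import numpy as np

z95 = 1.645   # z-score of the 95th centile
mean = 1.0    # illustrative mean in transformed space

# exp() amplifies differences multiplicatively, so a larger std in the
# transformed space pushes the back-transformed upper centile up sharply.
for std in (0.5, 1.0, 2.0):
    print(std, np.exp(z95 * std + mean))
```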

@contsili contsili merged commit 4970689 into dev Mar 5, 2026
2 checks passed
@contsili contsili deleted the log_positivity branch March 5, 2026 16:51