Add IC ensemble ability to evaluator and update inference aggregators for ensembles by Arcomano1234 · Pull Request #709 · ai2cm/ace

Arcomano1234 · 2026-01-05T19:47:21Z

Adds support for same-initial-condition (IC) ensembles for stochastic models during evaluation and inline inference. Also adds and extends ensemble-based aggregators for step 20 during inference.
Changes:

symbol (e.g. fme.core.my_function) or script and concise description of changes or added feature
Can group multiple related symbols on a single bullet
Tests added

mcgibbon · 2026-01-23T21:32:48Z

fme/ace/aggregator/inference/main.py

                gen_data_norm=gen_data_norm,
                i_time_start=self._n_timesteps_seen,
            )
+        if self.n_ensemble_per_ic > 1:


Check out fme/ace/aggregator/one_step/main.py and see if you like the structure there for how it separates ensemble and deterministic aggregators. I think what's here is fine too, maybe preferable.

Arcomano1234 · 2026-02-03

Either works, but I have a slight preference for the structure I proposed over the one in fme/ace/aggregator/one_step/main.py, since it better allows more complex operations with ensemble aggregators.

mcgibbon · 2026-01-23T21:36:16Z

fme/ace/aggregator/inference/test_main.py

-        "a": torch.ones([batch_size, n_timesteps, nx, ny], device=get_device()),
-        "b": torch.ones([batch_size, n_timesteps, nx, ny], device=get_device()) * 3,
-        "c": torch.ones([batch_size, n_timesteps, nx, ny], device=get_device()) * 4,
+        "a": torch.ones(


Suggestion (optional): Like you do in test_inference_evaluator_aggregator_ensemble, make two BatchData, call .broadcast_ensemble on them, and then use PairedData.from_batch_data instead of constructing an ensemble PairedData at a low level. That would avoid coupling this test to the low-level implementation of the internals of BatchData/PairedData, making it easier to change the way ensembles are handled later if we need to.

fme/ace/aggregator/one_step/ensemble.py

mcgibbon · 2026-01-23T21:39:15Z

fme/ace/data_loading/batch_data.py

            horizontal_dims=self.horizontal_dims,
            epoch=self.epoch,
            labels=self.labels.to(device) if self.labels is not None else None,
+            n_ensemble=self.n_ensemble,


Could separate these changes into their own PR; it looks like they're fixing an existing bug. That would make it easier to tell whether you updated the tests for this (if you did, great; if you didn't, please do).

Maybe for some of these methods we should have a more general "test all attributes that don't start with underscore are identical, except for .device." type test that will always catch these issues, even if we add new attributes.
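As a rough illustration of the general test suggested above, here is a minimal sketch. `ToyBatch`, `public_attributes`, and `assert_same_public_attributes` are hypothetical names invented for this example, not part of fme, and the toy class holds plain Python values. Real tensor-valued attributes would need a tensor-aware comparison instead of `==`.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass
class ToyBatch:
    """Hypothetical stand-in for a batch container; not the real BatchData."""

    data: dict
    epoch: Optional[int] = None
    n_ensemble: int = 1
    device: str = "cpu"

    def to_device(self, device: str) -> "ToyBatch":
        # replace() copies every field, so newly added fields are carried over.
        return replace(self, device=device)


def public_attributes(obj, exclude=()):
    """Collect public, non-callable attributes, skipping names in `exclude`."""
    return {
        name: getattr(obj, name)
        for name in dir(obj)
        if not name.startswith("_")
        and name not in exclude
        and not callable(getattr(obj, name))
    }


def assert_same_public_attributes(before, after, exclude=("device",)):
    """Fail if any public attribute other than the excluded ones differs."""
    assert public_attributes(before, exclude) == public_attributes(after, exclude)


batch = ToyBatch(data={"a": [1.0, 2.0]}, epoch=3, n_ensemble=2)
moved = batch.to_device("cuda")
assert_same_public_attributes(batch, moved)
```

Because the helper discovers attributes with `dir()`, a field added later is compared automatically, so a method that forgets to forward it would fail this check without the test being updated.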

mcgibbon · 2026-01-23T21:41:37Z

fme/ace/data_loading/batch_data.py

+            data=repeat_interleave_batch_dim(self.data, n_ensemble),
+            time=xr.concat([self.time] * n_ensemble, dim="sample"),
            labels=labels,
-            epoch=self.epoch,


It looks like epoch=self.epoch got deleted. Can you remove the changes to this method, which seem to be stylistic? I normally try to construct things before the final return-setting-many-attributes, in the case of larger inits like this one.

mcgibbon · 2026-01-23T21:42:28Z

fme/ace/data_loading/batch_data.py

    def target(self) -> TensorMapping:
        return {k: v for k, v in self.reference.items() if k in self.prediction}

+    def broadcast_ensemble(self) -> tuple[EnsembleTensorDict, EnsembleTensorDict]:


Issue: I would expect .broadcast_ensemble to return the same type, like it does for BatchData. I'd also expect it to be a light wrapper that broadcasts the internal BatchData. Is it possible to do that here? If you do, the test that got refactored to low-level-construct PairedData could have its changes limited to calling the .broadcast_ensemble method on the one it was already making.
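A minimal sketch of the "light wrapper" shape described in this comment, under stated assumptions: `ToyBatch` and `ToyPaired` are hypothetical stand-ins rather than the real BatchData and PairedData, data is held in plain lists instead of tensors, and passing `n_ensemble` as an argument is an assumption made for the example.

```python
from dataclasses import dataclass, replace


def repeat_interleave(values: list, n: int) -> list:
    """Repeat each element n times, keeping order: [a, b] -> [a, a, b, b]."""
    return [v for v in values for _ in range(n)]


@dataclass(frozen=True)
class ToyBatch:
    """Hypothetical stand-in for a batch container."""

    data: dict
    n_ensemble: int = 1

    def broadcast_ensemble(self, n_ensemble: int) -> "ToyBatch":
        # Broadcast along the leading (sample) dimension and return the same type.
        data = {k: repeat_interleave(v, n_ensemble) for k, v in self.data.items()}
        return replace(self, data=data, n_ensemble=n_ensemble)


@dataclass(frozen=True)
class ToyPaired:
    """Hypothetical stand-in for a prediction/reference pair."""

    prediction: ToyBatch
    reference: ToyBatch

    def broadcast_ensemble(self, n_ensemble: int) -> "ToyPaired":
        # Light wrapper: delegate to the inner batches, return the same type.
        return ToyPaired(
            prediction=self.prediction.broadcast_ensemble(n_ensemble),
            reference=self.reference.broadcast_ensemble(n_ensemble),
        )


paired = ToyPaired(
    prediction=ToyBatch(data={"a": [1.0, 2.0]}),
    reference=ToyBatch(data={"a": [3.0, 4.0]}),
)
wide = paired.broadcast_ensemble(2)
```

With this shape, a test that already builds a pair only needs one extra call to get its ensemble version, which keeps it decoupled from how ensembles are stored internally.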

To break up #709 into manageable PRs, this PR adds the ensemble mean aggregator to `one_step/ensemble.py`.
Changes:
- Add `EnsembleMeanRMSEMetric` class to `one_step/ensemble.py`
- [x] Tests added
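For readers unfamiliar with the metric, the following is a small sketch of an ensemble-mean RMSE, assuming the name means "RMSE of the ensemble mean against the target". `ensemble_mean_rmse` is a hypothetical helper written for illustration. It is unweighted and works on plain lists, so it should not be read as the implementation in `one_step/ensemble.py`.

```python
import math


def ensemble_mean_rmse(members, target):
    """RMSE between the mean over ensemble members and the target.

    members: list of ensemble members, each a list of values of equal length.
    target: list of reference values with the same length as each member.
    """
    n_members = len(members)
    # Average across members at each position.
    ensemble_mean = [sum(values) / n_members for values in zip(*members)]
    # Unweighted mean squared error of the ensemble mean.
    squared_errors = [(m - t) ** 2 for m, t in zip(ensemble_mean, target)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Two members, two grid points: the ensemble mean is [2.0, 4.0].
rmse = ensemble_mean_rmse(members=[[1.0, 3.0], [3.0, 5.0]], target=[2.0, 2.0])
```

Here the squared errors are 0 and 4, so the result is the square root of 2.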

Arcomano1234 added 30 commits October 5, 2025 19:09

Add ensemble methods to batch data and add entry point for n_ensemble for inference config

cffe68c

remove weird merge artifact for write.flush vs writer.finalize

56a7a93

get tests passing

b35d4b3

stash getting stepper working

39c0797

stash getting stepper working

cbfed69

get tests passing

ad0df01

remove new typing (not needed for now)

2cded50

add inference test_get_initial_condition ensemble test

8b7ce02

add ensemble tests to predict

377b626

Merge branch 'main' into feature/ensemble-inference

113298e

remove un-intended changes to inference

2f97d2d

address comments

9ff2db5

Merge branch 'main' into feature/ensemble-inference

ae54469

update ensemble tests

173e5d9

merge main into branch

ca3d42c

IC ensembles for evaulator

21cacbc

test ic eval

a1c3cde

add ensemble attribute to other apis in batch data and dataloading tests

fa95256

pass n_ensemble_per_ic to aggregators

3f7a607

paired datawriter needs number of IC ensembles

c8b5c7e

add 20 step ensemble metrics

382ab62

update CRPS to memory efficient version that supports any n_ensemble

f998b9f

Revert "update CRPS to memory efficient version that supports any n_ensemble"

ed33415

This reverts commit f998b9f.

update ensemble CRPS to allow for more than 2 ensembles, fix alpha bug, update tests

210d0e4

record ensemble logs

eb2572b

add ensembles to summary agg

7260524

get tests passing

92921f0

test ERA5 FTed weather skill

5680a40

Fix bug related to paired data

e1ff57a

add tests to main inference agg

89e8e29

Arcomano1234 added 16 commits November 13, 2025 15:58

add ERA5 experiment and remove prints

81c3ee2

1 forward step in memory

d2baa75

only 3 dates

c725205

save wandb log

7d5bc41

remove clone in pin memory

77c2594

Add ensemble tests to evaluator

59f91ee

Add inline inference n_ensemble_per_ic configurable

7ba518c

test inline inference with 2 ensembles

e8311c3

Incorporate comments

8d63059

merge main into branch

ff2d0ee

remove merge artifacts

3dd93a2

add some testing

524d1f8

explicit initialization of batch data

fb65d46

remove print statements

f17b235

remove experiment dir

82186f6

Merge branch 'main' into feature/ensemble-inference-aggs

cb6adb3

mcgibbon reviewed Jan 23, 2026

Arcomano1234 added 3 commits January 24, 2026 10:34

remove prints

4fd8beb

vectorize crps

072b63d

comments

1eb075b

Arcomano1234 mentioned this pull request Feb 3, 2026

Add EnsembleMean Aggregator #793

Merged

1 task

Arcomano1234 added 3 commits February 4, 2026 13:33

merge main into branch

7007389

merge main into branch

bec668a

remove comment

1a9880b

Arcomano1234 added 5 commits February 10, 2026 13:57

Merge branch 'main' into feature/ensemble-inference-aggs

b2d7ba7

remove device call to broadcast_ensemble

88da20e

Merge branch 'main' into feature/ensemble-inference-aggs

720f9e5

Have broadcast_ensemble for PairedData return PairedData and update tests

4ea396c

add inline inference and tests

cc4b342

Arcomano1234 wants to merge 84 commits into main from feature/ensemble-inference-aggs
