Align validation step with checkpoint step by licesma · Pull Request #54 · physical-superintelligence-lab/Psi0

licesma · 2026-06-03T19:59:57Z

Validation was triggered on global_step % validation_steps == 0, while checkpointing
uses (global_step + 1) % checkpointing_steps == 0. This one-step offset means the
eval loss logged near a checkpoint does not correspond to that checkpoint's weights.

As a side effect, validation no longer runs at global_step 0, i.e. on the barely-trained
model right after the first step.
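
To make the offset concrete, here is a minimal sketch that lists which steps each condition fires on. It is not the repository's training loop: the helper names and the interval of 5 are assumptions for illustration, and only the two modulo conditions are taken from the description above.

```python
# Illustrative only: helper names and the interval are assumptions,
# not code from the Psi0 repository.

def old_validation_steps(total_steps: int, validation_steps: int) -> list[int]:
    """Steps where the pre-PR condition `global_step % validation_steps == 0` fires."""
    return [s for s in range(total_steps) if s % validation_steps == 0]


def checkpoint_steps(total_steps: int, checkpointing_steps: int) -> list[int]:
    """Steps where `(global_step + 1) % checkpointing_steps == 0` fires."""
    return [s for s in range(total_steps) if (s + 1) % checkpointing_steps == 0]


def aligned_validation_steps(total_steps: int, validation_steps: int) -> list[int]:
    """Steps where validation fires once it uses the same `+ 1` convention."""
    return [s for s in range(total_steps) if (s + 1) % validation_steps == 0]


print(old_validation_steps(10, 5))      # [0, 5]  -> one step after each checkpoint
print(checkpoint_steps(10, 5))          # [4, 9]
print(aligned_validation_steps(10, 5))  # [4, 9]  -> same steps as the checkpoints
```

With equal intervals, the old condition validates one step after each checkpoint is written, and it also fires at step 0; the aligned condition fires on the checkpoint steps and skips step 0.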

songlin

It is helpful to run the validation code path in the first training loop iteration. Can you revise the code again?
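
One possible way to address this request, sketched under assumptions: keep the checkpoint-aligned condition and additionally run validation on the first loop iteration. The function `should_validate` and the flag `validate_first_step` are hypothetical names chosen for this sketch; they do not come from the PR or the repository.

```python
# Hypothetical sketch: `should_validate` and `validate_first_step` are
# illustrative names, not identifiers from the Psi0 code base.

def should_validate(
    global_step: int,
    validation_steps: int,
    validate_first_step: bool = True,
) -> bool:
    """Return True on checkpoint-aligned steps, plus optionally on step 0."""
    # Exercise the validation code path once at the very start of training.
    if validate_first_step and global_step == 0:
        return True
    # Same `+ 1` convention as the checkpointing condition.
    return (global_step + 1) % validation_steps == 0


print([s for s in range(10) if should_validate(s, 5)])  # [0, 4, 9]
print([s for s in range(10) if should_validate(s, 5, validate_first_step=False)])  # [4, 9]
```

The early run at step 0 does not line up with any checkpoint, so its eval loss would only serve to exercise the validation path, while later runs stay aligned with the saved weights.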

Align validation step with checkpoint

b575a48

songlin requested changes Jun 21, 2026

licesma wants to merge 1 commit into physical-superintelligence-lab:main from licesma:fix/align_validation_with_checkpoint
