You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
eval: add typed training config to Hydra and feedback requests (#38)
## Summary
- add a nested training section to eval Hydra config defaults
- add explicit eval-side training schema and convert it once into
runtime TrainingConfig
- enforce strict type invariance for runtime (HarnessConfig.training
must be TrainingConfig)
- pass typed training config through every FeedbackItem generated by
eval runner
- update eval docs and tests for nested overrides (training.is_clip,
training.learning_rate)
## Validation
- uv run pytest tests/test_eval_config.py tests/test_eval_runner.py -q
- 23 passed
<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit
* **New Features**
* Added configurable training parameters (learning rate, alpha,
clipping, max-grad-norm, KL weight, teacher_top_k) and runtime
validation for training settings.
* CLI and programmatic interfaces now support overriding nested training
hyperparameters and explicit output directory.
* **Configuration**
* Batch processing frequency increased to 4 steps per batch.
* Default output directory now uses a timestamped path when not
overridden.
* **Tests**
* Added tests covering training config validation, overrides, and
mismatch rejection.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
0 commit comments