Describe the task
Refactor the statistical validation code in data/src/validation/ to consistently and efficiently use Pandera's DataFrameSchema for statistical range validation. Currently, the implementation is inconsistent across different validators, with some using Pandera's schema validation while others implement custom statistical checks, leading to code duplication and reduced maintainability. This task involves standardizing all output schema tests to leverage Pandera's built-in functionality and best practices, eliminating redundant statistical validation code by either utilizing existing Pandera features or consolidating repetitive logic into the base validator class.
Acceptance Criteria
Describe the task
Refactor the statistical validation code in
data/src/validation/to consistently and efficiently use Pandera'sDataFrameSchemafor statistical range validation. Currently, the implementation is inconsistent across different validators, with some using Pandera's schema validation while others implement custom statistical checks, leading to code duplication and reduced maintainability. This task involves standardizing all output schema tests to leverage Pandera's built-in functionality and best practices, eliminating redundant statistical validation code by either utilizing existing Pandera features or consolidating repetitive logic into the base validator class.Acceptance Criteria
data/src/validation/to identify inconsistent statistical validation patternsDataFrameSchemawith appropriate statistical checks