[quality] verify: color-aware scoring (SSIM is blind to color/feature loss) #37

@realproject7

Parent: quality EPIC. Root cause: verify.score uses SSIM only, which measures mostly luminance and structure. It returned 0.923 for the shiba output with no face and ignored red→brown color shifts. The loop therefore optimizes the wrong metric and reports a false 'pass'.

Fix: blend SSIM with a color-fidelity term, such as the mean perceptual color distance (ΔE in Lab) or a per-region color-histogram difference, into the score that the loop both optimizes and reports. Recalibrate the --quality default if needed. Optionally add a small-feature/edge term so that deleted faces are penalized.
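
A minimal sketch of the blend described above, assuming the existing SSIM value is computed elsewhere and passed in. The names srgb_to_lab, mean_delta_e, and blended_score, the 0.5 weight, and the ΔE cap of 50 are all hypothetical and not taken from the codebase. It uses the simple CIE76 ΔE (Euclidean distance in Lab, D65 white point); a real implementation might prefer CIEDE2000 and a per-region variant.

```python
import math

# D65 reference white for the XYZ -> Lab conversion.
_XN, _YN, _ZN = 0.95047, 1.0, 1.08883


def _linearize(channel):
    """Undo the sRGB transfer curve for one 0-255 channel value."""
    c = channel / 255.0
    return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4


def _f(t):
    """Piecewise cube-root function from the CIE Lab definition."""
    delta = 6.0 / 29.0
    if t > delta ** 3:
        return t ** (1.0 / 3.0)
    return t / (3 * delta ** 2) + 4.0 / 29.0


def srgb_to_lab(rgb):
    """Convert an (R, G, B) tuple with 0-255 channels to (L, a, b)."""
    r, g, b = (_linearize(v) for v in rgb)
    x = 0.4124564 * r + 0.3575761 * g + 0.1804375 * b
    y = 0.2126729 * r + 0.7151522 * g + 0.0721750 * b
    z = 0.0193339 * r + 0.1191920 * g + 0.9503041 * b
    fx, fy, fz = _f(x / _XN), _f(y / _YN), _f(z / _ZN)
    return (116.0 * fy - 16.0, 500.0 * (fx - fy), 200.0 * (fy - fz))


def mean_delta_e(pixels_a, pixels_b):
    """Mean CIE76 color distance between two equal-length pixel lists."""
    if len(pixels_a) != len(pixels_b) or not pixels_a:
        raise ValueError("pixel lists must be non-empty and the same length")
    total = 0.0
    for pa, pb in zip(pixels_a, pixels_b):
        total += math.dist(srgb_to_lab(pa), srgb_to_lab(pb))
    return total / len(pixels_a)


def blended_score(ssim, mean_de, color_weight=0.5, de_cap=50.0):
    """Blend SSIM with a color term in [0, 1]; both defaults are assumptions.

    A mean ΔE of 0 gives a color term of 1.0; anything at or above
    de_cap gives 0.0, so large color shifts pull the score down.
    """
    color_term = max(0.0, 1.0 - mean_de / de_cap)
    return (1.0 - color_weight) * ssim + color_weight * color_term
```

With this shape, two outputs with the same SSIM no longer tie: the one with the larger mean ΔE scores lower, which is the behavior the acceptance criteria ask for. The weight and cap would need calibrating against the shiba and watercolor fixtures.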

Acceptance: the shiba 'no-face' and watercolor 'desaturated' outputs score materially below a faithful conversion; faithful outputs still score high. Add tests with a color-shifted fixture.

Labels: follow-up (Adversarial second-pass finding), quality (Output quality improvement)