Goal
Prepare and validate a clean, reproducible dataset for radiology report generation using MIMIC-CXR.
Scope
- Transform
dicom files to jpg
- Extract image–report pairs
- Remove low-quality or incomplete reports
- Split into train / validation / test using official split
Acceptance Criteria
- Data preprocessing scripts
- Documented dataset statistics
Goal
Prepare and validate a clean, reproducible dataset for radiology report generation using MIMIC-CXR.
Scope
dicomfiles tojpgAcceptance Criteria