An end-to-end Machine Learning pipeline and Tableau dashboard predicting CMS Hospital Star Ratings (1-5) based on clinical and patient experience metrics.
- Ingestion: Automated retrieval of 4,800+ hospital records from CMS Provider Data.
- Processing: Handled schema drift, unified disparate facility IDs, and isolated
Hybrid_HWRreadmission metrics. - Modeling: XGBoost Multi-class classifier (43% accuracy vs 20% baseline) to predict Star Ratings.
- Interpretability: SHAP feature importance revealed Patient Experience (Nurse Communication) as a primary driver alongside Clinical Mortality.
- Language: Python 3
- Libraries:
pandas,xgboost,shap,scikit-learn - Visualization: Tableau Public
- Run
python src/ingestion.pyto load raw CMS data. - Run
python src/processing.pyto build the master analytics table. - Run
python src/modeling.pyto train the model and generate SHAP insights.