Skip to content

Tejeshyewale/CareGuard-ML

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

1 Commit
Β 
Β 

Repository files navigation

πŸ₯ CareGuard-ML: Hospital Readmission Risk Prediction System

MedRisk is a machine learning–based clinical decision support system that predicts the risk of a patient being readmitted to the hospital within 30 days of discharge. The goal is to help healthcare providers identify high-risk patients early and take preventive care actions.


πŸ“Œ Problem Statement

Hospital readmissions are costly and often preventable. Identifying high-risk patients before discharge allows hospitals to:

  • Provide additional monitoring
  • Schedule early follow-ups
  • Improve quality of care
  • Reduce healthcare costs

This project uses historical patient data to build a predictive model that estimates readmission risk.


🎯 Project Objective

To develop a machine learning model that:

  • Analyzes patient demographics, medical history, and hospital stay details
  • Predicts the likelihood of 30-day readmission
  • Provides interpretable results to support clinical decision-making

🧠 Machine Learning Approach

This is a binary classification problem:

  • 0 β†’ No readmission
  • 1 β†’ Readmission within 30 days

ML Techniques Used

  • Data preprocessing and feature engineering

  • Handling missing and categorical data

  • Imbalanced data handling using SMOTE

  • Model training using:

    • Logistic Regression
    • Random Forest
    • XGBoost (final model)
  • Model evaluation using:

    • Precision, Recall, F1-score
    • ROC-AUC score
  • Model interpretability using feature importance / SHAP


πŸ“‚ Dataset

The dataset contains historical hospital records with features such as:

  • Patient age and gender
  • Admission type and length of stay
  • Number of lab tests and diagnoses
  • Medication details
  • Previous hospital visits

Note: This project uses publicly available healthcare-style datasets for educational and research purposes.


βš™οΈ Tech Stack

  • Python – Core programming language
  • Pandas & NumPy – Data manipulation
  • Scikit-learn – ML models and preprocessing
  • XGBoost – Advanced gradient boosting model
  • Imbalanced-learn (SMOTE) – Handling class imbalance
  • SHAP / Feature Importance – Model explainability
  • Streamlit / Flask – Web app interface (optional)

πŸ“Š Model Evaluation Metrics

Because readmission prediction is an imbalanced problem, accuracy alone is not reliable. The following metrics are used:

  • Recall – To identify as many high-risk patients as possible
  • Precision – To avoid unnecessary false alarms
  • F1 Score – Balance between precision and recall
  • ROC-AUC – Overall model discrimination ability

πŸ–₯️ System Workflow

  1. Load and preprocess hospital dataset
  2. Handle missing values and encode categorical features
  3. Balance data using SMOTE
  4. Train multiple ML models
  5. Select best-performing model (XGBoost)
  6. Evaluate performance using medical-relevant metrics
  7. Interpret predictions using SHAP/feature importance
  8. Deploy as a simple web application for risk prediction

πŸš€ Future Improvements

  • Use real-time hospital EHR data
  • Add lab result trends and vital signs
  • Integrate with hospital management systems
  • Perform external validation across multiple hospitals

⚠️ Disclaimer

This project is for educational and research purposes only. It is not intended to replace professional medical judgment. The system is designed as a decision-support tool to assist healthcare providers.


πŸ“Œ Author

Developed as a machine learning healthcare analytics project to demonstrate real-world ML problem solving, model interpretability, and responsible AI usage.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors