Nexus Info Project-3 (AI/ML domain)
Made By: Kumar Shantanu (4th Year CSE Student at KIIT University, Bhubaneswar, Odisha, India)
My Email ID: shantanunitw01@gmail.com
This project focuses on predicting heart disease using machine learning algorithms and evaluates their performance on a dataset. The goal is to identify the most effective model for accurate heart disease prediction.
Heart disease is a significant health concern globally, and accurate prediction models are crucial for timely intervention. This study aims to enhance the reliability of heart disease prediction by comparing and fine-tuning machine learning algorithms.
Heart disease prediction is addressed through the application of various machine-learning algorithms on a comprehensive dataset. The project aims to discover patterns in the data to improve the accuracy of predicting heart disease.
- Compare ML algorithm performance for heart disease prediction.
- Identify the most effective ML approach for accurate prediction.
- Fine-tune models for enhanced accuracy.
- Understanding the Problem
- Data Collection
- Data Preprocessing
- Exploratory Data Analysis (EDA)
- Model Selection
- Best Model Selection
- Hyperparameter Tuning
- Model Training
- Model Testing
- Performance Metrics
- Model Interpretability
- Validation and Testing
- Visualization of Results
- Programming Language: Python
- ML Libraries: Scikit-learn, SHAP
- Data Analysis: Pandas, NumPy, Matplotlib, Seaborn
- Metrics: Accuracy Score, Precision, Recall, F1 Score, SHAP values, Confusion Matrix
- Tools: GridSearchCV
- Visualization: Seaborn, Matplotlib, SHAP
ML models:
- Logistic Regression
- Decision Tree
- Random Forest
- Support Vector Machine
- Random Forest achieved the highest accuracy, 85.37%, making it the best-performing model.
- Other models showed competitive accuracies ranging from 73.17% to 82.93%.
- Random Forest was fine-tuned with optimal hyperparameters, enhancing its accuracy.
- Random Forest's superior performance makes it the preferred choice for heart disease prediction.
- Models were extensively tested and validated to ensure the accuracy, reliability, and robustness of the heart disease prediction system.