Various Machine Learning models have been trained using the PIMA Diabetes Dataset. The hyperparameter tuning of the model have been done using GridSeearchCV. The trained models include logistic regression, decision tree, random forest, gradient boosting classifier etc.
Dataset Link: https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database
- matplotlib
- pandas
- seaborn
- numpy
- pickle
- scikit-learn
- xgboost
- Null Values Check
- Replacing NA values
- Outliers Detection
- Feature Engineering
- One Hot Encoding
- PairPlot
- HistPlot
- BoxPlot
- Correlation Heatmap
- Clone this Repository
- Downlaod the dataset
- Unzip the dataset in the same folder as python file (.ipynb)
- run the file OR
- use the trained model (gbc.pkl)



