Skip to content

A machine learning project for Parkinson’s disease detection and analysis using health data. Includes classification (healthy vs Parkinson’s) and regression (severity prediction) models, with pipelines, evaluation metrics, and documentation.

Notifications You must be signed in to change notification settings

Parkinson-ML/Parkinson

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🧠 Parkinson's Disease Detection

This project applies Machine Learning (ML) techniques to detect and analyze Parkinson's disease.
It includes both classification and regression models, providing valuable insights for early diagnosis and disease progression tracking.


📂 Project Structure

Parkinson/
├── Model_Training/
│   ├── Classification.ipynb           # Jupyter notebook for classification models
│   ├── Regression.ipynb               # Jupyter notebook for regression models
│
├── Model_Pipeline/
│   ├── Script_classification.ipynb    # Jupyter notebook for classification pipeline
│   ├── Script_regression.ipynb        # Jupyter notebook for regression pipeline
│
├── Dataset/
│   ├── Dataset_Description.md         # Detailed dataset documentation
│   ├── parkinsons_disease_data_cls.csv  # Dataset used for training/testing
│
├── Parkinsons_Project_Documentation.pdf # Project documentation (PDF)
│
└── README.md                           # Project documentation (Markdown)


🎯 Objectives

  • Detect Parkinson's disease using patient biomedical voice data and health records.
  • Compare and evaluate different machine learning algorithms.
  • Perform both classification (disease vs. no disease) and regression (severity prediction).
  • Provide interpretable metrics to support medical decision-making.

📑 Dataset Overview

The dataset provides comprehensive health information for patients (IDs 3058–5162) who underwent examination for Parkinson’s disease diagnosis.
It includes demographic, lifestyle, medical, cognitive, functional, and symptom-related features.

1. Patient Information

  • PatientID: Unique identifier (3058–5162)
  • Demographics: Age (50–90), Gender, Ethnicity, Education level
  • Lifestyle: BMI, Smoking status, Alcohol consumption (weekly units), Physical activity (hours/week), Diet quality score, Sleep quality score

2. Medical History

  • Family history of Parkinson’s
  • Traumatic brain injury
  • Hypertension, Diabetes, Depression, Stroke

3. Clinical Measurements

  • Blood Pressure: Systolic (90–180 mmHg), Diastolic (60–120 mmHg)
  • Cholesterol: Total (150–300), LDL (50–200), HDL (20–100), Triglycerides (50–400)

4. Cognitive & Functional Assessments

  • UPDRS: Unified Parkinson’s Disease Rating Scale (0–199; higher = worse)
  • MoCA: Montreal Cognitive Assessment (0–30; lower = impairment)
  • Functional Assessment: 0–10; lower = impairment

5. Symptoms

  • Tremor, Rigidity, Bradykinesia (slowness of movement)
  • Postural instability
  • Speech problems
  • Sleep disorders
  • Constipation

6. Confidential Information

  • DoctorInCharge: Confidential field with anonymized placeholder (DrXXXConfid)

7. Diagnosis

  • Diagnosis: Binary outcome → 1 = Parkinson’s, 0 = No Parkinson’s

🚀 Usage

1. Run Jupyter Notebooks

Start Jupyter Lab or Notebook:

jupyter notebook

Open:

  • Classification.ipynb → Train & evaluate classification models
  • Regression.ipynb → Train & evaluate regression models
  • Script_classification.ipynb → Run the trained classification model pipeline on new data
  • Script_regression.ipynb → Run the trained regression model pipeline on new data

📊 Evaluation Metrics

  • Classification Models:

    • Accuracy ✅
    • Precision, Recall, F1-score 📈
    • ROC-AUC 🔵
  • Regression Models:

    • Mean Squared Error (MSE)
    • Mean Absolute Error (MAE)
    • R² Score

📌 Results

  • Achieved ~95% accuracy in classification.
  • Regression models provided good estimates of disease progression severity.

(Detailed results are available inside the notebooks.)


🔮 Future Improvements

  • Incorporate deep learning models (e.g., LSTMs for time-series voice data).
  • Enhance feature engineering with signal processing techniques.
  • Deploy as a web-based diagnostic tool for clinicians.

🤝 Contributing

Hossam Ahmed
Fatma Ahmed
Maria Kaiser
Nouran Haitham
Nora Ahmed
Christin Medhat

About

A machine learning project for Parkinson’s disease detection and analysis using health data. Includes classification (healthy vs Parkinson’s) and regression (severity prediction) models, with pipelines, evaluation metrics, and documentation.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •