Deep-Learning-Raman-Spectroscopy

Goal of the project

The purpose of this repository is to use Transfer Learning in order to classify patients affected by Amyotrophic Lateral Sclerosis using Raman Spectroscopy.

Dataset :

The main dataset used in this project is composed of 393 spectra belonging to 20 patients. affected by Amyotrophic Lateral Sclerosis (ALS) and 198 to 10 healthy ones (CTRL). The data can be found here.

Notice that the pretrained model for the Transfer Learning experiments come from a bacteria dataset that you can found here.

The project structure :

It is composed of 3 directories:

Bacteria_TL - all the files comes from here and is composed of :
- 3 pretrained models i.e saved parameters for pre-trained CNN (pretrained_model.ckpt, finetuned_model.ckpt and clinical_pretrained_model.ckpt )
- datasets.py - contains code for setting up datasets and dataloaders for spectral data
- resnet.py - contains ResNet CNN model class
- training.py - contains code for training CNN and making predictions
Project Report - contains the final report of the project written in LaTeX using MiKTeX and editing with Texmaker
Raman_Data - 2 sub-folders containing the spectra of ALS et CTRL + a CSV file summing up the patient IDs and the samples IDs
checkpoints - it contains the checkpoints of the features extractor models developped in the notebook 4 and 6

It is also composed of 5 jupyter notebooks :

1_test_models_dataset.ipynb - file to load the data, pre-process it (removing negative values and features selection), plot some of spectra and predict on simple (LogisticRegression, DecisionTree) and a bit more complex (SVM, RandomForest) Machine Learning (ML) models using different splitting techniques (LeaveOneGroupOut and GroupKFold)
2_predictions_with_pretrained_models.ipynb - Using the pretrained models of Bacteria-ID on our dataset to make some predictions using average accuracy and standard deviation.
3_fine_tuning_experiments.ipynb - After a custom splitting dataset technique producing a "finetunable" set and a "test" set, we finetune the predicted models in order to determine the best model and increase our average accuracy.
4_features_extraction.ipynb - Using each pretrained models of Bacteria-ID, features are extracted ("representing" our data), from different layer. Then, two different models are tested on these features : a classical model and a deep one.
5_data_augmentation.ipynb - Using some data augmentation technique on spectral data (offset, multiplication and Gaussian noise), the finetuned models seems to obtained better results.
6_data_augmentation_with_features_extractor - The methods of data augmentation previously used on the finetuned models are now applied to the features extraction method.

Finally the remaining files are :

data_loader.py - a python file to load the data (factorization of code
extractor.py - a modification of the resnet.py file class to extract features (by removing the final layers)

Requirements

The code in this repo has been testing with python 3.6.9 and python 3.6.12 using Anaconda Python distribution.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Deep-Learning-Raman-Spectroscopy

Goal of the project

Dataset :

The project structure :

Requirements

Reference papers

About

Uh oh!

Releases 1

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
Bacteria_TL		Bacteria_TL
Project Report		Project Report
checkpoints		checkpoints
.gitignore		.gitignore
1_test_models_dataset.ipynb		1_test_models_dataset.ipynb
2_predictions_with_pretrained_models.ipynb		2_predictions_with_pretrained_models.ipynb
3_fine_tuning_experiments.ipynb		3_fine_tuning_experiments.ipynb
4_features_extraction.ipynb		4_features_extraction.ipynb
5_data_augmentation.ipynb		5_data_augmentation.ipynb
6_data_augmentation_with_features_extractor.ipynb		6_data_augmentation_with_features_extractor.ipynb
ProjectPresentation.png		ProjectPresentation.png
README.md		README.md
data_loader.py		data_loader.py
extractor.py		extractor.py

nsgln/Deep-Learning-Raman-Spectroscopy

Folders and files

Latest commit

History

Repository files navigation

Deep-Learning-Raman-Spectroscopy

Goal of the project

Dataset :

The project structure :

Requirements

Reference papers

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Uh oh!

Languages

Packages