Speech-Emotion-Recognition

The.ipynb and.py files in this repo are functionally equivalent; unless otherwise specified, use the.ipynb file as the default.

This project was developed and implemented as part of the course Audio Processing and Indexing hosted by LIACS at Leiden University in the academic year 2022-2023.

This notebook contains three functioning parts.

A frequency spectrogram merging section that illustrate the merging of spectrograms regarding different emotions.
A MFCC extraction section to create MFCC spectrograms used as input in model training.
A CNN network section that contains a model that learns from MFCC inputs and achieves an accuracy of roughly 60%.

The results are analysed with confusion matrix and statistical procedures (e.g. PCA). Some interesting results are found in our project.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Project Demo.pdf		Project Demo.pdf
RAVDESS dataset.txt		RAVDESS dataset.txt
RAVDESS_P.zip		RAVDESS_P.zip
README.md		README.md
Speech_Emo_Reco.ipynb		Speech_Emo_Reco.ipynb
speech_emo_reco.py		speech_emo_reco.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech-Emotion-Recognition

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Speech-Emotion-Recognition

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages