Speech Recognition with Python: Fourier Transform, Spectrograms, and MFCCs

This repository contains Python scripts for analyzing audio signals and extracting features for speech recognition and other machine learning tasks. The code demonstrates recording audio, visualizing waveforms, generating spectrograms, and extracting Mel-Frequency Cepstral Coefficients (MFCCs).

The repository also includes a worked Fourier Transform example, provided as LaTeX source and as a compiled PDF, which explains mathematically and visually how a time-domain signal maps to the frequency domain.
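As a quick illustration of the time-to-frequency idea, the sketch below builds a synthetic two-tone signal and uses NumPy's FFT to find its strongest frequency component. It is not taken from the repository's notebook or PDF; the function name `dominant_frequency`, the 8000 Hz sample rate, and the chosen tones are assumptions made for the example.

```python
import numpy as np


def dominant_frequency(signal, sample_rate):
    """Return the frequency in Hz of the strongest component of a real signal."""
    spectrum = np.fft.rfft(signal)                      # one-sided FFT for real input
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    return float(freqs[np.argmax(np.abs(spectrum))])    # bin with the largest magnitude


# Assumed test signal: one second of a 440 Hz tone plus a quieter 1000 Hz tone.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 1000 * t)

peak_hz = dominant_frequency(tone, sample_rate)
print(f"Strongest component: {peak_hz:.1f} Hz")
```

With one second of audio the frequency bins are 1 Hz apart, so both tones fall exactly on a bin and the louder 440 Hz tone is reported as the peak.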

Features

Record and save audio as a WAV file.
Visualize time-domain waveforms using Matplotlib.
Generate spectrograms to analyze frequency variations over time.
Extract MFCCs, a compact representation of a signal's short-term spectral envelope, as input features for machine learning.
Understand the Fourier Transform with provided LaTeX and PDF documentation.
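The features above can be sketched end to end roughly as follows. This is a minimal outline using the libraries listed under Requirements, not the repository's actual code: the function names, the 16 kHz sample rate, the 3-second duration, the file name `recording.wav`, and the choice of 13 coefficients are all assumptions. Heavy imports are done inside each function so that the file can be loaded on a machine without an audio device.

```python
import numpy as np

SAMPLE_RATE = 16000  # Hz; assumed value, not taken from the repository


def record(seconds, sample_rate=SAMPLE_RATE):
    """Record mono audio from the default input device as a 1-D float32 array."""
    import sounddevice as sd  # lazy import: requires a working audio backend
    audio = sd.rec(int(seconds * sample_rate), samplerate=sample_rate,
                   channels=1, dtype="float32")
    sd.wait()  # block until the recording has finished
    return audio[:, 0]


def save_wav(path, audio, sample_rate=SAMPLE_RATE):
    """Write the samples to a WAV file (32-bit float format)."""
    from scipy.io import wavfile
    wavfile.write(path, sample_rate, audio.astype(np.float32))


def plot_waveform_and_spectrogram(audio, sample_rate=SAMPLE_RATE):
    """Plot the time-domain waveform above a spectrogram of the same signal."""
    import matplotlib.pyplot as plt
    fig, (ax_wave, ax_spec) = plt.subplots(2, 1, figsize=(8, 6))
    t = np.arange(len(audio)) / sample_rate
    ax_wave.plot(t, audio)
    ax_wave.set(xlabel="Time (s)", ylabel="Amplitude")
    ax_spec.specgram(audio, NFFT=512, Fs=sample_rate, noverlap=256)
    ax_spec.set(xlabel="Time (s)", ylabel="Frequency (Hz)")
    fig.tight_layout()
    return fig


def extract_mfcc(audio, sample_rate=SAMPLE_RATE, n_mfcc=13):
    """Return an array of shape (n_mfcc, n_frames) of MFCCs."""
    import librosa
    return librosa.feature.mfcc(y=audio, sr=sample_rate, n_mfcc=n_mfcc)


if __name__ == "__main__":
    import matplotlib.pyplot as plt

    audio = record(seconds=3)
    save_wav("recording.wav", audio)  # hypothetical output file name
    plot_waveform_and_spectrogram(audio)
    print("MFCC shape:", extract_mfcc(audio).shape)
    plt.show()
```

Each column of the MFCC array describes one short frame of audio, so the matrix (or a summary of it, such as the per-coefficient mean) can be passed to a classifier.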

Applications

Speech recognition
Sentiment analysis
Speaker identification
Audio classification

Requirements

Python 3.x
Libraries:
- numpy
- matplotlib
- librosa
- sounddevice
- scipy

Install all dependencies with:

pip install -r requirements.txt
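
After installing, one way to confirm that the listed libraries can be found is a short check like the one below. It is a convenience sketch, not part of the repository; the helper name `missing_packages` is hypothetical, and the check only confirms that each package is locatable, not that it works with your audio hardware.

```python
from importlib.util import find_spec

# Package names copied from the Requirements list above.
REQUIRED = ["numpy", "matplotlib", "librosa", "sounddevice", "scipy"]


def missing_packages(names=REQUIRED):
    """Return the subset of names that cannot be located for import."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All dependencies found.")
```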

