AccentFlow is a real-time accent conversion application that captures live audio, transcribes it into text using Google Web Speech API, and converts it into a specified accent using AWS Polly’s neural text-to-speech capabilities. This app is built using Python with a simple and interactive web interface created using Streamlit.
- Real-Time Speech Transcription
- Accent Conversion using AWS Polly
- Interactive and User-Friendly Web Interface
- Real-Time Feedback and Status Updates
To run AccentFlow on your local machine, ensure the following requirements are met:
- Python 3.7 or higher installed.
- Libraries and Dependencies:
streamlitboto3pyaudioSpeechRecognition
- Audio Input: Ensure your system has a microphone attached or supports audio input.
git clone https://github.com/your-repository-url
cd your-repository-folder
python3 -m venv venv
source venv/bin/activate # On Windows, use 'venv\Scripts\activate'
Run the following command to install all the required dependencies:
pip install streamlit boto3 pyaudio SpeechRecognition
To access AWS Polly, you need to provide your AWS credentials. These can be hardcoded in the script or configured through environment variables. If using temporary credentials, include AWS_SESSION_TOKEN as shown in the script.
export AWS_DEFAULT_REGION="us-west-2"
export AWS_ACCESS_KEY_ID="YOUR_ACCESS_KEY_ID"
export AWS_SECRET_ACCESS_KEY="YOUR_SECRET_ACCESS_KEY"
export AWS_SESSION_TOKEN="YOUR_SESSION_TOKEN" # If using temporary credentials
To start AccentFlow, navigate to the root directory of the project and run the following command:
streamlit run streamlit_app.py
If you are facing issues with PyAudio, make sure you have portaudio installed:
sudo apt-get install portaudio19-dev
Install PyAudio dependencies using Homebrew:
brew install portaudio
pip install pyaudio