Smart Audio Transcription and Editing Tool

Introduction

This project combines audio transcription and smart editing into a convenient end-to-end user tool. The smart editing feature parses the transcription, adds appropriate paragraph breaks and corrects grammatical / sentence errors. Users are able to manually edit the text to their liking at any stage of processing. It uses the following models:

Transcription: OpenAI's Whisper model
Post-transcription smart editing: Microsoft's Phi-3 Mini 128K Instruct OpenRouter.AI API

Installation

Clone the repository.
Install python3 and pip.
Inside the project root folder, run python -m venv venv to create a virtual environment.
Activate the virtual environment: run source ./venv/bin/activate.
Inside the project root folder, run pip install -r requirements.txt to install the necessary packages.
To download the Whisper model, run pip install --upgrade --no-deps --force-reinstall git+https://github.com/openai/whisper.git. Follow the instructions on https://github.com/openai/whisper for more information.

Running the web app

Run python -m flask --app tool run --port 8000
Open the web app on localhost:8000 in your Chrome browser.

Technologies

Flask
Python
HTML
JavaScript
CSS

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
tool		tool
user-uploads		user-uploads
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Smart Audio Transcription and Editing Tool

Introduction

Installation

Running the web app

Technologies

About

Uh oh!

Releases

Packages

Languages

koayte/transcribe-edit

Folders and files

Latest commit

History

Repository files navigation

Smart Audio Transcription and Editing Tool

Introduction

Installation

Running the web app

Technologies

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages