Screen.Recording.2025-03-16.115435.mp4
AI Video Summarizer is a desktop application that automatically transcribes and summarizes video content using advanced AI models. This tool uses OpenAI's Whisper for speech recognition and Hugging Face's T5 transformer model for text summarization, all wrapped in a user-friendly PyQt5 interface.
- Video Upload: Easily select video files from your computer
- Automatic Transcription: Converts speech to text using Whisper AI
- Smart Summarization: Generates concise summaries using T5 transformer
- Copy & Save: Options to copy summaries to clipboard or save to file
- Progress Tracking: Visual progress bar shows processing status
- Modern UI: Clean and intuitive interface with dark mode design
- Python 3.x: Core programming language
- PyQt5: GUI framework
- OpenAI Whisper: State-of-the-art speech recognition model
- Hugging Face Transformers: NLP models for text summarization
- MoviePy: Video processing library
- QtAwesome: Icon pack for UI elements
- Clone this repository:
git clone https://github.com/mutassimalzeem/ai-video-summarizer.git
cd ai-video-summarizer
- Install the required packages:
pip install PyQt5 qtawesome openai-whisper transformers moviepy torch
- Run the application:
python ai-video-summarizer.py
- Launch the application
- Click the "Upload Video" button to select a video file
- Wait for the AI to process the video (this may take several minutes depending on video length)
- View the generated summary in the text area
- Use the "Copy Summary" button to copy to clipboard, or "Save Summary" to save as a text file
- Python 3.8 or higher
- 8GB+ RAM recommended
- NVIDIA GPU with CUDA support (optional, for faster processing)
- 500MB+ disk space for models
ai-video-summarizer/
├── video_summarizer.py # Main application file
├── requirements.txt # Required Python packages
├── README.md # This file
└── temp/ # Temporary folder for audio processing
- Add support for multiple languages
- Implement batch processing for multiple videos
- Add option to customize summary length
- Create export options in different formats (PDF, DOCX)
- Implement more sophisticated summarization techniques
MIT License - See LICENSE file for details
- OpenAI Whisper for the speech recognition model
- Hugging Face for the T5 summarization model
- PyQt5 for the application framework
- Mutassim Al Shahriar Zeem - Initial work and maintenance
For questions, suggestions, or collaborations, please open an issue on this repository or contact mutassimalshahriar@gmail.com.
