| title | AI Audio-to-Video Converter |
|---|---|
| emoji | 🎥 |
| colorFrom | blue |
| colorTo | green |
| sdk | docker |
| app_port | 7860 |
| app_file | app.py |
Convert audio to video with synchronized images using Hugging Face models.
- Transcription:
openai/whisper-large-v3 - Image Acquisition: Stable Diffusion or Google scraping with caching
- Video Assembly: MoviePy with FFmpeg
- Enhancements: Robust error handling, image caching, temporary file cleanup
- Upload audio (max 50 min).
- Select video format (9:16, 16:9, or 1:1).
- Choose image method (Scraper or Generator).
- Generate and download the video.
- Pinned
huggingface_hub==0.20.3fordiffuserscompatibility. - Updated
gr.Audiofor Gradio 4.19.2 compatibility. - Added logging and cleanup for improved debugging and resource management.