- Generates a caption for a given image and converts the caption into audio.
- Uses a CNN-LSTM model to generate captions for input images and the Google Text-to-Speech API to convert the caption text to speech.
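
The two stages above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: `greedy_caption`, `caption_to_audio`, `toy_predict_next`, the `startseq`/`endseq` tokens, and the tiny vocabulary are all hypothetical names chosen for the example. It assumes the CNN has already turned the image into a feature vector and that the LSTM decoder is exposed as a function returning next-word probabilities. The audio step assumes the third-party `gTTS` package, which is one common way to reach Google Text-to-Speech from Python; the project may use a different client.

```python
from typing import Callable, List, Sequence

# Hypothetical start/end markers; the real tokens depend on how the model was trained.
START, END = "startseq", "endseq"


def greedy_caption(
    features: object,
    predict_next: Callable[[object, List[str]], Sequence[float]],
    vocab: Sequence[str],
    max_len: int = 20,
) -> str:
    """Build a caption word by word, always taking the most probable next word."""
    words = [START]
    for _ in range(max_len):
        probs = predict_next(features, words)
        # Index of the highest-probability word in the vocabulary.
        best = max(range(len(probs)), key=probs.__getitem__)
        next_word = vocab[best]
        if next_word == END:
            break
        words.append(next_word)
    return " ".join(words[1:])  # drop the start marker


def caption_to_audio(caption: str, out_path: str = "caption.mp3") -> str:
    """Save the caption as speech. Assumes `pip install gTTS` and network access."""
    from gtts import gTTS  # imported lazily so the captioning part runs without it

    gTTS(text=caption, lang="en").save(out_path)
    return out_path


# Toy stand-in for the trained CNN-LSTM decoder: it ignores the image features
# and simply emits the vocabulary in order, ending with the END token.
VOCAB = ["a", "dog", "runs", END]


def toy_predict_next(features: object, words: List[str]) -> List[float]:
    idx = min(len(words) - 1, len(VOCAB) - 1)
    return [1.0 if i == idx else 0.0 for i in range(len(VOCAB))]


if __name__ == "__main__":
    caption = greedy_caption(None, toy_predict_next, VOCAB)
    print(caption)
    # caption_to_audio(caption)  # uncomment to write caption.mp3 (needs gTTS)
```

Greedy decoding is used here only because it is the simplest option; beam search is a common alternative for this kind of model and may produce better captions.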
muskaanv0/Image-Audio-captioning