🩺 AI Doctor 2.0 — Multimodal Health Assistant

AI Doctor 2.0 is an experimental AI-powered health assistant capable of listening, seeing, and responding intelligently through a unified multimodal interface. It integrates speech, image, and text modalities using advanced AI models and a user-friendly Gradio interface.

🚀 Features

🎙️ Voice Input — Real-time speech-to-text using OpenAI Whisper
🖼️ Image Input — Vision context processed using LLaMA Instruct (Base64 encoded)
🧠 AI Response — Powered by Groq LLM for ultra-fast and accurate reasoning
🔊 Text-to-Speech Output — Converts AI responses to voice using gTTS
💻 Web UI — Seamless interaction via Gradio

🧩 Tech Stack

Component	Technology
Speech-to-Text	OpenAI Whisper
Image Processing	LLaMA Instruct
LLM Inference	Groq LLM
Text-to-Speech	gTTS
Web UI	Gradio
Backend	Python

⚙️ Setup Instructions

1. Clone the Repository

git clone https://github.com/Coder-010506/ai-doctor-2.0.git
cd ai-doctor-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
__pycache__		__pycache__
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
download.jpeg		download.jpeg
final.mp3		final.mp3
gradio_app.py		gradio_app.py
information.py		information.py
user.py		user.py
voice_of_the_doctor.py		voice_of_the_doctor.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🩺 AI Doctor 2.0 — Multimodal Health Assistant

🚀 Features

🧩 Tech Stack

⚙️ Setup Instructions

1. Clone the Repository

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🩺 AI Doctor 2.0 — Multimodal Health Assistant

🚀 Features

🧩 Tech Stack

⚙️ Setup Instructions

1. Clone the Repository

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages