Gemini Live API via Phone call (with Twilio)

This project demonstrates a real-time voice conversation over a phone call, using Twilio for telephony and Google's Gemini Multimodal Live API (via Vertex AI) for the conversation.

Architecture

1. Twilio: Handles the phone call and streams audio to this server via WebSocket.
2. FastAPI Server: Receives audio from Twilio, transcodes it, and sends it to Gemini.
3. Gemini (Vertex AI): Processes the audio and returns a generated audio response.
4. FastAPI Server: Receives audio from Gemini, transcodes it back, and sends it to Twilio.

Prerequisites

Python 3.10+
A Google Cloud Project with the Vertex AI API and the Cloud Run API enabled.
Google Cloud CLI installed and authenticated (gcloud auth login).
A Twilio Account and a purchased phone number.

Setup & Deployment

Install Dependencies (Local Development):
```
pip install -r requirements.txt
```

Google Cloud Authentication:

gcloud auth login
gcloud config set project YOUR_PROJECT_ID

Deploy to Cloud Run:

The deploy script ships the container directly to Cloud Run, which provides the TLS certificate and the public URL.
```
./deploy.sh
```
- If prompted to enable APIs (Cloud Build, Cloud Run), say yes.
- Once finished, it will output a Service URL (e.g., https://call-me-live-api-xyz.a.run.app).

Twilio Configuration

Go to the Twilio Console.
Navigate to Voice > TwiML > TwiML Apps.
Create a new TwiML App (or update an existing one).
Set the Voice Request URL to your Cloud Run URL with the /incoming-call path:

https://YOUR-CLOUD-RUN-URL.a.run.app/incoming-call

Configure your Twilio Phone Number to use this TwiML App.
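
For reference, a webhook at /incoming-call typically answers Twilio with TwiML that opens a bidirectional media stream back to the server. The sketch below builds such a response. The WebSocket path /media-stream and the function name are assumptions for illustration, not taken from this repository.

```python
from xml.sax.saxutils import quoteattr


def build_stream_twiml(host: str, ws_path: str = "/media-stream") -> str:
    """Build TwiML that connects the call to a WebSocket media stream.

    ws_path is a hypothetical route; use whatever path the server exposes.
    """
    stream_url = f"wss://{host}{ws_path}"
    return (
        '<?xml version="1.0" encoding="UTF-8"?>'
        "<Response><Connect>"
        f"<Stream url={quoteattr(stream_url)} />"  # quoteattr adds the quotes
        "</Connect></Response>"
    )
```

Calling build_stream_twiml with the Cloud Run hostname (without the https:// prefix) yields a document that the endpoint could return with an XML content type.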

Usage

Call your Twilio phone number.
Speak to Gemini!

Technical Details

Hosting: Google Cloud Run (Serverless Container).
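Audio Format: Twilio streams G.711 mu-law at 8000 Hz. Gemini uses 16-bit linear PCM, with audio output at 24000 Hz.
Transcoding: The audioop standard-library module converts between these formats in real time. Note that audioop was deprecated in Python 3.11 and removed in Python 3.13, so newer interpreters need a replacement.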
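
To show what this conversion involves, here is a dependency-free sketch of G.711 mu-law decoding and encoding plus a naive 24000 Hz to 8000 Hz downsample. It is a stand-in for illustration, not the project's code: the project relies on audioop, and every function name here is hypothetical. The decimation step simply keeps every third sample without low-pass filtering, so a proper resampler would sound better.

```python
import struct

BIAS = 0x84   # standard G.711 mu-law bias (132)
CLIP = 32635  # largest magnitude that still fits after adding the bias


def ulaw_byte_to_pcm16(u: int) -> int:
    """Decode one mu-law byte to a signed 16-bit sample."""
    u = ~u & 0xFF                      # mu-law bytes are stored bit-inverted
    exponent = (u >> 4) & 0x07
    mantissa = u & 0x0F
    magnitude = (((mantissa << 3) + BIAS) << exponent) - BIAS
    return -magnitude if u & 0x80 else magnitude


def pcm16_to_ulaw_byte(sample: int) -> int:
    """Encode one signed 16-bit sample as a mu-law byte."""
    sign = 0x80 if sample < 0 else 0x00
    magnitude = min(abs(sample), CLIP) + BIAS
    exponent = (magnitude >> 7).bit_length() - 1   # segment number, 0..7
    mantissa = (magnitude >> (exponent + 3)) & 0x0F
    return ~(sign | (exponent << 4) | mantissa) & 0xFF


def ulaw_to_pcm16(data: bytes) -> bytes:
    """Twilio -> Gemini direction: mu-law bytes to little-endian PCM16."""
    samples = [ulaw_byte_to_pcm16(b) for b in data]
    return struct.pack(f"<{len(samples)}h", *samples)


def pcm16_24k_to_ulaw_8k(pcm: bytes) -> bytes:
    """Gemini -> Twilio direction: naive 3:1 decimation, then mu-law."""
    count = len(pcm) // 2
    samples = struct.unpack(f"<{count}h", pcm[: count * 2])
    return bytes(pcm16_to_ulaw_byte(s) for s in samples[::3])
```

With audioop available, the equivalent steps would use audioop.ulaw2lin, audioop.ratecv, and audioop.lin2ulaw, which also carry resampler state between chunks.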
