Google AI MCP

A unified FastMCP 3.2 gateway for Google AI cloud services, plus a bridge to a local LeWorldModel (LeWM) instance. It covers chat, images, video, music, speech, and embeddings, and adds world-model planning through a local lewm-mcp proxy.

Quick Start

# Install deps
uv sync --extra test --extra dev
npm --prefix webapp ci

# Start backend + frontend
.\start.ps1

# E2E audit (headless, zero clicks)
just e2e

Frontend: http://127.0.0.1:11015
Backend API: http://127.0.0.1:11014
API Docs: http://127.0.0.1:11014/api/docs
MCP HTTP: http://127.0.0.1:11014/mcp
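
To confirm the stack came up, the four addresses above can be probed from Python. The following is a minimal sketch using only the standard library. The helper names `endpoint_urls` and `is_reachable` are illustrative and not part of this repository, and "reachable" here only means that something answered at the URL.

```python
import urllib.error
import urllib.request

# Hosts and ports are taken from the list above; helper names are hypothetical.
BACKEND = "http://127.0.0.1:11014"
FRONTEND = "http://127.0.0.1:11015"


def endpoint_urls(backend: str = BACKEND, frontend: str = FRONTEND) -> dict[str, str]:
    """Return the four documented URLs keyed by a short label."""
    backend = backend.rstrip("/")
    return {
        "frontend": frontend.rstrip("/"),
        "backend": backend,
        "api_docs": f"{backend}/api/docs",
        "mcp_http": f"{backend}/mcp",
    }


def is_reachable(url: str, timeout: float = 2.0) -> bool:
    """True if anything answers at `url`, even with an HTTP error status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a 2xx status
        # (e.g. an endpoint that rejects a plain GET).
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, and similar.
        return False
```

After running `.\start.ps1`, calling `is_reachable(url)` for each value of `endpoint_urls()` should give a rough picture of which parts are listening.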

Services

Service	Model	Requires
Chat	Gemini 3.1 Pro, Gemma 4	GOOGLE_API_KEY
Image	Nano Banana 2 / Pro	GOOGLE_API_KEY
Video	Veo 3.1 Preview	GOOGLE_CLOUD_PROJECT
Music	Lyria 3 Pro / Clip	GOOGLE_CLOUD_PROJECT
Speech	Gemini TTS (one-shot)	GOOGLE_API_KEY
Embeddings	gemini-embedding-001	GOOGLE_API_KEY
World Model (LeWM bridge)	LeWorldModel JEPA	lewm-mcp on port 10927

All Google services fall back to mock mode when credentials are missing. LeWM requests are proxied to lewm-mcp via httpx.
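
The fallback rule can be illustrated with a small lookup built from the table above. This is only a sketch: the README does not show how the server actually detects credentials, so the assumption here is that a service runs live when its required environment variable is set and non-blank. The name `service_modes` is hypothetical.

```python
import os
from collections.abc import Mapping

# Service -> required environment variable, copied from the Services table.
REQUIRED_ENV = {
    "chat": "GOOGLE_API_KEY",
    "image": "GOOGLE_API_KEY",
    "video": "GOOGLE_CLOUD_PROJECT",
    "music": "GOOGLE_CLOUD_PROJECT",
    "speech": "GOOGLE_API_KEY",
    "embeddings": "GOOGLE_API_KEY",
}


def service_modes(env: Mapping[str, str] = os.environ) -> dict[str, str]:
    """Map each Google service to "live" or "mock" (hypothetical helper).

    ASSUMPTION: an unset or blank variable counts as missing credentials.
    """
    return {
        service: "live" if env.get(var, "").strip() else "mock"
        for service, var in REQUIRED_ENV.items()
    }
```

With only `GOOGLE_API_KEY` set, this sketch would report chat, image, speech, and embeddings as live, and video and music as mock.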

Speech vs speech-mcp

Capability	google-ai-mcp	speech-mcp
One-shot Gemini TTS	Yes (`google_ai_speech`, `/speech` UI)	Yes
Gemini Live (WebSocket)	No — link only	Yes
Streaming STT / Chirp / FunASR	No	Yes
ElevenLabs / fleet voice tools	No	Yes

Use google-ai-mcp for multimodal Google AI (chat, image, video, music, Omni, embeddings). Run speech-mcp when you need real-time voice agents; do not duplicate that stack here.

MCP Tools (11)

Tool	Operations
`google_ai_chat`	chat, list_models
`google_ai_image`	generate, list_models
`google_ai_video`	generate, list_models
`google_ai_omni`	generate, list_models
`google_ai_music`	generate, list_models
`google_ai_speech`	tts, list_voices, list_models
`google_ai_embeddings`	embed, list_models
`google_ai_world`	health, train_prepare, infer_prepare, surprise_stub
`google_ai_status`	Health check all services
`show_google_ai_status_card`	Prefab UI card
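
For orientation, MCP clients invoke these tools with a JSON-RPC 2.0 `tools/call` request. The sketch below only builds the request body. The argument name `operation` is an assumption inferred from the "Operations" column, not something this README confirms, so the real schema should be checked with `tools/list` first. The helper name `tools_call_request` is hypothetical.

```python
import json
from typing import Any


def tools_call_request(
    tool: str, arguments: dict[str, Any], request_id: int = 1
) -> dict[str, Any]:
    """Build an MCP `tools/call` JSON-RPC 2.0 request body."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }


# ASSUMPTION: "operation" is a guessed argument name; verify it against
# the tool schema returned by `tools/list` before relying on it.
request = tools_call_request("google_ai_chat", {"operation": "list_models"})
print(json.dumps(request, indent=2))
```

In practice an MCP client library would handle the initialization handshake and transport details, so a hand-built body like this is mainly useful for understanding or debugging the traffic.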

Configuration

$env:GOOGLE_API_KEY = "your-api-key"
$env:GOOGLE_CLOUD_PROJECT = "your-project-id"
$env:LEWM_API_URL = "http://127.0.0.1:10927"  # defaults to this

E2E Audit

just e2e
# Runs: clear zombies → start servers → 9 E2E tests → console probe → API health
# Fleet auditor at mcp-central-docs/scripts/playwright-audit.ps1

Docker

docker compose up -d

Tests

uv run pytest tests/ -q       # 17 backend tests
npx playwright test           # 9 E2E tests (via just e2e)

Claude Desktop (MCPB)

just mcpb-pack
# Output: dist/google-ai-mcp.mcpb

Drag dist/google-ai-mcp.mcpb onto Claude Desktop, or:

npx @anthropic-ai/mcpb install https://github.com/sandraschi/google-ai-mcp

Requires Python 3.12+ with the project dependencies installed (for example, by running uv sync in the repo). Set GOOGLE_API_KEY when prompted.

Native Desktop (Tauri)

just build-native
# Installer: native/target/release/bundle/nsis/Google AI MCP_0.1.0_x64-setup.exe

The installer bundles the React dist/ output and a PyInstaller backend sidecar. The first run generates Tauri icons from assets/icon.png if they are missing.

Repository

https://github.com/sandraschi/google-ai-mcp

