Sketch2App

Sketch a UI. A Gemini agent analyzes, plans, builds, and chats with you to refine it — live.

Built for the Google I/O Hackathon 2026.

What it does

A three-pane studio with a Gemini-powered agent in the middle:

Sketch — draw a rough wireframe on a canvas (left pane).
Agent — hit Run agent. A multi-step Gemini agent:
- ① Analyze — vision model reads the sketch and produces a structured plan (JSON-schema enforced): title, summary, components, theme, interactions.
- ② Build — composes the plan + sketch into a complete, self-contained HTML page.
- ③ Refine — chat box at the bottom of the agent pane. Type "make it dark mode", "add a sign-up link", "use a teal palette" — Gemini patches the live app. You can even draw on the sketch to point at things or add new elements while you chat.
- ④ Voice — hit the microphone icon to dictate your refinements hands-free.
Live output — the generated app runs in a sandboxed iframe on the right. Toggle to Code view, copy, or download.
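
The live-output pane could be wired up roughly like this. It is a minimal sketch, not the code in index.html: `showPreview` is a hypothetical helper, and loading the page through the iframe's `srcdoc` property is an assumption. The sandbox value is the one quoted under "How it works".

```javascript
// Hypothetical helper: load generated HTML into the sandboxed preview iframe.
// Assumption: the page is injected via `srcdoc`; index.html may do it differently.
function showPreview(iframe, html) {
  // Scripts and forms are allowed, but without "allow-same-origin" the page
  // runs in an opaque origin and cannot read the parent page's localStorage.
  iframe.setAttribute("sandbox", "allow-scripts allow-forms");
  iframe.srcdoc = html;
  return iframe;
}

// Browser usage (illustrative):
//   showPreview(document.querySelector("iframe"), generatedHtml);
```

Leaving `allow-same-origin` out of the sandbox list is what keeps generated scripts away from the stored API key.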

Built with Tailwind (shadcn-style aesthetic), vanilla JS, one HTML file, no backend.

Why it matters

The fastest path from "idea in your head" to "working product on screen" is currently typing a 200-word prompt. That's a writing skill, not a design skill. Sketching is universal — kids, PMs, designers, founders all sketch. We let the sketch be the prompt.

This is multimodal reasoning as a creative tool: vision + voice in, working code out, with no hand-written prompt in between.

Demo (60 seconds)

1. Open index.html in a browser (or serve with python3 -m http.server).
2. Paste a Gemini API key (get one free at aistudio.google.com/apikey).
3. Sketch on the left canvas. Try:
   - A header bar with a logo box
   - A grid of product cards
   - A login form
4. Hit Generate ✨ (or Cmd/Ctrl+Enter).
5. Watch a working app appear on the right. Toggle to Code view, Copy, or Download the HTML.

How it works — the agent loop

   sketch (canvas PNG, base64)
            │
            ▼
   ┌──────────────────────────────────────┐
   │ ① ANALYZE                            │
   │  Gemini 2.5 with responseSchema      │  →  structured Plan JSON
   │  (vision + JSON-mode)                │      { title, components[], theme,
   └──────────────────────────────────────┘        interactions[], reasoning }
            │
            ▼
   ┌──────────────────────────────────────┐
   │ ② BUILD                              │
   │  Gemini 2.5 (plan + sketch + system) │  →  full self-contained HTML
   └──────────────────────────────────────┘
            │            ▲
            ▼            │ user message
   ┌──────────────────────────────────────┐
   │ ③ REFINE  (chat loop)                │
   │  Gemini 2.5 (current HTML + msg)     │  →  updated HTML, swap into <iframe>
   └──────────────────────────────────────┘
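
The control flow in the diagram can be sketched as below. This is an illustration under stated assumptions, not the project's code: `runAgent` and the injected `callModel(step, payload)` function are hypothetical names, and `callModel` stands in for the Gemini request so the loop can be read on its own.

```javascript
// Sketch of the analyze -> build -> refine loop from the diagram.
// `callModel` is a hypothetical injected function:
//   (step, payload) => Promise<string>
async function runAgent(sketchBase64, callModel) {
  // 1. Analyze: the sketch becomes a structured plan (returned as JSON text).
  const plan = JSON.parse(await callModel("analyze", { sketch: sketchBase64 }));

  // 2. Build: plan + sketch become a full, self-contained HTML page.
  let html = await callModel("build", { plan, sketch: sketchBase64 });

  return {
    plan,
    getHtml: () => html,
    // 3. Refine: each chat message patches the current HTML.
    async refine(message) {
      html = await callModel("refine", { html, message });
      return html;
    },
  };
}
```

Passing the model call in as a parameter keeps the loop independent of the network layer, so it can be exercised with a stub.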

Structured output. Step ① (Analyze) sets responseMimeType: application/json together with a responseSchema, so the plan comes back as JSON that matches the schema.
Frontend only. fetch straight to generativelanguage.googleapis.com — no backend, no proxy.
Key stays local. It is stored in localStorage and sent only to the Gemini API with each request; no other server ever sees it.
Sandboxed preview. Generated HTML runs in <iframe sandbox="allow-scripts allow-forms">.
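
As a rough illustration of the structured-output and frontend-only points above, the Analyze request might be assembled like this. The helper names (`buildAnalyzeRequest`, `buildGeminiCall`), the prompt text, and the exact schema (for example, `theme` as a plain string) are assumptions; only the plan field names come from this README.

```javascript
// Assumed shape of the Analyze request body. Prompt and schema details are
// illustrative; the real ones live in index.html.
function buildAnalyzeRequest(sketchBase64) {
  const stringList = { type: "ARRAY", items: { type: "STRING" } };
  return {
    contents: [{
      role: "user",
      parts: [
        { inlineData: { mimeType: "image/png", data: sketchBase64 } },
        { text: "Describe this UI sketch as a build plan." },
      ],
    }],
    generationConfig: {
      responseMimeType: "application/json",
      responseSchema: {
        type: "OBJECT",
        properties: {
          title: { type: "STRING" },
          summary: { type: "STRING" },
          components: stringList,
          theme: { type: "STRING" },
          interactions: stringList,
        },
        required: ["title", "components"],
      },
    },
  };
}

// Builds the arguments for fetch(); the key travels in a request header.
function buildGeminiCall(model, apiKey, body) {
  return {
    url: `https://generativelanguage.googleapis.com/v1beta/models/${model}:generateContent`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json", "x-goog-api-key": apiKey },
      body: JSON.stringify(body),
    },
  };
}

// Browser usage (not run here):
//   const { url, init } = buildGeminiCall("gemini-2.5-flash", key, buildAnalyzeRequest(png));
//   const data = await (await fetch(url, init)).json();
//   const plan = JSON.parse(data.candidates[0].content.parts[0].text);
```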

Tech

Tailwind (CDN) for the shadcn-style aesthetic — dark zinc theme, violet/cyan accents
HTML5 Canvas + Pointer Events for drawing
Gemini API (gemini-2.5-flash, gemini-2.5-pro, gemini-2.0-flash selectable)
Vanilla JS, no frameworks, no bundler, nothing to install (Tailwind loads from the CDN)
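
For readers unfamiliar with the Canvas + Pointer Events combination, freehand drawing can be sketched as follows. Both helpers (`attachDrawing`, `dataUrlToBase64`) are hypothetical, and the stroke styling is an arbitrary choice; the actual canvas logic in index.html may differ.

```javascript
// Hypothetical helper: freehand drawing on a canvas with Pointer Events.
function attachDrawing(canvas) {
  const ctx = canvas.getContext("2d");
  ctx.lineWidth = 2;       // illustrative styling
  ctx.lineCap = "round";
  let drawing = false;

  canvas.addEventListener("pointerdown", (e) => {
    drawing = true;
    ctx.beginPath();
    ctx.moveTo(e.offsetX, e.offsetY);
  });
  canvas.addEventListener("pointermove", (e) => {
    if (!drawing) return;  // ignore hover movement
    ctx.lineTo(e.offsetX, e.offsetY);
    ctx.stroke();
  });
  const stop = () => { drawing = false; };
  canvas.addEventListener("pointerup", stop);
  canvas.addEventListener("pointerleave", stop);
}

// canvas.toDataURL("image/png") returns "data:image/png;base64,<payload>";
// an inline-image API request needs only the payload part.
function dataUrlToBase64(dataUrl) {
  return dataUrl.slice(dataUrl.indexOf(",") + 1);
}
```

Pointer Events cover mouse, touch, and pen with one set of handlers, which is why a single listener set is enough here.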

Files

File	Purpose
`index.html`	The entire app — UI, canvas logic, Gemini call, preview iframe
`README.md`	This file

Roadmap

Refine loop: send the current generated HTML + new sketch back to Gemini for iteration
Voice prompts: dictate refinements using the microphone
Multi-page: generate a small sitemap from a sketched flow
Component library mode: paste your design tokens, get on-brand output

License

MIT
