🛸

SkyAugment

Photorealistic drone footage variants from simulation data

Powered by Seedance 2.0 + Z.AI

Problem • Demo • How It Works • Quickstart • Use Cases • Roadmap

🔥 The Problem

Drone perception teams are blocked on training data diversity. Real flight hours cost $200–$2,000/hr, and rare conditions — dust storms, dense fog, wildfire smoke — are expensive or dangerous to capture.

ArduPilot SITL + Gazebo gives free synthetic trajectories, but the renders look like a video game. Models trained on them don't generalize to real-world sensor characteristics.

🎬 Demo

Input — ArduPilot/Gazebo sim frame

Generated variant — "wildfire smoke layer at dusk over forest"

https://drive.google.com/file/d/1Vsh4-VZeXwW4y8L5ZLh8a_gp2pWYhfgN/view?usp=sharing

Aerial drone footage, wildfire smoke layer at dusk over forest. Slow gentle drone orbit at constant altitude, smooth circular pan, camera always locked toward the scene center. Real-time speed only — strictly NO timelapse, NO hyperlapse, NO fast motion. ONE single uninterrupted continuous shot.


💡 Lighting	Harsh backlight from low sun angle, 18% particulate haze, warm orange tones
🌪️ Weather	Dust storm with 40 mph sustained winds, horizontal visibility 150 m
🏜️ Terrain	Arid desert highway, sand dunes flanking both sides, rocky outcrops
🕕 Time of day	Golden hour, ~18:00 local, sun at 10° elevation
🌫️ Atmospheric effects	PM10 dust veil at 0–300 m AGL, airborne sand particles
📷 Camera artifacts	Lens flare from sun angle, slight overexposure on horizon

⚙️ How It Works

Gazebo sim frame  ──►  Z.AI Scenario Reasoner  ──►  Seedance 2.0 Reference-to-Video  ──►  Labeled footage

┌──────────────────┐     ┌─────────────────────────┐     ┌──────────────────────────┐
│  Gazebo / SITL   │     │   Z.AI Scenario Agent    │     │   Seedance 2.0           │
│  sim frame (PNG) │────►│   Short prompt → JSON    │────►│   Reference-to-Video     │
│                  │     │   lighting, weather,     │     │   image + structured     │
│  ArduPilot SITL  │     │   terrain, atmo effects, │     │   prompt → 5s 720p clip  │
│  ground truth    │     │   camera artifacts       │     │                          │
└──────────────────┘     └─────────────────────────┘     └──────────────────────────┘
                                                                      │
                                                          ┌───────────▼──────────────┐
                                                          │   Labeled variant set    │
                                                          │   cached locally,        │
                                                          │   served instantly       │
                                                          └──────────────────────────┘

The Z.AI Scenario Reasoner is the agentic layer — it takes a vague edge-case description and expands it into a precise, structured environmental spec that Seedance can condition on. The agent reasons over lighting physics, atmospheric optics, and sensor characteristics to produce diversity-maximizing variants.

┌─────────────────────────────────────┐
│  Real flight data       $500 / hr   │
│  SkyAugment              $0.50 / clip│
│  Time to first clip       ~60 sec   │
│  Labeled metadata           ✓ auto  │
└─────────────────────────────────────┘

🚀 Quickstart

git clone <repo>
cd betaFundHackathon
python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt
cp .env.example .env   # add your API keys
uvicorn backend.main:app --port 8000
# open http://localhost:8000

Environment variables

SEEDANCE_API_KEYS=key1,key2,key3   # BytePlus Ark — round-robined
ZAI_API_KEY=your_zai_key           # Z.AI (Anthropic-compatible endpoint)
USE_CACHE=true                     # serve pre-generated results instantly

🛠️ Tech Stack

Layer	Tech
🎥 Video generation	Seedance 2.0 (`dreamina-seedance-2-0-fast-260128`) via BytePlus Ark
🧠 Scenario reasoning	Z.AI GLM-4 via Anthropic-compatible API
⚡ Backend	FastAPI + async httpx polling
🖥️ Frontend	Vanilla HTML/CSS/JS — no build step
💾 Cache	JSON + local MP4 with ffmpeg faststart

🎯 Use Cases

🚁 Drone perception training data — generate rare atmospheric conditions at scale
🚗 Autonomous vehicle sensor simulation — photorealistic camera artifacts for model robustness
🛡️ Defense/ISR — edge-case augmentation for target detection under degraded visibility
🔧 ArduPilot community — plug-and-play with existing SITL workflows

Named buyers: Anduril · Shield AI · Skydio · Zipline · Percepto · DoD test ranges

🗺️ Roadmap

Timeline	Milestone
Day 1–2	Cold-email 10 drone perception leads
Day 3–4	Open-source ArduPilot integration plugin → r/ArduPilot, ArduPilot Discord
Day 5	Apply to Beta University Cohort 11
Day 6–7	Publish AerialEdgeCase-100 dataset on HuggingFace
Month 1	First design partner ($5K/mo, 1000 clips/mo)
Q1	Integrate with Mirage cyber-range as perception-data module

👤 Built By

Pranav Bhusari — Security + ML Engineer

MS Purdue CERIAS · Ex-LLNL / Peraton / Alif

Direct experience with synthetic data pipelines, simulation environments, and the defense/dual-use buyer.

🔗 linkedin.com/in/pranav-bhusari

Submitted to Beta Super Hackathon · Track 4: Physical AI + Simulation · Submission code: butterbase0502

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
backend		backend
demo_assets		demo_assets
frontend		frontend
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
highwayflyover.png		highwayflyover.png
requirements.txt		requirements.txt
spec.md		spec.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛸

SkyAugment

🔥 The Problem

🎬 Demo

Input — ArduPilot/Gazebo sim frame

Generated variant — "wildfire smoke layer at dusk over forest"

⚙️ How It Works

🚀 Quickstart

Environment variables

🛠️ Tech Stack

🎯 Use Cases

🗺️ Roadmap

👤 Built By

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🛸

SkyAugment

🔥 The Problem

🎬 Demo

Input — ArduPilot/Gazebo sim frame

Generated variant — "wildfire smoke layer at dusk over forest"

⚙️ How It Works

🚀 Quickstart

Environment variables

🛠️ Tech Stack

🎯 Use Cases

🗺️ Roadmap

👤 Built By

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages