DEMO: https://framer.com/projects/Zero-Distance--nV9PyBnMbb4wt0S3KW7i-uLQRz?node=augiA20Il
We are building a gesture-based navigation system that allows users to control Google Slides using hand gestures detected via a webcam.
- Use a webcam to stream live video.
Input: video stream
Output: string [gesture class]
- Continuously capture video frames from the webcam.
- Analyze each frame to classify gestures such as:
- Peace (✌️) → QUIT (also sends Esc if Slides Mode is ON)
- Thumbs-Up (👍) → NEXT slide (Right Arrow)
- Thumbs-Down (👎) → PREVIOUS slide (Left Arrow)
- Rock (🤘) → SCROLL UP (Arrow Up)
- Shaka (🤙) → SCROLL DOWN (Arrow Down)
- OK sign (👌) → START SLIDESHOW (Cmd/Ctrl+Enter)
- Open palm (✋) → STOP SLIDESHOW (Esc)
Input: string [gesture class]
Output: keyboard command
- Map each recognized gesture to a corresponding keyboard command.
"Fist"→ (Neutral or pause state)"3 fingers together, horizontal (up or down)":- If pointing up → simulate holding the Up Arrow key.
- If pointing down → simulate holding the Down Arrow key.
- The simulated key press should be held for 5 seconds.
Input: keyboard command
Output: action in the presentation
- Google Slides (or similar platforms) respond to arrow key input:
- Simulated key presses trigger slide navigation.
Create virtual environment:
python -m venv .venvEntering virtual environment:
source .venv/bin/activate