--
A robust system that leverages Azure Cognitive Services for speech recognition and MediaPipe for gesture recognition, enabling intuitive voice commands and hand gesture controls.
- 🎤 Voice Commands: Open and close applications using speech recognition.
- 🖐️ Gesture Recognition: Control the mouse cursor and perform clicks with hand gestures.
- 🖥️ User-Friendly GUI: Includes a Tkinter-based control panel to start and stop speech and gesture recognition.
Ensure the following tools and libraries are installed:
- Python 3.9 or higher
- Azure Cognitive Services Speech SDK
- MediaPipe
- OpenCV
- PyAutoGUI
- Tkinter
git clone https://github.com/jagdeeshjk/Hand_control.git
cd Hand_control # On Windows
python -m venv myenv
source myenv/Scripts/activate
# On macOS/Linux
python -m venv myenv
source myenv/bin/activate pip install -r requirements.txt If you wish to deactivate type "deactivate" (without quotes)
- Create an Azure Cognitive Services account.
- Obtain your Speech API Key and Region.
- Replace placeholders
YOUR-SPEECH-API-KEYandYOUR-LOCATIONin the following files:APP.pytest_def.py
python APP.py python App1.py python test_def.py APP.py- Main script for gesture and voice control.App1.py- GUI-based application for gesture and voice control.test_def.py- Script to test speech recognition functionality.requirements.txt- List of dependencies.
This project uses the following amazing tools and technologies: