Sign Language Detection using LSTM and Mediapipe

This repository contains code for a Sign Language Recognition system that uses the Mediapipe library for pose estimation and hand tracking, and an LSTM (Long Short-Term Memory) network for sequence classification. The system is designed to recognize and classify hand gestures from webcam video input.

1. Business Understanding

The Gesture Recognition system is developed to detect and classify hand gestures in real time. It can be used in various applications, such as sign language recognition, human-computer interaction, or gesture-based control systems. By understanding and recognizing different hand gestures, the system can enable more natural and intuitive interactions with computers and devices.

2. Results

The system recognizes gestures in real time from webcam input. It detects and classifies hand gestures with a high degree of accuracy and displays each recognized gesture on the screen as it occurs, so users can interact with the system by performing different hand gestures.

3. Technologies Used

a. Programming Languages

Python

b. Libraries

OpenCV: For capturing webcam feed and image processing.
Matplotlib: For visualizing image data.
NumPy: For array manipulation and data handling.
Mediapipe: For pose estimation and hand tracking.
TensorFlow: For creating and training the LSTM model.

4. Approach

The project follows these main steps:

Data Collection: The system captures video frames from the webcam and uses the Mediapipe library to detect the pose and landmarks of the face, hands, and body.
Landmark Extraction: The detected landmarks are converted into keypoint arrays for the face, pose, and both hands. These arrays are then concatenated to form a single feature vector per frame.
Data Preprocessing: The collected data is split into sequences, each containing a fixed number of frames (sequence_length) for each gesture (action).
Model Training: An LSTM model is created using TensorFlow. The LSTM layers are designed to learn the temporal patterns in the input sequences and classify the gestures accordingly. The model is trained using the training data.
Gesture Recognition: During real-time gesture recognition, the system continuously captures frames, extracts keypoints, and forms sequences of keypoints. These sequences are fed into the trained LSTM model to predict the recognized gestures.
Real-Time Visualization: The recognized gestures are displayed on the screen in real time. When the model's confidence in a prediction exceeds a set threshold, the gesture is considered recognized and added to a sentence. The sentence shows the last five recognized gestures.
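
The landmark extraction and sequence-building steps above can be sketched as follows. This is a minimal illustration, not the notebook's actual code: the function names `flatten_landmarks`, `extract_keypoints`, and `to_sequence` are hypothetical, and the landmark counts (33 pose, 468 face, 21 per hand) assume the default output of Mediapipe's Holistic solution. Each landmark is assumed to be an object with `x`, `y`, `z` (and, for pose, `visibility`) attributes.

```python
import numpy as np

# Assumed landmark counts (Mediapipe Holistic defaults); adjust if the
# notebook uses a different Mediapipe solution or configuration.
POSE_N, FACE_N, HAND_N = 33, 468, 21
XYZ = ("x", "y", "z")
XYZV = ("x", "y", "z", "visibility")


def flatten_landmarks(landmarks, count, fields):
    """Flatten a list of landmark objects into a 1-D array.

    Returns zeros of the same length when nothing was detected, so the
    feature vector keeps a fixed size from frame to frame.
    """
    if landmarks is None:
        return np.zeros(count * len(fields))
    return np.array([[getattr(p, f) for f in fields] for p in landmarks]).flatten()


def extract_keypoints(pose, face, left_hand, right_hand):
    """Concatenate pose, face, and hand keypoints into one feature vector."""
    return np.concatenate([
        flatten_landmarks(pose, POSE_N, XYZV),
        flatten_landmarks(face, FACE_N, XYZ),
        flatten_landmarks(left_hand, HAND_N, XYZ),
        flatten_landmarks(right_hand, HAND_N, XYZ),
    ])


def to_sequence(frames, sequence_length):
    """Stack the most recent `sequence_length` feature vectors into a 2-D array."""
    window = frames[-sequence_length:]
    if len(window) < sequence_length:
        raise ValueError("not enough frames collected yet")
    return np.stack(window)
```

Under these assumptions each frame yields a vector of 33*4 + 468*3 + 21*3 + 21*3 = 1662 values, and a sequence has shape (sequence_length, 1662), which is the kind of input an LSTM layer expects for one sample.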

The system can be extended to recognize more gestures by adding new actions to the "actions" array in the code, then collecting data for the new actions and retraining the model.
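
As a rough illustration of how the "actions" array ties into the sentence display, the sketch below maps a model's output probabilities to a label and keeps the last five recognized gestures. The labels in `actions`, the threshold value of 0.8, and the function name `update_sentence` are assumptions made for this example; skipping a gesture that repeats the previous word is also an assumption, not something confirmed by the notebook.

```python
import numpy as np

# Hypothetical gesture labels; the real list lives in the notebook's "actions" array.
actions = ["hello", "thanks", "iloveyou"]

THRESHOLD = 0.8  # assumed confidence threshold
MAX_WORDS = 5    # the sentence shows the last five recognized gestures


def update_sentence(sentence, probabilities, threshold=THRESHOLD):
    """Return the sentence after considering one prediction.

    `probabilities` holds one score per entry in `actions`, in the same order.
    """
    best = int(np.argmax(probabilities))
    if probabilities[best] < threshold:
        return sentence  # prediction not confident enough
    word = actions[best]
    if sentence and sentence[-1] == word:
        return sentence  # assumed behavior: do not repeat the same word back to back
    return (sentence + [word])[-MAX_WORDS:]
```

Because labels are looked up by index, a new gesture appended to `actions` must match the order of the model's output units, which is why the model has to be retrained after the list changes.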
