Skip to content

S26: Improve and Finetune the Computer Vision Model #71

@notzabir

Description

@notzabir

Description

Real-Time Computer Vision — Go Beyond Face Detection

Upgrades
1. Face Recognition (Not Just Detection)
Use:

  • ArcFace

  • FaceNet

  • DeepFace

This allows:

  • Personalized greeting

  • Multi-user support

2. Gaze Detection
Detect:

  • Is user actually looking at robot?

  • Are they distracted?

Look into:

  • MediaPipe Face Mesh

  • Gaze tracking algorithms

3. Emotion Detection
Use light models:

  • FER2013-trained CNN

  • DeepFace emotion module

Don’t overinterpret — just soft signal.

Jetson Optimization Resources
Look into:

  • TensorRT conversion

  • ONNX model conversion

  • NVIDIA DeepStream SDK

DeepStream is powerful for:

  • Real-time video pipelines

  • Hardware acceleration

More Resources

Real-Time Computer Vision (YOLO + MediaPipe + Multithreading)
YOLOv8 Real-Time Inference

MediaPipe Face Detection / Face Mesh

These directly help your:

  • Face scanning activation logic
  • Multi-user detection
  • Latency profiling

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

Status

Todo

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions