Skip to content

37743/IMTAI-VR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

IMTAI VR

VR IMTAI training environment

Virtual Reality-Based Industrial Machinery Training Using Agentic AI for Decision Support for Industry 4.0

Unity Latest Release Platform License

VR IMTAI is a Unity-based virtual reality training system for industrial machinery operation. It combines immersive XR interaction with agentic AI, large language model reasoning, retrieval-augmented generation, speech input, voice output, safety validation, and performance evaluation.

The current training environment focuses on a bench lathe workflow, where trainees can interact with machine controls, receive adaptive guidance, and be evaluated against procedural and safety-oriented task requirements.

Table of Contents

Demo

Google Drive Demonstration Video (~8 mins)

Watch the demo

Overview

Modern Industry 4.0 environments require operators who can work with complex machinery safely, consistently, and with strong procedural awareness. Traditional training can be expensive, hazardous, difficult to personalize, and hard to evaluate objectively.

This project explores a VR training workflow where the trainee's actions are observed in real time, interpreted against machine state and task context, and used to drive grounded guidance and assessment. The system is designed around four main responsibilities:

  • Perceive trainee actions through XR interactions and machine-state tracking.
  • Reason over task progress, safety constraints, and industrial knowledge.
  • Guide the trainee through adaptive text, speech, and interactable highlighting.
  • Evaluate performance through state-aware task completion and safety checks.

Core Capabilities

Area Description
Immersive machinery training Interactive lathe controls, workpiece handling, snapping, spindle/feed behavior, and procedural task flow inside VR.
Agentic guidance Context-aware responses generated from trainee questions, machine state, task progress, and retrieved machine knowledge.
Safety validation Runtime checks for power, emergency stop, feed, spindle, threading, and machine-control constraints.
Speech interface Whisper-based speech-to-text for trainee questions and Piper-based text-to-speech for spoken guidance.
Performance evaluation Step tracking, action validation, scoring hooks, and persistent previous-score support for trainee review.
XR deployment Meta Quest-oriented Unity project using XR, OpenXR, Meta XR SDK, and URP.

Technology Stack

Layer Tools
Engine Unity 6, Universal Render Pipeline
XR Meta XR SDK, Meta XR Interaction SDK, OpenXR, Unity Input System
AI inference Unity Sentis
Retrieval and reasoning LangChain, Kimi-K2-Instruct, FAISS
Speech-to-text Whisper
Text-to-speech Piper
Target hardware Meta Quest 2, Quest 3, Quest 3S

Installation and Setup

Requirements

  • Unity 6000.4.2f1
  • Meta Quest 2, Meta Quest 3, or Meta Quest 3S
  • At least 1 GB of free headset storage
  • USB-C cable, preferably USB 3.0, for development builds
  • SideQuest for manual APK installation
  • Meta Quest mobile app with Developer Mode enabled

Install from APK

  1. Install the Meta Quest app on your phone and sign in with your Meta account.
  2. Pair your Quest headset in the Meta Quest app.
  3. Enable Developer Mode from Menu -> Devices -> your Quest -> Developer Mode.
  4. Reboot the headset.
  5. Connect the headset to your computer and allow USB debugging inside the headset.
  6. Install and open SideQuest on your computer.
  7. Confirm the SideQuest connection indicator is green and authorized.
  8. Download the latest APK from the Releases page.
  9. In SideQuest, select Install APK from folder, choose the downloaded APK, and wait for installation to finish.
  10. Launch the application from Apps -> Unknown Sources inside the headset.

Open the Unity Project

  1. Clone this repository.
  2. Open Unity Hub.
  3. Add the project folder: VR Industrial Training.
  4. Open the project with Unity 6000.4.2f1.
  5. Switch the build target to Android for Quest deployment.
  6. Build and run to a connected headset when developing locally.

Repository Structure

IMTAI-VR/
|-- VR Industrial Training/        # Unity project root
|   |-- Assets/                    # Scenes, scripts, models, prefabs, and runtime assets
|   |-- Packages/                  # Unity package manifest
|   `-- ProjectSettings/           # Unity project configuration
|-- t999_user_manual.txt           # Lathe reference material used by the project
|-- LICENSE
`-- README.md

Key script areas:

  • Assets/_Scripts/Machinery Logic - lathe simulation, state tracking, safety rules, guidance, snapping, and evaluation logic.
  • Assets/_Scripts/STT - Whisper loading, recording, and transcription state machine.
  • Assets/_Scripts/TTS - Piper text-to-speech integration.
  • Assets/_Scripts/StateAsm - reusable state-machine support.

Academic Context

This repository supports the official graduation project submission (June 2026) for Egypt-Japan University of Science and Technology (E-JUST) and follows academic best practices for reproducibility, documentation, and extensibility.

Credits

Team Members and Contributions

Supervisor

Dr. Ahmed B. Zaky - ahmed.zaky@ejust.edu.eg

Egypt-Japan University of Science and Technology
Faculty of Computer Science and Information Technology
New Borg El-Arab City, 21934, Alexandria, Egypt

License

This project is licensed under the terms of the MIT License.

About

IMTAI-VR is a Unity-based VR training system integrating agentic AI decision loops with LLM reasoning and Retrieval-Augmented Generation (RAG). It performs real-time perception of user actions, adaptive guidance, safety validation, and performance evaluation via modular agents, vector search, STT/TTS, and XR interaction pipelines.

Topics

Resources

License

Stars

Watchers

Forks

Contributors