Virtual Reality-Based Industrial Machinery Training Using Agentic AI for Decision Support for Industry 4.0
VR IMTAI is a Unity-based virtual reality training system for industrial machinery operation. It combines immersive XR interaction with agentic AI, large language model reasoning, retrieval-augmented generation, speech input, voice output, safety validation, and performance evaluation.
The current training environment focuses on a bench lathe workflow, where trainees can interact with machine controls, receive adaptive guidance, and be evaluated against procedural and safety-oriented task requirements.
- Demo
- Overview
- Core Capabilities
- Technology Stack
- Installation and Setup
- Repository Structure
- Academic Context
- Credits
- License
Google Drive Demonstration Video (~8 mins)
Modern Industry 4.0 environments require operators who can work with complex machinery safely, consistently, and with strong procedural awareness. Traditional training can be expensive, hazardous, difficult to personalize, and hard to evaluate objectively.
This project explores a VR training workflow where the trainee's actions are observed in real time, interpreted against machine state and task context, and used to drive grounded guidance and assessment. The system is designed around four main responsibilities:
- Perceive trainee actions through XR interactions and machine-state tracking.
- Reason over task progress, safety constraints, and industrial knowledge.
- Guide the trainee through adaptive text, speech, and interactable highlighting.
- Evaluate performance through state-aware task completion and safety checks.
| Area | Description |
|---|---|
| Immersive machinery training | Interactive lathe controls, workpiece handling, snapping, spindle/feed behavior, and procedural task flow inside VR. |
| Agentic guidance | Context-aware responses generated from trainee questions, machine state, task progress, and retrieved machine knowledge. |
| Safety validation | Runtime checks for power, emergency stop, feed, spindle, threading, and machine-control constraints. |
| Speech interface | Whisper-based speech-to-text for trainee questions and Piper-based text-to-speech for spoken guidance. |
| Performance evaluation | Step tracking, action validation, scoring hooks, and persistent previous-score support for trainee review. |
| XR deployment | Meta Quest-oriented Unity project using XR, OpenXR, Meta XR SDK, and URP. |
| Layer | Tools |
|---|---|
| Engine | Unity 6, Universal Render Pipeline |
| XR | Meta XR SDK, Meta XR Interaction SDK, OpenXR, Unity Input System |
| AI inference | Unity Sentis |
| Retrieval and reasoning | LangChain, Kimi-K2-Instruct, FAISS |
| Speech-to-text | Whisper |
| Text-to-speech | Piper |
| Target hardware | Meta Quest 2, Quest 3, Quest 3S |
- Unity
6000.4.2f1 - Meta Quest 2, Meta Quest 3, or Meta Quest 3S
- At least 1 GB of free headset storage
- USB-C cable, preferably USB 3.0, for development builds
- SideQuest for manual APK installation
- Meta Quest mobile app with Developer Mode enabled
- Install the Meta Quest app on your phone and sign in with your Meta account.
- Pair your Quest headset in the Meta Quest app.
- Enable Developer Mode from
Menu -> Devices -> your Quest -> Developer Mode. - Reboot the headset.
- Connect the headset to your computer and allow USB debugging inside the headset.
- Install and open SideQuest on your computer.
- Confirm the SideQuest connection indicator is green and authorized.
- Download the latest APK from the Releases page.
- In SideQuest, select
Install APK from folder, choose the downloaded APK, and wait for installation to finish. - Launch the application from
Apps -> Unknown Sourcesinside the headset.
- Clone this repository.
- Open Unity Hub.
- Add the project folder:
VR Industrial Training. - Open the project with Unity
6000.4.2f1. - Switch the build target to Android for Quest deployment.
- Build and run to a connected headset when developing locally.
IMTAI-VR/
|-- VR Industrial Training/ # Unity project root
| |-- Assets/ # Scenes, scripts, models, prefabs, and runtime assets
| |-- Packages/ # Unity package manifest
| `-- ProjectSettings/ # Unity project configuration
|-- t999_user_manual.txt # Lathe reference material used by the project
|-- LICENSE
`-- README.md
Key script areas:
Assets/_Scripts/Machinery Logic- lathe simulation, state tracking, safety rules, guidance, snapping, and evaluation logic.Assets/_Scripts/STT- Whisper loading, recording, and transcription state machine.Assets/_Scripts/TTS- Piper text-to-speech integration.Assets/_Scripts/StateAsm- reusable state-machine support.
This repository supports the official graduation project submission (June 2026) for Egypt-Japan University of Science and Technology (E-JUST) and follows academic best practices for reproducibility, documentation, and extensibility.
- @37743 - Yousef Ibrahim Gomaa Mahmoud - yousef.gomaa@ejust.edu.eg
- @YahyaAzab - Yahya Refaat Mohamed Azab - yahya.azab@ejust.edu.eg
- Mazen Hassan Ahmed Hassan - mazen.hassan@ejust.edu.eg
Dr. Ahmed B. Zaky - ahmed.zaky@ejust.edu.eg
Egypt-Japan University of Science and Technology
Faculty of Computer Science and Information Technology
New Borg El-Arab City, 21934, Alexandria, Egypt
This project is licensed under the terms of the MIT License.
