RHOS studies Embodied AI, Physical Reasoning, and Human Activity Understanding. We are building a knowledge and reasoning-driven system that enables intelligent agents and robots to perceive human activities, reason about human behavior logics, learn skills from human activities, and interact with the environment.
Our homepage: https://mvig-rhos.com
An unofficial PyTorch/GPU implementation of D4RT for 4D reconstruction and tracking, with WorldTrack evaluation, visualization tools, and released Hugging Face checkpoints.
- π» GitHub: Lijiaxin0111/Open-d4rt
- πΏ RHOS Branch: OpenD4RT
- π€ Hugging Face: OpenD4RT Checkpoints
- π Paper: arXiv:2512.00960
- π Website: Open4DHOI
- π» GitHub: wenboran2002/open4dhoi_code
- π€ Dataset: Open4DHOI
- π¦ Demo Data: Google Drive
- πΏ RHOS Branch: open4dhoi_code
- π Paper: arXiv:2511.17898
- π Website: L1Flow Project Page
- π» GitHub: THyanNK/L1Flow
- π¦ Dataset: Diffusion Policy Training Data
- π€ Checkpoints: L1Flow Results
- πΏ RHOS Branch: L1Flow
- π Paper: arXiv:2511.15407
- π» GitHub: RHOS/IPR-1 branch
RoboHiMan: A Hierarchical Evaluation Paradigm for Compositional Generalization in Long-Horizon Manipulation
- π Paper: arXiv:2510.13149
- π Website: RoboHiMan
- π» GitHub: chenyt31/RoboHiMan
- πΏ RHOS Branch: RoboHiMan
- π Paper: arXiv:2509.14688
- π Website: exUMI Project Page
- π» GitHub: silicx/exUMI
- πΏ RHOS Branch: exUMI
- π Paper: arXiv:2505.01396
- π Website: SIME Project Page
- π» GitHub: EricJin2002/SIME
- π¦ Dataset: robomimic Benchmark Instructions
- πΏ RHOS Branch: SIME
- π Paper: arXiv:2503.15898
- π Website: Open3DHOI Project Page
- π» GitHub: wenboran2002/open-3dhoi
- π€ Dataset: Open3DHOI
- πΏ RHOS Branch: open-3dhoi
- π Website: ImDy Project Page
- π» GitHub: Foruck/ImDy
- π€ Dataset: ImDy
- π¦ Checkpoints: Google Drive
- πΏ RHOS Branch: ImDy
- π» GitHub: Foruck/HDyS
- πΏ RHOS Branch: HDyS
- π Website: ViFailback
- π» GitHub: x1nyuzhou/ViFailback
- πΏ RHOS Branch: ViFailback
- π Website: Human-Robot Joint Learning
- π» GitHub: RHOS/HAJL branch
- π Paper: arXiv:2410.01417
- π Website: LLM Inception
- π» GitHub: lihongcs/LLM_Inception
- π¦ Data: Google Drive
- π€ Demo: Hugging Face Space
- πΏ RHOS Branch: LLM_Inception
Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language Models
- π Paper: arXiv:2412.04939
- π» GitHub: RHOS/Verb-Mirage branch
Fine-grained spatio-temporal activity understanding based on AVA videos, as part of the HAKE project.
- π Paper: HAKE-GIO
- π Website: HAKE
- π» GitHub: DirtyHarryLYL/HAKE-AVA
- π€ Dataset: HAKE-GIO
- π¦ Dataset Guide: HAKE-AVA DATASET.md
- πΏ RHOS Branch: HAKE-AVA