[codex] implement AdaVLA-RL adaptive compute by scout-123-china · Pull Request #154 · moojink/openvla-oft

scout-123-china · 2026-05-14T06:03:01Z

Summary

Add AdaVLA-RL adaptive compute policy, learned token scoring, compute-cost accounting, and PPO helpers.
Integrate dynamic visual-token budgets, language-layer depth control, and optional fast-path inference into the OpenVLA HF model.
Add offline and LIBERO online RL training scripts plus evaluation/deployment loading paths for compute-policy checkpoints.
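The summary does not show how the visual-token budget or the compute-cost accounting are computed. As a rough illustration only, the sketch below assumes that token scores are plain floats, that a budget keeps the top-k scored tokens, and that cost is the fraction of tokens kept multiplied by the fraction of language layers run. The names `select_tokens`, `relative_cost`, and `penalized_reward` are hypothetical and do not come from the PR; the actual code in prismatic/models/adaptive_compute.py may work quite differently.

```python
from typing import List, Sequence


def select_tokens(scores: Sequence[float], budget: int) -> List[int]:
    """Return indices of the `budget` highest-scoring tokens, in original order.

    Hypothetical helper: assumes one scalar score per visual token.
    """
    if budget < 0:
        raise ValueError("budget must be non-negative")
    k = min(budget, len(scores))
    # Rank by score, highest first; ties go to the lower index.
    ranked = sorted(range(len(scores)), key=lambda i: (-scores[i], i))
    # Restore the original token order so positional structure is preserved.
    return sorted(ranked[:k])


def relative_cost(tokens_kept: int, tokens_total: int,
                  layers_used: int, layers_total: int) -> float:
    """Assumed cost model: fraction of tokens kept times fraction of layers run."""
    if tokens_total <= 0 or layers_total <= 0:
        raise ValueError("totals must be positive")
    return (tokens_kept / tokens_total) * (layers_used / layers_total)


def penalized_reward(task_reward: float, cost: float, cost_weight: float) -> float:
    """Assumed RL reward shaping: task reward minus a weighted compute cost."""
    return task_reward - cost_weight * cost


# Example: keep 2 of 4 visual tokens and run 16 of 32 language layers.
kept = select_tokens([0.1, 0.9, 0.4, 0.7], budget=2)
cost = relative_cost(len(kept), 4, layers_used=16, layers_total=32)
reward = penalized_reward(1.0, cost, cost_weight=0.5)
```

Under this assumed cost model, a policy trained with PPO on `penalized_reward` would be pushed toward smaller token budgets and shallower depth whenever the task reward does not suffer; `cost_weight` sets that trade-off.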

Validation

python -m ruff check --select F,E9,B008 prismatic/models/adaptive_compute.py prismatic/extern/hf/modeling_prismatic.py vla-scripts/finetune.py vla-scripts/train_adavla_rl.py vla-scripts/train_adavla_rl_libero.py experiments/robot/openvla_utils.py experiments/robot/robot_utils.py experiments/robot/libero/run_libero_eval.py vla-scripts/deploy.py
python -m compileall prismatic experiments vla-scripts

Direct push to moojink/openvla-oft was denied for the current account, so this PR is opened from the fork scout-123-china/openvla-oft.

sll019950225 and others added 3 commits May 14, 2026 14:00

implement AdaVLA-RL adaptive compute

872667a

fix adaptive compute DDP unused params

ee9859a

document AdaVLA-RL smoke run

08def29