Hi, thanks for the great work!
From the code, it seems that the OSWorld environment in the training loop is currently tied to Volcengine for distributed deployment. Would it be possible to support a local, Docker-based distributed deployment as an alternative? This would be very helpful for users who want to run the training pipeline on their own infrastructure.