We are continuously expanding the World Model method library in our framework.
The representative methods currently being integrated are listed below. We warmly welcome more developers to submit PRs and help us improve the ecosystem together!
| Method | Paper | Code Repository | Main Direction / Task |
|---|---|---|---|
| Infinite-World | paper | code | Long Navigation Video Generation |
| MatrixGame-3 | paper | code | Long Navigation Video Generation |
| Solaris | paper | code | Navigation Video Generation (multiplayer Minecraft world modeling) |
| Cambrian-S | paper | code | Spatial Reasoning |
| Ctrl-World | paper | code | VLA / Robot Manipulation (controllable generative world model) |
| FantasyWorld | paper | code | 3D Scene Generation (unified video and 3D prediction) |
| MemFlow | paper | code | Long video generation |
If you have methods youβd like to integrate (e.g., new world models, evaluation benchmarks, or inference pipelines), feel free to open an Issue or submit a PR directly.
Status Legend: β Completed; π§ In Progress; π Ongoing; π Not Started.
- β Integrate multiple world model tasks into a single unified framework.
- π Continuously add cutting-edge and valuable world model-related works to OpenWorldLib.
- π§ Support more works related to long-video generation.
- π§ Support more works related to autonomous driving.
- π§ Optimize the contribution workflow to make it easier for other contributors to submit PRs, and continuously refactor the OpenWorldLib codebase.
- π Provide comprehensive documentation and detailed explanations for the interactions within each pipeline.
- π Support inference data parallelism across all pipelines.
- π Streamline the codebase and simplify the outputs of the pipelines.
- The current repository may still contain some redundant code, which we will further streamline;
- The interactions between different modules are not yet seamless enough. We will work towards enabling more flexible and natural integration between components;
- The performance of some existing methods still requires further improvement, specifically:
- The rendering effectiveness of vggt should improve;
-
Continuous Benchmark Enhancement
We will further improve the current benchmark suite by expanding task coverage, refining evaluation dimensions, and strengthening data quality control. We also plan to increase dataset scale to improve robustness and discriminative power of evaluation results. -
Optimization for Multi-Stream Generation
Since interaction signals are typically variable-length inputs, the current framework is not yet optimal for high-concurrency generation. We will improve support for multi-stream generation through better scheduling, batch alignment, and throughput optimization. -
Training Framework Development
OpenWorldLib primarily aims to provide a standardized and extensible framework for world models and their inference pipelines, with a strong emphasis on comprehensiveness. In the next phase, we will introduce world model training frameworks built on several mainstream foundation models. -
Datasets and Tooling Release
Along with the training framework, we will release corresponding datasets, preprocessing pipelines, and starter examples to help users with reproduction, fine-tuning, and further development.