--- license: mit tags: - text-game - world-model - rlvr datasets: - thuml/bytesized32-world-model-cot base_model: - thuml/bytesized32-world-model-sft --- See https://github.com/thuml/RLVR-World for examples for using this model. ## Citation ``` @article{wu2025rlvr, title={RLVR-World: Training World Models with Reinforcement Learning}, author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long}, journal={arXiv preprint arXiv:2505.13934}, year={2025}, }