Wan-R1: A Reasoning-via-Video Maze-Solving Model

Fine-tuned on VR-Bench to evaluate and enhance video-based reasoning ability across structured maze environments.

Project GitHub HuggingFace

πŸ“° News

  • 2025-11-20: Released 5 fine-tuned Wan-R1 models (3D, Regular, Irregular, Sokoban, Trapfield) trained on VR-Bench.
  • 2025-12: In-progress: preparing codebase for fine-tuning and evaluation release.

πŸ”§ Future Work

  • πŸ“¦ Release LoRA fine-tuning scripts based on VR-Bench.
  • πŸ“Š Open-source evaluation toolkit for reasoning via video.
  • πŸ“ Provide training logs & hyperparameters for full reproducibility.

🧠 Models

Model Download Description
Wan_R1_3d_maze_5B πŸ€— HuggingFace Fine-tuned LoRA for Maze3D tasks (easy, medium, and hard) from the base model Wan2.2-TI2V-5B.
Wan_R1_irregular_maze_5B πŸ€— HuggingFace Fine-tuned LoRA for PathFinder tasks (easy, medium, and hard) from base model Wan2.2-TI2V-5B.
Wan_R1_regular_maze_5B πŸ€— HuggingFace Fine-tuned LoRA for Maze tasks (easy, medium, and hard) from base model Wan2.2-TI2V-5B.
Wan_R1_sokoban_5B πŸ€— HuggingFace Fine-tuned LoRA for Sokoban tasks (easy, medium, and hard) from base model Wan2.2-TI2V-5B.
Wan_R1_trapfield_5B πŸ€— HuggingFace Fine-tuned LoRA for TrapField tasks (easy, medium, and hard) from base model Wan2.2-TI2V-5B.

πŸ“‘ Citation

If you use this model or the VR-Bench dataset in your work, please cite:

πŸ“„ Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks


@misc{yang2025reasoningvideoevaluationvideo,
      title={Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks}, 
      author={Cheng Yang and Haiyuan Wan and Yiran Peng and Xin Cheng and Zhaoyang Yu and Jiayi Zhang and Junchi Yu and Xinlei Yu and Xiawu Zheng and Dongzhan Zhou and Chenglin Wu},
      year={2025},
      eprint={2511.15065},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2511.15065}, 
}

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for HY-Wan/Wan-R1

Finetuned
(12)
this model