VideoChat-r1.5: visual test-time scaling to reinforce multimodal reasoning by iterative perception Oct 11, 2025· Ziang Yan , Xinhao Li , Yinan He , Zhengrong Yue , Xiangyu Zeng , Yali Wang , Yu Qiao Limin Wang , Yi Wang · 0 min read Cite URL Type Conference paper Publication Proceedings of the Neural Information Processing Systems Last updated on Oct 11, 2025 Authors Limin Wang Nanjing University ← StreamForest: efficient online video understanding with persistent event memory Oct 11, 2025 Correspondence as video: test-time adaption on SAM2 for reference segmentation in the wild Aug 12, 2025 →