VideoChat-r1.5: visual test-time scaling to reinforce multimodal reasoning by iterative perception

2025年10月11日·
Ziang Yan
,
Xinhao Li
,
Yinan He
,
Zhengrong Yue
,
Xiangyu Zeng
,
Yali Wang
,
Yu Qiao
Limin Wang
Limin Wang
,
Yi Wang
· 0 分钟阅读时长
类型
出版物
Proceedings of the Neural Information Processing Systems