Publications

Ziang Yan, Xinhao Li, Yinan He, Zhengrong Yue, Xiangyu Zeng, Yali Wang, Yu Qiao, Limin Wang, Yi Wang (2025). VideoChat-R1.5: visual test-time scaling to reinforce multimodal reasoning by iterative perception. Proceedings of the Neural Information Processing Systems.

Xiangyu Zeng, Kefan Qiu, Qingyu Zhang, Xinhao Li, Jing Wang, Jiaxin Li, Ziang Yan, Kun Tian, Meng Tian, Xinhai Zhao, et al. (2025). StreamForest: efficient online video understanding with persistent event memory. Proceedings of the Neural Information Processing Systems.

Chenhui Zhu, Yilu Wu, Shuai Wang, Gangshan Wu, Limin Wang (2025). MotionRAG: motion retrieval-augmented image-to-video generation. Proceedings of the Neural Information Processing Systems.

Yuchen Zhang, Hanyue Du, Chun Cao, Jingwei Xu (2025). Loquetier: a virtualized multi-LoRA framework for unified LLM fine-tuning and serving. Proceedings of the Neural Information Processing Systems.

Zhenpeng Huang, Jiaqi Li, Zihan Jia, Xinhao Li, Desen Meng, Lingxue Song, Xi Chen, Liang Li, Limin Wang (2025). LongVPO: from anchored cues to self-reasoning for long-form video preference optimization. Proceedings of the Neural Information Processing Systems.

Yan-Shuo Liang, Wu-Jun Li (2025). Gated integration of low-rank adaptation for continual learning of language models. Proceedings of the Neural Information Processing Systems.

Yuping He, Yifei Huang, Guo Chen, Baoqi Pei, Jilan Xu, Tong Lu, Jiangmiao Pang (2025). EgoExoBench: a benchmark for first- and third-person view video understanding in MLLMs. Proceedings of the Neural Information Processing Systems.

Guo Chen, Zhiqi Li, Shihao Wang, Jindong Jiang, Yicheng Liu, Lidong Lu, De-an Huang, Wonmin Byeon, Matthieu Le, Tuomas Rintamaki, et al. (2025). Eagle 2.5: boosting long-context post-training for frontier vision-language models. Proceedings of the Neural Information Processing Systems.

Namkyeong Lee, Yunhak Oh, Heewoong Noh, Gyoung S Na, Minkai Xu, Hanchen Wang, Tianfan Fu, Chanyoung Park (2025). 3D interaction geometric pre-training for molecular relational learning. Proceedings of the Neural Information Processing Systems.

Jiashuo Yu, Yue Wu, Meng Chu, Zhifei Ren, Zizheng Huang, Pei Chu, Ruijie Zhang, Yinan He, Qirui Li, Songze Li, Zhenxiang Li, Zhongying Tu, Conghui He, Yu Qiao, Yali Wang, Yi Wang, Limin Wang (2025). VRBench: a benchmark for multi-step reasoning in long narrative videos. Proceedings of the IEEE/CVF International Conference on Computer Vision.
