Xiangyu Zeng, Kefan Qiu, Qingyu Zhang, Xinhao Li, Jing Wang, Jiaxin Li, Ziang Yan, Kun Tian, Meng Tian, Xinhai Zhao, et al. (2025). StreamForest: efficient online video understanding with persistent event memory. Proceedings of the Neural Information Processing Systems.
Guo Chen, Zhiqi Li, Shihao Wang, Jindong Jiang, Yicheng Liu, Lidong Lu, De-an Huang, Wonmin Byeon, Matthieu Le, Tuomas Rintamaki, et al. (2025). Eagle 2.5: boosting long-context post-training for frontier vision-language models. Proceedings of the Neural Information Processing Systems.
Jiashuo Yu, Yue Wu, Meng Chu, Zhifei Ren, Zizheng Huang, Pei Chu, Ruijie Zhang, Yinan He, Qirui Li, Songze Li, Zhenxiang Li, Zhongying Tu, Conghui He, Yu Qiao, Yali Wang, Yi Wang, Limin Wang (2025). VRBench: a benchmark for multi-step reasoning in long narrative videos. Proceedings of the IEEE/CVF International Conference on Computer Vision.