LongVPO: from anchored cues to self-reasoning for long-form video preference optimization Oct 11, 2025· Zhenpeng Huang , Jiaqi Li , Zihan Jia , Xinhao Li , Desen Meng , Lingxue Song , Xi Chen , Liang Li Limin Wang · 0 min read Cite URL Type Conference paper Publication Proceedings of the Neural Information Processing Systems Last updated on Oct 11, 2025 Authors Limin Wang Nanjing University ← Gated integration of low-rank adaptation for continual learning of language models Oct 11, 2025 Loquetier: a virtualized multi-LoRA framework for unified LLM fine-tuning and serving Oct 11, 2025 →