LongVPO: from anchored cues to self-reasoning for long-form video preference optimization

Oct 11, 2025·

Zhenpeng Huang

,

Jiaqi Li

,

Zihan Jia

,

Xinhao Li

,

Desen Meng

,

Lingxue Song

,

Xi Chen

,

Liang Li

Limin Wang

Limin Wang

· 0 min read

Cite URL

Type

Conference paper

Publication

Proceedings of the Neural Information Processing Systems

Last updated on Oct 11, 2025

Limin Wang

Authors

Nanjing University

← Gated integration of low-rank adaptation for continual learning of language models Oct 11, 2025

Loquetier: a virtualized multi-LoRA framework for unified LLM fine-tuning and serving Oct 11, 2025 →