Towards Brigding the Gap Between Large-scale Pretraining and Efficient Finetuning for Humanoid Control Weidong Huang, Zhehan Li, Hangxin Liu, Biao Hou, Yao Su, Jingwen Zhang † ICLR 2026 下载 查看更多
STVG-R1: Incentivizing Instance-Level Reasoning and Grounding in Videos via Reinforcement Learning Xiaowen Zhang, Zhi Gao, Licheng Jiao, Lingling Li, Qing Li ICLR 2026 下载 查看更多
MVR:Multi-view Video Reward Shaping for Reinforcement Learning Lirui Luo*Guoxi Zhang*, Hongming Xu, Yaodong Yang, Cong Fang†, Qing Li† ICLR 2026 下载 查看更多
MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning Yapeng Mi, Hengli Li, Yanpeng Zhao†, Chenxi Li, Huimin Wu, Xianjian Ma, Song-Chun Zhu, YingNian Wu, Qing Li† ICLR 2026 下载 查看更多
When Large Multimodal Models Confront Evolving Knowledge: Challenges and Explorations Kailin Jiang*, Yuntao Du*, Yukai Ding, Yuchen Ren, Ning Jiang, Zhi Gao, Zilong Zheng, Lei Liu†, Bin Li, Qing Li† ICLR 2026 下载 查看更多
Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts Xianwei Cao, Dou Quan, Zhenliang Zhang† , Shuang Wang ICLR 2026 下载 查看更多