Linking Process to Outcome: Conditional Reward Modeling for LLM Reasoning Zheng Zhang, Ziwei Shan, Kaitao Song, Yexin Li†, Kan Ren† ICLR 2026
Aegis: Automated Error Generation and Identification for Multi-Agent Systems Fanqi Kong∗, Ruijie Zhang∗, Huaxiao Yin, Guibin Zhang, Xiaofei Zhang, Ziang Chen, Zhaowei Zhang, Xiaoyuan Zhang, Song-Chun Zhu, Xue Feng† ICLR 2026
ADAPT: Adaptive Decentralized Architecture with Perception-aligned Training for Structural Generalization in Multi-Agent RL Zhixiang Zhang*, Shuo Chen✉, Yexin Li, Feng Wang✉ AAAI 2026
World Models Should Prioritize the Unification of Physical and Social Dynamics Xiaoyuan Zhang*, Chengdong Ma, Yizhe Huang, Weidong Huang, Siyuan Qi, Song-Chun Zhu†, Xue Feng†, Yaodong Yang† NeurIPS position paper track 2025
Social World Model-Augmented Mechanism Design Policy Learning Xiaoyuan Zhang*, Yizhe Huang*, Chengdong Ma, Zhixun Chen, Long Ma, Yali Du, Song-Chun Zhu, Yaodong Yang†, Xue Feng† NeurIPS 2025
Simulating Human-like Daily Activities with Desire-driven Autonomy Yiding Wang*, Yuxuan Chen*, Fangwei Zhong✉, Long Ma, Yizhou Wang ICLR 2025