Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs Zhaowei Zhang, Fengshuo Bai, Qizhi Chen, Chengdong Ma, Mingzhi Wang, Haoran Sun, Zilong Zheng✉, and Yaodong Yang✉ ICLR 2025
In-Context Editing: Learning Knowledge from Self-Induced Distributions Siyuan Qi*✉, Bangcheng Yang*, Kailin Jiang, Xiaobo Wang, Jiaqi Li, Yifan Zhong, Yaodong Yang, and Zilong Zheng✉ ICLR 2025
Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games Fanqi Kong, Yizhe Huang, Song-Chun Zhu, Siyuan Qi, Xue Feng† NeurIPS 2024
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making Yizhe Huang*, Xingbo Wang*, Hao Liu, Fanqi Kong, Aoyang Qin, Min Tang, Xiaoxi Wang, Song-Chun Zhu, Mingjie Bi, Siyuan Qi, Xue Feng† NeurIPS Datasets and Benchmarks 2024
Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy Zhenyu Guan, Xiangyu Kong†, Fangwei Zhong†, Yizhou Wang NeurIPS 2024
STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning Sirui Chen*, Zhaowei Zhang*, Yaodong Yang†, Yali Du† AAAI 2024