STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning Sirui Chen*, Zhaowei Zhang*, Yaodong Yang†, Yali Du† AAAI 2024 下载 查看更多
Heterogeneous Value Alignment Evaluation for Large Language Models Zhaowei Zhang*, Ceyao Zhang*†, Nian Liu, Siyuan Qi, Ziqi Rong, Song-Chun Zhu, Shuguang Cui, Yaodong Yang‡ AAAI 2024 2024 下载 查看更多
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL Fangwei Zhong*†, Kui Wu, Hai Ci, Churan Wang, Hao Chen ECCV 2024 下载 查看更多
A Contextual Combinatorial Bandit Approach to Negotiation Yexin Li, Zhancun Mu, Siyuan Qi† ICML 2024 下载 查看更多
Fast Peer Adaptation with Context-aware Exploration Long Ma*, Yuanfei Wang*, Fangwei Zhong†, Song-chun Zhu, Yizhou Wang ICML 2024 下载 查看更多
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning Yizhe Huang*, Fanqi Kong, Song-chun Zhu, Xue Feng† ICML 2024 下载 查看更多