Mind the Gap: The Divergence Between Human and LLM-Generated Tasks Yi-Long Lu, Jiajun Song, Chunhui Zhang, Wei Wang† AAAI 2026 下载 查看更多
TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials Bofei Zhang*, Zirui Shang*, Zhi Gao*, Wang Zhang, Rui Xie, Xiaojian Ma, Tao Yuan, Xinxiao Wu, Song-Chun Zhu, Qing Li† AAAI 2026 下载 查看更多
Reasoning with Exploration: An Entropy Perspective Daixuan Cheng*, Shaohan Huang*, Xuekai Zhu, Bo Dai, Wayne Xin Zhao✉, Zhenliang Zhang✉, Furu Wei ICLR 2026 下载 查看更多
ADAPT: Adaptive Decentralized Architecture with Perception-aligned Training for Structural Generalization in Multi-Agent RL Zhixiang Zhang*, Shuo Chen✉, Yexin Li, Feng Wang✉, AAAI 2026 下载 查看更多
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints Andrew Zhao, Quentin Xu, Matthieu Liu, Shenzhi Wang, Yong-jin Liu, Zilong Zheng✉, and Gao Huang✉ AAAI 2025 下载 查看更多
STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning Sirui Chen*, Zhaowei Zhang*, Yaodong Yang†, Yali Du† AAAI 2024 下载 查看更多