Multi-Agent Reinforcement Learning is A Sequence Modeling Problem Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang NeurIPS 2022 下载 查看更多
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang NeurIPS 2022 下载 查看更多
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang NeurIPS 2022 下载 查看更多
A Unified Diversity Measure for Multiagent Reinforcement Learning Zongkai Liu, Chao Yu, Yaodong Yang, Peng Sun, Zifan Wu, Yuan Li NeurIPS 2022 下载 查看更多
Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation Zhizhou Ren, Anji Liu, Yitao Liang, Jian Peng, Jianzhu Ma NeurIPS 2022 下载 查看更多
Constrained Update Projection Approach to Safe Policy Optimization Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan NeurIPS 2022 下载 查看更多