Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning. Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang. NeurIPS 2022.
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning. Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang. NeurIPS 2022.
A Unified Diversity Measure for Multiagent Reinforcement Learning. Zongkai Liu, Chao Yu, Yaodong Yang, Peng Sun, Zifan Wu, Yuan Li. NeurIPS 2022.
Constrained Update Projection Approach to Safe Policy Optimization. Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan. NeurIPS 2022.
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning. Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuang Jiang, Stephen Marcus McAleer, Yiran Geng, Hao Dong, Zongqing Lu, Song-Chun Zhu, Yaodong Yang. NeurIPS Datasets and Benchmarks 2022.
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control. Xuehai Pan, Mickel Liu, Fangwei Zhong, Yaodong Yang, Song-Chun Zhu, Yizhou Wang. NeurIPS Datasets and Benchmarks 2022.