Constrained Update Projection Approach to Safe Policy Optimization Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan NeurIPS 2022 下载 查看更多
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuang Jiang, Stephen Marcus McAleer, Yiran Geng, Hao Dong, Zongqing Lu, Song-Chun Zhu, Yaodong Yang NeurIPS Datasets and Benchmarks 2022 下载 查看更多
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control Xuehai Pan, Mickel Liu, fangwei zhong, Yaodong Yang, Song-Chun Zhu, Yizhou Wang NeurIPS Datasets and Benchmarks 2022 下载 查看更多
TarGF: Learning Target Gradient Field for Object Rearrangement Mingdong Wu, Fangwei Zhong, Yulong Xia, Hao Dong NeurIPS 2022 下载 查看更多
LIGS:Learning Intrinstic-reward Generation Selection for Multi-Agent Learning David Henry Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez-Nieves, Oliver Slumbers, Feifei Tong, Yang Li, Jiangcheng Zhu, Yaodong Yang, Jun Wang ICLR 2022 下载 查看更多
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind Yuanfei Wang, Fangwei zhong, Jing Xu, Yizhou Wang ICLR 2022 下载 查看更多