Beijing Institute for General Artificial Intelligence (BIGAI)

Research Achievements

A Unified Diversity Measure for Multiagent Reinforcement Learning

Constrained Update Projection Approach to Safe Policy Optimization

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control

TarGF: Learning Target Gradient Field for Object Rearrangement

LIGS: Learning Intrinsic-reward Generation Selection for Multi-Agent Learning