北京通用人工智能研究院BIGAI

科研成果

DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on Simulation

Multi-Agent Reinforcement Learning is A Sequence Modeling Problem

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning 

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning 

A Unified Diversity Measure for Multiagent Reinforcement Learning

Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation