北京通用人工智能研究院BIGAI

科研成果

RSPT: Reconstruct Surroundings and Predict Trajectories for Generalizable Active Object Tracking

Proactive Multi-Camera Collaboration for 3D Human Pose Estimation

Multi-Agent Reinforcement Learning is A Sequence Modeling Problem

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning 

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning 

A Unified Diversity Measure for Multiagent Reinforcement Learning