北京通用人工智能研究院BIGAI

科研成果

Artificial Social Intelligence: A Comparative and Holistic View

Multi-Agent Reinforcement Learning is A Sequence Modeling Problem

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning 

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning 

A Unified Diversity Measure for Multiagent Reinforcement Learning

Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation