北京通用人工智能研究院BIGAI

科研成果

Interactive Visual Reasoning Under Uncertainty

Active Reasoning in an Open-World Environment

Evaluating and Inducing Personality in Pre-trained Language Models

Learning non-Markovian Decision-Making from State-only Sequences

Multi-Agent Reinforcement Learning is A Sequence Modeling Problem

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning