北京通用人工智能研究院BIGAI

科研成果

STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning

Heterogeneous Value Alignment Evaluation for Large Language Models

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

A Contextual Combinatorial Bandit Approach to Negotiation

Fast Peer Adaptation with Context-aware Exploration

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning