Beijing Institute for General Artificial Intelligence (BIGAI)

Research Achievements

A Contextual Combinatorial Bandit Approach to Negotiation

Fast Peer Adaptation with Context-aware Exploration

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning

ProAgent: Building Proactive Cooperative Agents with Large Language Models

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

Maximum Entropy Heterogeneous-Agent Reinforcement Learning