北京通用人工智能研究院BIGAI

科研成果

A Contextual Combinatorial Bandit Approach to Negotiation

Fast Peer Adaptation with Context-aware Exploration

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning

End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations

An Embodied Generalist Agent in 3D World

MEWL: Few-shot multimodal word learning with referential uncertainty