Beijing Institute for General Artificial Intelligence (BIGAI)

Paper Index

A Contextual Combinatorial Bandit Approach to Negotiation

Fast Peer Adaptation with Context-aware Exploration

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning

End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations

An Embodied Generalist Agent in 3D World

CDM-MPC: An Integrated Dynamic Planning and Control Framework for Bipedal Robots Jumping