Beijing Institute for General Artificial Intelligence (BIGAI)

Giving machines a mind
Giving the humanities reason

Aegis: Automated Error Generation and Identification for Multi-Agent Systems

DexMove: Learning Tactile-Guided Non-Prehensile Manipulation with Dexterous Hands

Towards Bridging the Gap Between Large-scale Pretraining and Efficient Finetuning for Humanoid Control

STVG-R1: Incentivizing Instance-Level Reasoning and Grounding in Videos via Reinforcement Learning

MVR: Multi-view Video Reward Shaping for Reinforcement Learning

MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning