北京通用人工智能研究院BIGAI

科研成果

VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges

TACO: Taming Diffusion for in-the-wild Video Amodal Completion

GWM: Towards Scalable Gaussian World Models for Robotic Manipulation

Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing

Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding

Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation