北京通用人工智能研究院BIGAI

为机器立心
为人文赋理

Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V

Simulating Human-like Daily Activities with Desire-driven Autonomy

Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting

Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs

MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge