Beijing Institute for General Artificial Intelligence (BIGAI)

Research Achievements

Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage

MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge

Robust Data Clustering with Outliers via Transformed Tensor Low-Rank Representation

FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

UltraEdit: Instruction-based Fine-Grained Image Editing at Scale