Beijing Institute for General Artificial Intelligence (BIGAI)

Paper Index

SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

Unifying 3D Vision-Language Understanding via Promptable Queries

LangSuit·E: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments

Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels

Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling

LooGLE: Can Long-Context Language Models Understand Long Contexts?