ControlVLA: Few-shot Object-centric Adaptation for Pre-trained Vision-Language-Action Models Puhao Li*, Yingying Wu, Ziheng Xi, Wanlin Li, Yuzhe Huang, Zhiyuan Zhang, Yinghan Chen, Jianan Wang, Song-Chun Zhu, Tengyu Liu†, Siyuan Huang† CoRL 2025
Ag2x2: A Robust Agent-Agnostic Visual Representation Boosts Zero-Shot Learning of Bimanual Robotic Manipulation Ziyin Xiong*, Yinghan Chen*, Puhao Li, Yixin Zhu, Tengyu Liu✉, Siyuan Huang✉ IROS 2025
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts Yuxuan Wang, Yueqian Wang, Bo Chen, Tong Wu, Dongyan Zhao, Zilong Zheng✉ CVPR 2025
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing Jinlu Zhang, Yixin Chen✉, Zan Wang, Jie Yang, Yizhou Wang✉, Siyuan Huang✉ CVPR 2025
Dynamic Motion Blending for Versatile Motion Editing Nan Jiang*, Hongjie Li*, Ziye Yuan*, Zimo He, Yixin Chen, Tengyu Liu, Yixin Zhu✉, Siyuan Huang✉ CVPR 2025
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes Ruijie Lu*†, Yixin Chen*, Junfeng Ni, Baoxiong Jia, Yu Liu, Diwen Wan, Gang Zeng, Siyuan Huang CVPR 2025