F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions 发表评论 / Research / 作者: Fang Peng
VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding 发表评论 / Research / 作者: Fang Peng
SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding 发表评论 / Research / 作者: Fang Peng
LangSuit·E: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments 发表评论 / Research / 作者: Fang Peng
Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels 发表评论 / Research / 作者: Fang Peng