VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding Yue Fan*, Xiaojian Ma*†, Rujie Wu, Yuntao Du, Jiaqi Li, Zhi Gao, Qing Li ECCV 2024 下载 查看更多
End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, Qing Li ICML 2024 下载 查看更多
CLOVA: A Closed-Loop Visual Assistant with Tool Usage and Update Zhi Gao, Yuntao Du, Xintong Zhang, Xiaojian Ma, Wenjuan Han, Song-chun Zhu, Qing Li CVPR 2024 下载 查看更多
Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World Rujie Wu, Xiaojian Ma, Qing Li, Zhenliang Zhang, Wei Wang, Song-Chun Zhu, Yizhou Wang ICLR 2024 下载 查看更多
Neural-Symbolic Recursive Machine for Systematic Generalization Qing Li, Yixin Zhu, Yitao Liang, Ying Nian Wu, Song-Chun Zhu, Siyuan Huang ICLR 2024 下载 查看更多
Adapting Large Language Models via Reading Comprehension Daixuan Cheng, Shaohan Huang, Furu Wei ICLR 2024 下载 查看更多