In-Context Editing: Learning Knowledge from Self-Induced Distributions Siyuan Qi*✉, Bangcheng Yang*, Kailin Jiang, Xiaobo Wang, Jiaqi Li, Yifan Zhong, Yaodong Yang, and Zilong Zheng✉ ICLR 2025
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints Andrew Zhao, Quentin Xu, Matthieu Liu, Shenzhi Wang, Yong-jin Liu, Zilong Zheng✉, and Gao Huang✉ AAAI 2025