Publications

Publications by category in reverse chronological order. * indicates equal contribution.

2024

  1. Preprint
    Configurable Foundation Models: Building LLMs from a Modular Perspective
    Chaojun Xiao, Zhengyan Zhang, Chenyang Song, Dazhi Jiang, Feng Yao, Xu Han, Xiaozhi Wang, Shuo Wang, Yufei Huang, Guanyu Lin, Yingfa Chen, Weilin Zhao, Yuge Tu, Zexuan Zhong, Ao Zhang, Chenglei Si, Khai Hao Moo, Chenyang Zhao, Huimin Chen, Yankai Lin, Zhiyuan Liu, Jingbo Shang, and Maosong Sun
    Preprint
  2. Preprint
    TemporalBench: Towards Fine-grained Temporal Understanding for Multimodal Video Models
    Mu Cai, Reuben Tan, Jianrui Zhang, Bocheng Zou, Kai Zhang, Feng Yao, Fangrui Zhu, Jing Gu, Yiwu Zhong, Yuzhang Shang, Yao Dou, Jaden Park, Jianfeng Gao, Yong Jae Lee, and Jianwei Yang
    Preprint
  3. Preprint
    ShifCon: Enhancing Non-Dominant Language Capabilities with a Shift-based Contrastive Framework
    Hengyuan Zhang, Chenming Shang, Sizhe Wang, Dongdong Zhang, Feng Yao, Renliang Sun, Yiyao Yu, Yujiu Yang, and Furu Wei
    Preprint
  4. Preprint
    MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks
    Letian Peng, Zilong Wang, Feng Yao, Zihan Wang, and Jingbo Shang
    Preprint
  5. EMNLP 2024
    Data Contamination Can Cross Language Barriers
    Feng Yao*, Yufan Zhuang*, Zihao Sun, Sunan Xu, Animesh Kumar, and Jingbo Shang
    EMNLP 2024
  6. ACL 2024
    Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMs
    Shang Zhou*, Feng Yao*, Chengyu Dong, Zihan Wang, and Jingbo Shang
    Findings of ACL 2024
  7. ACL 2024
    Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph
    Xiaochen Gao*, Feng Yao*, Kewen Zhao, Beilei He, Animesh Kumar, Vish Krishnan, and Jingbo Shang
    ACL 2024 (Oral)

2023

  1. AAAI 2023
    Unsupervised Legal Evidence Retrieval via Contrastive Learning with Approximate Aggregated Positive
    Feng Yao, Jingyuan Zhang, Yating Zhang, Xiaozhong Liu, Changlong Sun, Yun Liu, and Weixing Shen
    AAAI 2023 (Oral)
  2. ACL 2023
    The Devil is in the Details: On the Pitfalls of Event Extraction Evaluation
    Hao Peng*, Xiaozhi Wang*, Feng Yao*, Kaisheng Zeng, Lei Hou, Juanzi Li, Zhiyuan Liu, and Weixing Shen
    Findings of ACL 2023
  3. EMNLP 2023
    OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding
    Hao Peng, Xiaozhi Wang, Feng Yao, Zimu Wang, Chuzhao Zhu, Kaisheng Zeng, Lei Hou, and Juanzi Li
    EMNLP 2023 Demo
  4. CIKM 2023
    MUSER: A Multi-View Similar Case Retrieval Dataset
    Qingquan Li, Yiran Hu, Feng Yao, Chaojun Xiao, Zhiyuan Liu, Maosong Sun, and Weixing Shen
    CIKM 2023 (Best Resource Paper Honorable Mention)
  5. ICAIL 2023
    EQUALS: A Real-World Dataset for Legal Question Answering via Reading Laws
    Andong Chen, Feng Yao, Xinyan Zhao, Yating Zhang, Changlong Sun, and Weixing Shen
    International Conference on AI and Law (Oral)

2022

  1. ACL 2022
    LEVEN: A Large-Scale Chinese Legal Event Detection Dataset
    Feng Yao*, Chaojun Xiao*, Xiaozhi Wang, Zhiyuan Liu, Lei Hou, Cunchao Tu, Juanzi Li, Yun Liu, Weixing Shen, and Maosong Sun
    Findings of ACL 2022