Information Extraction(IE)aims to extract structural knowledge from plain natural language texts.Recently,generative Large Language Models(LLMs)have demonstrated remarkable capabilities in text understanding and gener...Information Extraction(IE)aims to extract structural knowledge from plain natural language texts.Recently,generative Large Language Models(LLMs)have demonstrated remarkable capabilities in text understanding and generation.As a result,numerous works have been proposed to integrate LLMs for IE tasks based on a generative paradigm.To conduct a comprehensive systematic review and exploration of LLM efforts for IE tasks,in this study,we survey the most recent advancements in this field.We first present an extensive overview by categorizing these works in terms of various IE subtasks and techniques,and then we empirically analyze the most advanced methods and discover the emerging trend of IE tasks with LLMs.Based on a thorough review conducted,we identify several insights in technique and promising research directions that deserve further exploration in future studies.We maintain a public repository and consistently update related works and resources on GitHub(LLM4IE repository).展开更多
基金supported in part by the grants from the National Natural Science Foundation of China(Nos.62222213,62072423)partially supported by Research Impact Fund(No.R1015-23),APRC-CityU New Research Initiatives(No.9610565,Start-up Grant for New Faculty of CityU)+7 种基金CityU-HKIDS Early Career Research Grant(No.9360163)Hong Kong ITC Innovation and Technology Fund Midstream Research Programme for Universities Project(No.ITS/034/22MS)Hong Kong Environmental and Conservation Fund(No.88/2022)SIRG-CityU Strategic Interdisciplinary Research Grant(No.7020046)Huawei(Huawei Innovation Research Program),Tencent(CCFTencent Open Fund,Tencent Rhino-Bird Focused Research Program),Ant Group(CCF-Ant Research Fund,Ant Group Research Fund)Alibaba(CCFAlimama Tech Kangaroo Fund(No.2024002))CCF-BaiChuan-Ebtech Foundation Model FundKuaishou.
文摘Information Extraction(IE)aims to extract structural knowledge from plain natural language texts.Recently,generative Large Language Models(LLMs)have demonstrated remarkable capabilities in text understanding and generation.As a result,numerous works have been proposed to integrate LLMs for IE tasks based on a generative paradigm.To conduct a comprehensive systematic review and exploration of LLM efforts for IE tasks,in this study,we survey the most recent advancements in this field.We first present an extensive overview by categorizing these works in terms of various IE subtasks and techniques,and then we empirically analyze the most advanced methods and discover the emerging trend of IE tasks with LLMs.Based on a thorough review conducted,we identify several insights in technique and promising research directions that deserve further exploration in future studies.We maintain a public repository and consistently update related works and resources on GitHub(LLM4IE repository).