# Awesome-LLM4IE-Papers **Repository Path**: orangego/Awesome-LLM4IE-Papers ## Basic Information - **Project Name**: Awesome-LLM4IE-Papers - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-10-27 - **Last Updated**: 2025-10-27 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Awesome-LLM4IE-Papers 🔥🔥🔥 **The article has been accepted by Frontiers of Computer Science (FCS)**. --- Awesome papers about generative Information extraction using LLMs

The organization of papers is discussed in our survey: [Large Language Models for Generative Information Extraction: A Survey](https://link.springer.com/article/10.1007/s11704-024-40555-y). If you find any relevant academic papers that have not been included in our research, please submit a request for an update. We welcome contributions from everyone. If any suggestions or mistakes, please feel free to let us know via email at **derongxu@mail.ustc.edu.cn** and **chenweicw@mail.ustc.edu.cn**. We appreciate your feedback and help in improving our work. If you find our survey useful for your research, please cite the following paper: @article{xu2024large, title={Large language models for generative information extraction: A survey}, author={Xu, Derong and Chen, Wei and Peng, Wenjun and Zhang, Chao and Xu, Tong and Zhao, Xiangyu and Wu, Xian and Zheng, Yefeng and Wang, Yang and Chen, Enhong}, journal={Frontiers of Computer Science}, volume={18}, number={6}, pages={186357}, year={2024}, publisher={Springer} } ## 📒 Table of Contents - [Information Extraction tasks](#information-extraction-tasks) - [Named Entity Recognition](#named-entity-recognition) - [Relation Extraction ](#relation-extraction) - [Event Extraction](#event-extraction) - [Universal Information Extraction](#universal-information-extraction) - [Information Extraction Techniques](#information-extraction-techniques) - [Supervised Fine-tuning](#supervised-fine-tuning) - [Few-shot ](#few-shot) - [Zero-shot](#zero-shot) - [Data Augmentation](#data-augmentation) - [Prompts Design](#prompts-design) - [Constrained Decoding Generation](#constrained-decoding-generation) - [Specific Domain](#specific-domain) - [Evaluation and Analysis](#evaluation-and-analysis) - [Project and Toolkit](#project-and-toolkit) - ⏰ [Recently Updated Papers](#recently-updated-papers) (After 2024/09/04, the updated papers is here~) - ⭐️ [Datasets](#datasets) (with Download Link~) ## 💡 News - **Update Logs** - The details can be find in ./update_new_papers_list. - **2024/09/04** Add 22 papers - **2024/06/06** Add 41 papers - **2024/03/30** Add 27 papers - **2024/03/29** Add 20 papers # Information Extraction tasks A taxonomy by various tasks. ## Named Entity Recognition Models targeting only ner tasks. ### Entity Typing | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing](https://aclanthology.org/2023.findings-emnlp.1040/) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/yanlinf/CASENT) | | [Generative Entity Typing with Curriculum Learning](https://arxiv.org/abs/2210.02914) | EMNLP | 2022-12 | [GitHub](https://github.com/siyuyuan/GET) | ### Entity Identification & Typing | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Granular Entity Mapper: Advancing Fine-grained Multimodal Named Entity Recognition and Grounding](https://aclanthology.org/2024.findings-emnlp.183/) | EMNLP Findings | 2024 | | | [Double-Checker: Large Language Model as a Checker for Few-shot Named Entity Recognition](https://aclanthology.org/2024.findings-emnlp.180/) | EMNLP Findings | 2024 | [GitHub](https://github.com/fanshu6hao/Double-Checker) | | [VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models](https://aclanthology.org/2024.acl-long.134/) | ACL | 2024 | [GitHub](https://github.com/emseoyk/VerifiNER) | | [ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models](https://aclanthology.org/2024.findings-acl.947/) | ACL Findings | 2024 | [GitHub](https://github.com/StefanHeng/ProgGen) | | [Rethinking Negative Instances for Generative Named Entity Recognition](https://aclanthology.org/2024.findings-acl.206/) | ACL Findings | 2024 | [GitHub](https://github.com/yyDing1/GNER) | | [LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition](https://aclanthology.org/2024.findings-acl.76/) | ACL Findings | 2024 | [GitHub](https://github.com/JinYuanLi0012/RiVEG) | | [RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition](https://academic.oup.com/jamia/advance-article/doi/10.1093/jamia/ocae095/7665312) | Others | 2024-05 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) | | [P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models](https://arxiv.org/abs/2405.04960) | Arxiv | 2024-06 | [GitHub](https://github.com/jiangguochaoGG/P-ICL) | | [Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?](https://arxiv.org/abs/2405.02602) | Arxiv | 2024-05 | []() | | [Know-Adapter: Towards Knowledge-Aware Parameter-Efficient Transfer Learning for Few-shot Named Entity Recognition](https://aclanthology.org/2024.lrec-main.854/) | COLING | 2024 | []() | | [ToNER: Type-oriented Named Entity Recognition with Generative Language Model](https://aclanthology.org/2024.lrec-main.1412.pdf) | COLING | 2024 | []() | | [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | COLING | 2024 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) | | [Astronomical Knowledge Entity Extraction in Astrophysics Journal Articles via Large Language Models](https://iopscience.iop.org/article/10.1088/1674-4527/ad3d15/meta) | Others | 2024-04 | []() | | [LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking](https://arxiv.org/abs/2404.05624) | Arxiv | 2024-04 | [GitHub](https://github.com/YFR718/LTNER) | | [Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models](https://arxiv.org/pdf/2404.05587) | Others | 2024-04 | []() | | [Knowledge-Enriched Prompt for Low-Resource Named Entity Recognition](https://dl.acm.org/doi/abs/10.1145/3659948) | TALLIP | 2024-04 | []() | | [VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity Recognition](https://arxiv.org/abs/2404.17835) | Arxiv | 2024-04 | [GitHub](https://github.com/Eulring/VANER) | | [LLMs in Biomedicine: A study on clinical Named Entity Recognition](https://arxiv.org/pdf/2404.07376) | Arxiv | 2024-04 | []() | | [Out of Sesame Street: A Study of Portuguese Legal Named Entity Recognition Through In-Context Learning](https://www.researchgate.net/profile/Rafael-Nunes-35/publication/379665297_Out_of_Sesame_Street_A_Study_of_Portuguese_Legal_Named_Entity_Recognition_Through_In-Context_Learning/links/6614701839e7641c0ba6879b/Out-of-Sesame-Street-A-Study-of-Portuguese-Legal-Named-Entity-Recognition-Through-In-Context-Learning.pdf) | ResearchGate | 2024-04 | []() | | [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) | | [LinkNER: Linking Local Named Entity Recognition Models to Large Language Models using Uncertainty](https://arxiv.org/abs/2402.10573) | WWW | 2024 | | [Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models](https://aclanthology.org/2024.naacl-short.49/) | NAACL Short | 2024 | [GitHub](https://github.com/Emma1066/Self-Improve-Zero-Shot-NER) | | [On-the-fly Definition Augmentation of LLMs for Biomedical NER](https://arxiv.org/abs/2404.00152) | NAACL | 2024 | [GitHub](https://github.com/allenai/beacon) | | [MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks](https://arxiv.org/abs/2404.00457) | Arxiv | 2024-03 | [GitHub](https://github.com/KomeijiForce/MetaIE) | | [Distilling Named Entity Recognition Models for Endangered Species from Large Language Models](https://arxiv.org/abs/2403.15430) | Arxiv | 2024-03 | []() | | [Augmenting NER Datasets with LLMs: Towards Automated and Refined Annotation](https://arxiv.org/abs/2404.01334) | Arxiv | 2024-03 | []() | | [ConsistNER: Towards Instructive NER Demonstrations for LLMs with the Consistency of Ontology and Context](https://ojs.aaai.org/index.php/AAAI/article/view/29892)| AAAI | 2024 | | [Embedded Named Entity Recognition using Probing Classifiers](https://arxiv.org/abs/2403.11747) | Arxiv | 2024-03 | [GitHub](https://github.com/nicpopovic/EMBER) | | [In-Context Learning for Few-Shot Nested Named Entity Recognition](https://arxiv.org/abs/2402.01182) | Arxiv | 2024-02 | []() | | [LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition](https://arxiv.org/abs/2402.14568) | Arxiv | 2024-02 | []() | | [Structured information extraction from scientific text with large language models](https://www.nature.com/articles/s41467-024-45563-x) | Nature Communications | 2024-02 | [GitHub](https://github.com/lbnlp/nerre-llama) | | [NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data](https://arxiv.org/abs/2402.15343) | Arxiv | 2024-02 | | [A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction](https://arxiv.org/abs/2402.13364) | Arxiv | 2024-02 | | [PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition](https://arxiv.org/abs/2402.04838) | Arxiv | 2024-02 | | [Small Language Model Is a Good Guide for Large Language Model in Chinese Entity Relation Extraction](https://arxiv.org/abs/2402.14373) | Arxiv | 2024-02 | | | [C-ICL: Contrastive In-context Learning for Information Extraction](https://arxiv.org/abs/2402.11254) | Arxiv | 2024-02 | | [UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition](https://openreview.net/pdf?id=r65xfUb76p) | ICLR | 2024 | [GitHub](https://github.com/universal-ner/universal-ner) | | [Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering](https://arxiv.org/abs/2303.16416v3) | Arxiv | 2024-01 | [GitHub](https://github.com/BIDS-Xu-Lab/Clinical_Entity_Recognition_Using_GPT_models) | | [2INER: Instructive and In-Context Learning on Few-Shot Named Entity Recognition](https://aclanthology.org/2023.findings-emnlp.259/) | EMNLP Findings | 2023-12 | | | [In-context Learning for Few-shot Multimodal Named Entity Recognition](https://aclanthology.org/2023.findings-emnlp.196/) | EMNLP Findings | 2023-12 | | | [Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!](https://arxiv.org/abs/2303.08559) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/mayubo2333/LLM-IE) | | [Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset](https://arxiv.org/abs/2310.10118) | EMNLP | 2023-12 | [GitHub](https://github.com/CompNet/conivel/tree/gen) | | [LLMaAA: Making Large Language Models as Active Annotators](https://arxiv.org/abs/2310.19596) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/ridiculouz/LLMAAA) | | [Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge](https://arxiv.org/abs/2305.12212) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/JinYuanLi0012/PGIM) | | [GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer](https://arxiv.org/abs/2311.08526) | Arxiv | 2023-11 | [GitHub](https://github.com/urchade/GLiNER) | | [GPT Struct Me: Probing GPT Models on Narrative Entity Extraction](https://ieeexplore.ieee.org/abstract/document/10350182) | WI-IAT | 2023-10 | [GitHub](https://github.com/hmosousa/gpt_struct_me) | | [GPT-NER: Named Entity Recognition via Large Language Models](https://arxiv.org/abs/2304.10428) | Arxiv | 2023-10 | [GitHub](https://github.com/ShuheWang1998/GPT-NER) | | [Prompt-NER: Zero-shot Named Entity Recognition in Astronomy Literature via Large Language Models](https://arxiv.org/abs/2310.17892) | Arxiv | 2023-10 | | | [Inspire the Large Language Model by External Knowledge on BioMedical Named Entity Recognition](https://arxiv.org/abs/2309.12278) | Arxiv | 2023-09 | | | [One Model for All Domains: Collaborative Domain-Prefx Tuning for Cross-Domain NER](https://arxiv.org/abs/2301.10410) | IJCAI | 2023-09 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/ner/cross) | | [Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction](https://arxiv.org/abs/2306.14122) | Arxiv | 2023-08 | | | [Learning In-context Learning for Named Entity Recognition ](https://aclanthology.org/2023.acl-long.764/) | ACL | 2023-07 | [GitHub](https://github.com/chen700564/metaner-icl) | | [Debiasing Generative Named Entity Recognition by Calibrating Sequence Likelihood](https://aclanthology.org/2023.acl-short.98/) | ACL Short | 2023-07 | | | [Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks](https://aclanthology.org/2023.findings-acl.578/) | ACL Findings | 2023-07 | | | [Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction](https://aclanthology.org/2023.bionlp-1.15/) | BioNLP | 2023-07 | [GitHub](https://github.com/arkhn/bio-nlp2023) | | [NAG-NER: a Unified Non-Autoregressive Generation Framework for Various NER Tasks](https://aclanthology.org/2023.acl-industry.65/) | ACL Industry | 2023-07 | | [Unified Named Entity Recognition as Multi-Label Sequence Generation](https://ieeexplore.ieee.org/abstract/document/10191921) | IJCNN | 2023-06 | | [PromptNER : Prompting For Named Entity Recognition](https://arxiv.org/abs/2305.15444) | Arxiv | 2023-06 | | | [Does Synthetic Data Generation of LLMs Help Clinical Text Mining?](https://arxiv.org/abs/2303.04360) | Arxiv | 2023-04 | | | [Unified Text Structuralization with Instruction-tuned Language Models](https://arxiv.org/abs/2303.14956) | Arxiv | 2023-03 | []() | | [Structured information extraction from complex scientific text with fine-tuned large language models](https://arxiv.org/abs/2212.05238) | Arxiv | 2022-12 | [Demo](http://www.matscholar.com/info-extraction) | | [LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting](https://aclanthology.org/2022.coling-1.209/) | COLING | 2022-10 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/ner/few-shot) | | [De-bias for generative extraction in unified NER task](https://aclanthology.org/2022.acl-long.59.pdf) | ACL | 2022-05 | | | [InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER](https://arxiv.org/abs/2203.03903) | Arxiv | 2022-03 | | | [Document-level Entity-based Extraction as Template Generation](https://aclanthology.org/2021.emnlp-main.426/) | EMNLP | 2021-11 | [GitHub](https://github.com/PlusLabNLP/TempGen) | | [A Unified Generative Framework for Various NER Subtasks](https://arxiv.org/abs/2106.01223) | ACL | 2021-08 | [GitHub](https://github.com/yhcc/BARTNER) | | [Template-Based Named Entity Recognition Using BART](https://aclanthology.org/2021.findings-acl.161.pdf) | ACL Findings | 2021-08 | [GitHub](https://github.com/Nealcly/templateNER) | ## Relation Extraction Models targeting only RE tasks. ### Relation Classification | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models](https://arxiv.org/pdf/2404.05587) | Others | 2024-04 | []() | | [CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model](https://arxiv.org/abs/2404.18085) | Arxiv | 2024-04 | [GitHub](https://github.com/SkyuForever/CRE-LLM) | | [Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction](https://arxiv.org/abs/2404.17809) | IJCAI | 2024-04 | []() | | [Empirical Analysis of Dialogue Relation Extraction with Large Language Models](https://arxiv.org/abs/2404.17802) | IJCAI | 2024-04 | []() | | [Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors](https://arxiv.org/abs/2404.17807) | IJCAI | 2024-04 | []() | | [Retrieval-Augmented Generation-based Relation Extraction](https://arxiv.org/abs/2404.13397) | Arxiv | 2024-04 | [GitHub](https://github.com/sefeoglu/RAG4RE) | | [Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations](https://arxiv.org/pdf/2404.05415) | Arxiv | 2024-04 | []() | | [STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models](https://ojs.aaai.org/index.php/AAAI/article/view/29839) | AAAI | 2024-03 | | | [Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction](https://arxiv.org/abs/2402.11142) | Arxiv | 2024-02 | | [Chain of Thought with Explicit Evidence Reasoning for Few-shot Relation Extraction](https://aclanthology.org/2023.findings-emnlp.153/) | EMNLP Findings | 2023-12 | | | [GPT-RE: In-context Learning for Relation Extraction using Large Language Models](https://arxiv.org/abs/2305.02105) | EMNLP | 2023-12 | [GitHub](https://github.com/YukinoWan/GPT-RE) | | [Guideline Learning for In-context Information Extraction](https://arxiv.org/abs/2310.05066) | EMNLP | 2023-12 | | | [Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!](https://arxiv.org/abs/2303.08559) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/mayubo2333/LLM-IE) | | [LLMaAA: Making Large Language Models as Active Annotators](https://arxiv.org/abs/2310.19596) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/ridiculouz/LLMAAA) | | [Improving Unsupervised Relation Extraction by Augmenting Diverse Sentence Pairs](https://arxiv.org/abs/2312.00552) | EMNLP | 2023-12 | [GitHub](https://github.com/qingwang-isu/AugURE) | | [Revisiting Large Language Models as Zero-shot Relation Extractors](https://arxiv.org/abs/2310.05028) | EMNLP Findings | 2023-12 | | | [Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment](https://arxiv.org/abs/2310.10590) | Arxiv | 2023-10 | | [Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors](https://aclanthology.org/2023.findings-acl.50.pdf) | ACL Findings | 2023-07 | [GitHub](https://github.com/OSU-NLP-Group/QA4RE) | | [How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?](https://arxiv.org/abs/2305.01555) | ACL Workshop | 2023-07 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/UnleashLLMRE) | | [Sequence generation with label augmentation for relation extraction](https://ojs.aaai.org/index.php/AAAI/article/view/26532)| AAAI | 2023-06 | [GitHub](https://github.com/pkuserc/RELA) | | [Does Synthetic Data Generation of LLMs Help Clinical Text Mining?](https://arxiv.org/abs/2303.04360) | Arxiv | 2023-04 | | | [DORE: Document Ordered Relation Extraction based on Generative Framework](https://aclanthology.org/2022.findings-emnlp.253/) | EMNLP Findings | 2022-12 | | [REBEL: Relation Extraction By End-to-end Language generation](https://aclanthology.org/2021.findings-emnlp.204/) | EMNLP Findings | 2021-11 | [GitHub](https://github.com/babelscape/rebel) | ### Relation Triplet | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis](https://aclanthology.org/2024.acl-long.476/) | ACL | 2024 | [GitHub](https://github.com/OceannTwT/era-cot) | | [AutoRE: Document-Level Relation Extraction with Large Language Models](https://aclanthology.org/2024.acl-demos.20/) | ACL Demos | 2024 | [GitHub](https://github.com/bigdante/AutoRE) | | [Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors](https://arxiv.org/abs/2404.17807) | IJCAI | 2024-04 | []() | | [Consistency Guided Knowledge Retrieval and Denoising in LLMs for Zero-shot Document-level Relation Triplet Extraction](https://arxiv.org/abs/2401.13598) | WWW | 2024 | | [Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction](https://aclanthology.org/2024.lrec-main.778/) | COLING | 2024 | [GitHub](https://github.com/Ding-Papa/Evaluating-filtering-coling24) | | [Unlocking Instructive In-Context Learning with Tabular Prompting for Relational Triple Extraction](https://aclanthology.org/2024.lrec-main.1488/) | COLING | 2024 | | | [A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction](https://arxiv.org/abs/2402.13364) | Arxiv | 2024-02 | | [Structured information extraction from scientific text with large language models](https://www.nature.com/articles/s41467-024-45563-x) | Nature Communications | 2024-02 | [GitHub](https://github.com/lbnlp/nerre-llama) | | [Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models](https://arxiv.org/abs/2310.11085) | Arxiv | 2024-02 | [GitHub](https://github.com/oezyurty/REPLM) | | [Small Language Model Is a Good Guide for Large Language Model in Chinese Entity Relation Extraction](https://arxiv.org/abs/2402.14373) | Arxiv | 2024-02 | | | [Efficient Data Learning for Open Information Extraction with Pre-trained Language Models](https://aclanthology.org/2023.findings-emnlp.869/) | EMNLP Findings | 2023-12 | | [Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment](https://arxiv.org/abs/2310.10590) | Arxiv | 2023-10 | | [Unified Text Structuralization with Instruction-tuned Language Models](https://arxiv.org/abs/2303.14956) | Arxiv | 2023-03 | []() | | [Document-level Entity-based Extraction as Template Generation](https://aclanthology.org/2021.emnlp-main.426/) | EMNLP | 2021-11 | [GitHub](https://github.com/PlusLabNLP/TempGen) | ### Relation Strict | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks](https://arxiv.org/abs/2404.00457) | Arxiv | 2024-03 | [GitHub](https://github.com/KomeijiForce/MetaIE) | | [Distilling Named Entity Recognition Models for Endangered Species from Large Language Models](https://arxiv.org/abs/2403.15430) | Arxiv | 2024-03 | []() | | [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | COLING | 2024-03 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) | | [An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction](https://ojs.aaai.org/index.php/AAAI/article/view/29919) | AAAI | 2024-03 | [GitHub](https://github.com/urchade/ATG) | | [C-ICL: Contrastive In-context Learning for Information Extraction](https://arxiv.org/abs/2402.11254) | Arxiv | 2024-02 | | [REBEL: Relation Extraction By End-to-end Language generation](https://aclanthology.org/2021.findings-emnlp.204/) | EMNLP Findings | 2021-11 | [GitHub](https://github.com/babelscape/rebel) | ## Event Extraction Models targeting only EE tasks. ### Event Detection | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Improving Event Definition Following For Zero-Shot Event Detection](https://arxiv.org/abs/2403.02586) | Arxiv | 2024-03 | []() | | [Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment](https://arxiv.org/abs/2310.10590) | Arxiv | 2023-10 | | [Unified Text Structuralization with Instruction-tuned Language Models](https://arxiv.org/abs/2303.14956) | Arxiv | 2023-03 | []() | | [Unleash GPT-2 Power for Event Detection](https://aclanthology.org/2021.acl-long.490.pdf) | ACL | 2021-08 | []() | ### Event Argument Extraction | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [LLMs Learn Task Heuristics from Demonstrations: A Heuristic-Driven Prompting Strategy for Document-Level Event Argument Extraction](https://aclanthology.org/2024.acl-long.647/) | ACL | 2024 | [GitHub](https://github.com/hzzhou01/HD-LoA-Prompting) | | [Beyond Single-Event Extraction: Towards Efficient Document-Level Multi-Event Argument Extraction](https://aclanthology.org/2024.findings-acl.564/) | ACL Findings | 2024 | [GitHub](https://github.com/LWL-cpu/DEEIA) | | [KeyEE: Enhancing Low-Resource Generative Event Extraction with Auxiliary Keyword Sub-Prompt](https://ieeexplore.ieee.org/abstract/document/10506770) | Others | 2024-04 | [GitHub](https://github.com/OStars/KeyEE) | | [MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks](https://arxiv.org/abs/2404.00457) | Arxiv | 2024-03 | [GitHub](https://github.com/KomeijiForce/MetaIE) | | [Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study](https://arxiv.org/abs/2402.15663) | EACL | 2024-02 | [GitHub](https://github.com/ZhaoyueSun/phee-with-chatgpt) | | [ULTRA: Unleash LLMs' Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Refinement](https://arxiv.org/abs/2401.13218) | Arxiv | 2024-01 | []() | | [Context-Aware Prompt for Generation-based Event Argument Extraction with Diffusion Models](https://dl.acm.org/doi/10.1145/3583780.3614820) | CIKM | 2023-10 | []() | | [Contextualized Soft Prompts for Extraction of Event Arguments](https://aclanthology.org/2023.findings-acl.266/) | ACL Findings | 2023-07 | | | [AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model](https://aclanthology.org/2023.acl-long.615/) | ACL | 2023-07 | [GitHub](https://github.com/PlusLabNLP/AMPERE) | | [Code4Struct: Code Generation for Few-Shot Event Structure Prediction](https://arxiv.org/abs/2210.12810) | ACL | 2023-07 | [GitHub](https://github.com/xingyaoww/code4struct) | | [Event Extraction as Question Generation and Answering](https://aclanthology.org/2023.acl-short.143.pdf) | ACL short | 2023-07 | [GitHub](https://github.com/dataminr-ai/Event-Extraction-as-Question-Generation-and-Answering) | | [Global Constraints with Prompting for Zero-Shot Event Argument Classification](https://aclanthology.org/2023.findings-eacl.191/) | EACL Findings | 2023-05 | | | [Prompt for extraction? PAIE: prompting argument interaction for event argument extraction](https://aclanthology.org/2022.acl-long.466/) | ACL | 2022-05 | [GitHub](https://github.com/mayubo2333/PAIE) | ### Event Detection & Argument Extraction | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction](https://aclanthology.org/2024.findings-acl.760/) | ACL Findings | 2024 | [GitHub](https://github.com/ej0cl6/TextEE) | | [EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models](https://arxiv.org/abs/2402.11430) | Arxiv | 2024-02 | | [Guideline Learning for In-context Information Extraction](https://arxiv.org/abs/2310.05066) | EMNLP | 2023-12 | | | [DemoSG: Demonstration-enhanced Schema-guided Generation for Low-resource Event Extraction](https://aclanthology.org/2023.findings-emnlp.121) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/GangZhao98/DemoSG) | | [Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!](https://arxiv.org/abs/2303.08559) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/mayubo2333/LLM-IE) | | [DICE: Data-Efficient Clinical Event Extraction with Generative Models](https://aclanthology.org/2023.acl-long.886.pdf) | ACL | 2023-07 | [GitHub](https://github.com/derekmma/DICE) | | [A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction](https://openreview.net/pdf?id=3xGnOrUqt1) | NeurIPS Workshop | 2023-10 | []() | | [STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models](https://ojs.aaai.org/index.php/AAAI/article/view/29839) | AAAI | 2024-03 | []() | | [DEGREE: A Data-Efficient Generative Event Extraction Model](https://aclanthology.org/2022.naacl-main.138/) | NAACL | 2022-07 | [GitHub](https://github.com/PlusLabNLP/DEGREE) | | [ClarET: Pre-training a correlation-aware context-to-event transformer for event-centric generation and classification](https://aclanthology.org/2022.acl-long.183/) | ACL | 2022-05 | [GitHub](https://github.com/yczhou001/ClarET) | | [Dynamic prefix-tuning for generative template-based event extraction](https://aclanthology.org/2022.acl-long.358.pdf) | ACL | 2022-05 | []() | | [Text2event: Controllable sequence-to- structure generation for end-to-end event extraction](https://arxiv.org/abs/2106.09232) | ACL | 2021-08 | [GitHub](https://github.com/luyaojie/text2event) | | [Document-level event argument extraction by conditional generation](https://aclanthology.org/2021.naacl-main.69.pdf) | NAACL | 2021-06 | [GitHub](https://github.com/raspberryice/gen-arg) | ## Universal Information Extraction Unified models targeting multiple IE tasks. ### NL-LLMs based | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Diluie: constructing diverse demonstrations of in-context learning with large language model for unified information extraction](https://link.springer.com/article/10.1007/s00521-024-09728-5) | Others | 2024-04 | [GitHub](https://github.com/Phevos75/DILUIE) | | [ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models]() | COLING | 2024 | | | [YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information Extraction](https://arxiv.org/abs/2312.15548) | Arxiv | 2024-04 | | [Set Learning for Generative Information Extraction](https://aclanthology.org/2023.emnlp-main.806.pdf) | EMNLP | 2023-12 | []() | | [GIELLM: Japanese General Information Extraction Large Language Model Utilizing Mutual Reinforcement Effect](https://arxiv.org/abs/2311.06838) | Arxiv | 2023-11 | []() | | [InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction](https://arxiv.org/abs/2304.08085) | Arxiv | 2023-04 | [GitHub](https://github.com/BeyonderXX/InstructUIE) | | [Zero-Shot Information Extraction via Chatting with ChatGPT](https://arxiv.org/abs/2302.10205) | Arxiv | 2023-02 | [GitHub](https://github.com/cocacola-lab/ChatIE) | | [GenIE: Generative Information Extraction](https://aclanthology.org/2022.naacl-main.342.pdf) | NAACL | 2022-07 | [GitHub](https://github.com/epfl-dlab/GenIE) | | [DEEPSTRUCT: Pretraining of Language Models for Structure Prediction](https://aclanthology.org/2022.findings-acl.67/) | ACL Findings | 2022-05 | [GitHub](https://github.com/wang-research-lab/deepstruct) | | [Unified Structure Generation for Universal Information Extraction](https://aclanthology.org/2022.acl-long.395/) | ACL | 2022-05 | [GitHub](https://github.com/yhcc/BARTABSA) | | [Structured prediction as translation between augmented natural languages](https://arxiv.org/abs/2101.05779) | ICLR | 2021-01 | [GitHub](https://github.com/amazon-science/tanl) | ### Code-LLMs based | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction](https://aclanthology.org/2024.acl-long.475/) | ACL | 2024 | [GitHub](https://ict-goknow.github.io/knowcoder/) | | [GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction](https://openreview.net/pdf?id=Y3wpuxd7u9) | ICLR | 2024 | [GitHub](https://github.com/hitz-zentroa/GoLLIE) | | [Retrieval-Augmented Code Generation for Universal Information Extraction](https://arxiv.org/abs/2311.02962) | Arxiv | 2023-11 | []() | | [CODEIE: Large Code Generation Models are Better Few-Shot Information Extractors](https://arxiv.org/abs/2305.05711) | ACL | 2023-07 | [GitHub](https://github.com/artpli/CodeIE) | | [CodeKGC: Code Language Model for Generative Knowledge Graph Construction](https://dl.acm.org/doi/abs/10.1145/3641850) | ACM TALLIP | 2024-03 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/CodeKGC) | # Information Extraction Techniques A taxonomy by techniques. ## Supervised Fine-tuning | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Rethinking Negative Instances for Generative Named Entity Recognition](https://aclanthology.org/2024.findings-acl.206/) | ACL Findings | 2024 | [GitHub](https://github.com/yyDing1/GNER) | | [Beyond Single-Event Extraction: Towards Efficient Document-Level Multi-Event Argument Extraction](https://aclanthology.org/2024.findings-acl.564/) | ACL Findings | 2024 | [GitHub](https://github.com/LWL-cpu/DEEIA) | | [AutoRE: Document-Level Relation Extraction with Large Language Models](https://aclanthology.org/2024.acl-demos.20/) | ACL Demos | 2024 | [GitHub](https://github.com/bigdante/AutoRE) | | [Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction](https://arxiv.org/abs/2404.17809) | IJCAI | 2024-04 | []() | | [Empirical Analysis of Dialogue Relation Extraction with Large Language Models](https://arxiv.org/abs/2404.17802) | IJCAI | 2024-04 | []() | | [An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction](https://ojs.aaai.org/index.php/AAAI/article/view/29919) | AAAI | 2024 | [GitHub](https://github.com/urchade/ATG) | | [Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction](https://aclanthology.org/2024.lrec-main.778/) | COLING | 2024 | [GitHub](https://github.com/Ding-Papa/Evaluating-filtering-coling24) | | [ToNER: Type-oriented Named Entity Recognition with Generative Language Model](https://aclanthology.org/2024.lrec-main.1412.pdf) | COLING | 2024 | []() | | [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | COLING | 2024 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) | | [KeyEE: Enhancing Low-Resource Generative Event Extraction with Auxiliary Keyword Sub-Prompt](https://ieeexplore.ieee.org/abstract/document/10506770) | Others | 2024-04 | [GitHub](https://github.com/OStars/KeyEE) | | [VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity Recognition](https://arxiv.org/abs/2404.17835) | Arxiv | 2024-04 | [GitHub](https://github.com/Eulring/VANER) | | [LLMs in Biomedicine: A study on clinical Named Entity Recognition](https://arxiv.org/pdf/2404.07376) | Arxiv | 2024-04 | []() | | [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) | | [CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model](https://arxiv.org/abs/2404.18085) | Arxiv | 2024-04 | [GitHub](https://github.com/SkyuForever/CRE-LLM) | | [Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations](https://arxiv.org/pdf/2404.05415) | Arxiv | 2024-04 | []() | | [Improving Event Definition Following For Zero-Shot Event Detection](https://arxiv.org/abs/2403.02586) | Arxiv | 2024-03 | []() | | [Embedded Named Entity Recognition using Probing Classifiers](https://arxiv.org/abs/2403.11747) | Arxiv | 2024-03 | [GitHub](https://github.com/nicpopovic/EMBER) | | [EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models](https://arxiv.org/abs/2402.11430) | Arxiv | 2024-02 | | [Structured information extraction from scientific text with large language models](https://www.nature.com/articles/s41467-024-45563-x) | Nature Communications | 2024-02 | [GitHub](https://github.com/lbnlp/nerre-llama) | | [PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition](https://arxiv.org/abs/2402.04838) | Arxiv | 2024-02 | | [UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition](https://openreview.net/pdf?id=r65xfUb76p) | ICLR | 2024 | [GitHub](https://github.com/universal-ner/universal-ner) | | [GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction](https://openreview.net/pdf?id=Y3wpuxd7u9) | ICLR | 2024 | [GitHub](https://github.com/hitz-zentroa/GoLLIE) | | [Set Learning for Generative Information Extraction](https://aclanthology.org/2023.emnlp-main.806.pdf) | EMNLP | 2023-12 | []() | | [Efficient Data Learning for Open Information Extraction with Pre-trained Language Models](https://aclanthology.org/2023.findings-emnlp.869/) | EMNLP Findings | 2023-12 | | [DemoSG: Demonstration-enhanced Schema-guided Generation for Low-resource Event Extraction](https://aclanthology.org/2023.findings-emnlp.121) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/GangZhao98/DemoSG) | | [Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing](https://aclanthology.org/2023.findings-emnlp.1040/) | EMNLP Findings | 2023-12 | []() | | [GIELLM: Japanese General Information Extraction Large Language Model Utilizing Mutual Reinforcement Effect](https://arxiv.org/abs/2311.06838) | Arxiv | 2023-11 | []() | | [GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer](https://arxiv.org/abs/2311.08526) | Arxiv | 2023-11 | [GitHub](https://github.com/urchade/GLiNER) | | [Context-Aware Prompt for Generation-based Event Argument Extraction with Diffusion Models](https://dl.acm.org/doi/10.1145/3583780.3614820) | CIKM | 2023-10 | []() | | [Contextualized Soft Prompts for Extraction of Event Arguments](https://aclanthology.org/2023.findings-acl.266/) | ACL Findings | 2023-07 | | | [AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model](https://aclanthology.org/2023.acl-long.615/) | ACL | 2023-07 | [GitHub](https://github.com/PlusLabNLP/AMPERE) | | [Debiasing Generative Named Entity Recognition by Calibrating Sequence Likelihood](https://aclanthology.org/2023.acl-short.98/) | ACL short | 2023-07 | []() | | [DICE: Data-Efficient Clinical Event Extraction with Generative Models](https://aclanthology.org/2023.acl-long.886.pdf) | ACL | 2023-07 | [GitHub](https://github.com/derekmma/DICE) | | [Event Extraction as Question Generation and Answering](https://aclanthology.org/2023.acl-short.143.pdf) | ACL short | 2023-07 | [GitHub](https://github.com/dataminr-ai/Event-Extraction-as-Question-Generation-and-Answering) | | [NAG-NER: a Unified Non-Autoregressive Generation Framework for Various NER Tasks](https://aclanthology.org/2023.acl-industry.65/) | ACL Industry | 2023-07 | | [Sequence generation with label augmentation for relation extraction](https://ojs.aaai.org/index.php/AAAI/article/view/26532)| AAAI | 2023-06 | [GitHub](https://github.com/pkuserc/RELA) | | [Unified Named Entity Recognition as Multi-Label Sequence Generation](https://ieeexplore.ieee.org/abstract/document/10191921) | IJCNN | 2023-06 | | [InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction](https://arxiv.org/abs/2304.08085) | Arxiv | 2023-04 | [GitHub](https://github.com/BeyonderXX/InstructUIE) | | [Structured information extraction from complex scientific text with fine-tuned large language models](https://arxiv.org/abs/2212.05238) | Arxiv | 2022-12 | [Demo](http://www.matscholar.com/info-extraction) | | [Generative Entity Typing with Curriculum Learning](https://arxiv.org/abs/2210.02914) | EMNLP | 2022-12 | [GitHub](https://github.com/siyuyuan/GET) | | [DORE: Document Ordered Relation Extraction based on Generative Framework](https://aclanthology.org/2022.findings-emnlp.253/) | EMNLP Findings | 2022-12 | | [LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model](https://openreview.net/pdf?id=a8qX5RG36jd) | NeurIPS | 2022-10 | [GitHub](https://github.com/ChocoWu/LasUIE) | | [LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting](https://aclanthology.org/2022.coling-1.209/) | COLING | 2022-10 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/ner/few-shot) | | [GenIE: Generative Information Extraction](https://aclanthology.org/2022.naacl-main.342.pdf) | NAACL | 2022-07 | [GitHub](https://github.com/epfl-dlab/GenIE) | | [DEGREE: A Data-Efficient Generative Event Extraction Model](https://aclanthology.org/2022.naacl-main.138/) | NAACL | 2022-07 | [GitHub](https://github.com/PlusLabNLP/DEGREE) | | [ClarET: Pre-training a correlation-aware context-to-event transformer for event-centric generation and classification](https://aclanthology.org/2022.acl-long.183/) | ACL | 2022-05 | [GitHub](https://github.com/yczhou001/ClarET) | | [DEEPSTRUCT: Pretraining of Language Models for Structure Prediction](https://aclanthology.org/2022.findings-acl.67/) | ACL Findings | 2022-05 | [GitHub](https://github.com/wang-research-lab/deepstruct) | | [Dynamic prefix-tuning for generative template-based event extraction](https://aclanthology.org/2022.acl-long.358.pdf) | ACL | 2022-05 | []() | | [Prompt for extraction? PAIE: prompting argument interaction for event argument extraction](https://aclanthology.org/2022.acl-long.466/) | ACL | 2022-05 | [GitHub](https://github.com/mayubo2333/PAIE) | | [Unified Structure Generation for Universal Information Extraction](https://aclanthology.org/2022.acl-long.395/) | ACL | 2022-05 | [GitHub](https://github.com/yhcc/BARTABSA) | | [De-bias for generative extraction in unified NER task](https://aclanthology.org/2022.acl-long.59.pdf) | ACL | 2022-05 | []() | | [Document-level Entity-based Extraction as Template Generation](https://aclanthology.org/2021.emnlp-main.426/) | EMNLP | 2021-11 | [GitHub](https://github.com/PlusLabNLP/TempGen) | | [REBEL: Relation Extraction By End-to-end Language generation](https://aclanthology.org/2021.findings-emnlp.204/) | EMNLP Findings | 2021-11 | [GitHub](https://github.com/babelscape/rebel) | | [A Unified Generative Framework for Various NER Subtasks](https://arxiv.org/abs/2106.01223) | ACL | 2021-08 | [GitHub](https://github.com/yhcc/BARTNER) | | [Template-Based Named Entity Recognition Using BART](https://aclanthology.org/2021.findings-acl.161.pdf) | ACL Findings | 2021-08 | [GitHub](https://github.com/Nealcly/templateNER) | | [Text2event: Controllable sequence-to- structure generation for end-to-end event extraction](https://arxiv.org/abs/2106.09232) | ACL | 2021-08 | [GitHub](https://github.com/luyaojie/text2event) | | [Document-level event argument extraction by conditional generation](https://aclanthology.org/2021.naacl-main.69.pdf) | NAACL | 2021-06 | [GitHub](https://github.com/raspberryice/gen-arg) | | [Structured prediction as translation between augmented natural languages](https://arxiv.org/abs/2101.05779) | ICLR | 2021-01 | [GitHub](https://github.com/amazon-science/tanl) | ## Few-shot ### Few-shot Fine-tuning | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Diluie: constructing diverse demonstrations of in-context learning with large language model for unified information extraction](https://link.springer.com/article/10.1007/s00521-024-09728-5) | Others | 2024-04 | [GitHub](https://github.com/Phevos75/DILUIE) | | [KeyEE: Enhancing Low-Resource Generative Event Extraction with Auxiliary Keyword Sub-Prompt](https://ieeexplore.ieee.org/abstract/document/10506770) | Others | 2024-04 | [GitHub](https://github.com/OStars/KeyEE) | | [Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors](https://arxiv.org/abs/2404.17807) | IJCAI | 2024-04 | []() | | [On-the-fly Definition Augmentation of LLMs for Biomedical NER](https://arxiv.org/abs/2404.00152) | NAACL | 2024-03 | [GitHub](https://github.com/allenai/beacon) | | [DemoSG: Demonstration-enhanced Schema-guided Generation for Low-resource Event Extraction](https://aclanthology.org/2023.findings-emnlp.121) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/GangZhao98/DemoSG) | | [One Model for All Domains: Collaborative Domain-Prefx Tuning for Cross-Domain NER](https://arxiv.org/abs/2301.10410) | IJCAI | 2023-09 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/ner/cross) | | [LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting](https://aclanthology.org/2022.coling-1.209/) | COLING | 2022-10 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/ner/few-shot) | | [Unified Structure Generation for Universal Information Extraction](https://aclanthology.org/2022.acl-long.395/) | ACL | 2022-05 | [GitHub](https://github.com/yhcc/BARTABSA) | | [InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER](https://arxiv.org/abs/2203.03903) | Arxiv | 2022-03 | | | [Template-Based Named Entity Recognition Using BART](https://aclanthology.org/2021.findings-acl.161.pdf) | ACL Findings | 2021-08 | [GitHub](https://github.com/Nealcly/templateNER) | | [Structured prediction as translation between augmented natural languages](https://arxiv.org/abs/2101.05779) | ICLR | 2021-01 | [GitHub](https://github.com/amazon-science/tanl) | ### In-Context Learning | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction](https://aclanthology.org/2024.findings-acl.760/) | ACL Findings | 2024 | [GitHub](https://github.com/ej0cl6/TextEE) | | [RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition](https://academic.oup.com/jamia/advance-article/doi/10.1093/jamia/ocae095/7665312) | Others | 2024-05 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) | | [P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models](https://arxiv.org/abs/2405.04960) | Arxiv | 2024-06 | [GitHub](https://github.com/jiangguochaoGG/P-ICL) | | [LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking](https://arxiv.org/abs/2404.05624) | Arxiv | 2024-04 | [GitHub](https://github.com/YFR718/LTNER) | | [Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models](https://arxiv.org/pdf/2404.05587) | Others | 2024-04 | []() | | [LLMs in Biomedicine: A study on clinical Named Entity Recognition](https://arxiv.org/pdf/2404.07376) | Arxiv | 2024-04 | []() | | [Out of Sesame Street: A Study of Portuguese Legal Named Entity Recognition Through In-Context Learning](https://www.researchgate.net/profile/Rafael-Nunes-35/publication/379665297_Out_of_Sesame_Street_A_Study_of_Portuguese_Legal_Named_Entity_Recognition_Through_In-Context_Learning/links/6614701839e7641c0ba6879b/Out-of-Sesame-Street-A-Study-of-Portuguese-Legal-Named-Entity-Recognition-Through-In-Context-Learning.pdf) | ResearchGate | 2024-04 | []() | | [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) | | [Empirical Analysis of Dialogue Relation Extraction with Large Language Models](https://arxiv.org/abs/2404.17802) | IJCAI | 2024-04 | []() | | [Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models](https://aclanthology.org/2024.naacl-short.49/) | NAACL Short | 2024 | [GitHub](https://github.com/Emma1066/Self-Improve-Zero-Shot-NER) | | [ConsistNER: Towards Instructive NER Demonstrations for LLMs with the Consistency of Ontology and Context](https://ojs.aaai.org/index.php/AAAI/article/view/29892)| AAAI | 2024 | | [On-the-fly Definition Augmentation of LLMs for Biomedical NER](https://arxiv.org/abs/2404.00152) | NAACL | 2024 | [GitHub](https://github.com/allenai/beacon) | | [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | COLING | 2024 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) | | [Unlocking Instructive In-Context Learning with Tabular Prompting for Relational Triple Extraction](https://aclanthology.org/2024.lrec-main.1488/) | COLING | 2024 | | | [CodeKGC: Code Language Model for Generative Knowledge Graph Construction](https://dl.acm.org/doi/abs/10.1145/3641850) | ACM TALLIP | 2024-03 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/CodeKGC) | | [Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models](https://arxiv.org/abs/2310.11085) | Arxiv | 2024-02 | [GitHub](https://github.com/oezyurty/REPLM) | | [In-Context Learning for Few-Shot Nested Named Entity Recognition](https://arxiv.org/abs/2402.01182) | Arxiv | 2024-02 | []() | | [Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study](https://arxiv.org/abs/2402.15663) | EACL | 2024-02 | [GitHub](https://github.com/ZhaoyueSun/phee-with-chatgpt) | | [Heuristic-Driven Link-of-Analogy Prompting: Enhancing Large Language Models for Document-Level Event Argument Extraction](https://arxiv.org/abs/2311.06555v2) | Arxiv | 2024-02 | []() | | [LinkNER: Linking Local Named Entity Recognition Models to Large Language Models using Uncertainty](https://arxiv.org/abs/2402.10573) | WWW | 2024 | | [Small Language Model Is a Good Guide for Large Language Model in Chinese Entity Relation Extraction](https://arxiv.org/abs/2402.14373) | Arxiv | 2024-02 | | | [C-ICL: Contrastive In-context Learning for Information Extraction](https://arxiv.org/abs/2402.11254) | Arxiv | 2024-02 | | [Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering](https://arxiv.org/abs/2303.16416v3) | Arxiv | 2024-01 | [GitHub](https://github.com/BIDS-Xu-Lab/Clinical_Entity_Recognition_Using_GPT_models) | | [Chain of Thought with Explicit Evidence Reasoning for Few-shot Relation Extraction](https://aclanthology.org/2023.findings-emnlp.153/) | EMNLP Findings | 2023-12 | | | [GPT-RE: In-context Learning for Relation Extraction using Large Language Models](https://arxiv.org/abs/2305.02105) | EMNLP | 2023-12 | [GitHub](https://github.com/YukinoWan/GPT-RE) | | [Guideline Learning for In-context Information Extraction](https://arxiv.org/abs/2310.05066) | EMNLP | 2023-12 | []() | | [Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!](https://arxiv.org/abs/2303.08559) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/mayubo2333/LLM-IE) | | [Retrieval-Augmented Code Generation for Universal Information Extraction](https://arxiv.org/abs/2311.02962) | Arxiv | 2023-11 | []() | | [Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment](https://arxiv.org/abs/2310.10590) | Arxiv | 2023-10 | | [GPT-NER: Named Entity Recognition via Large Language Models](https://arxiv.org/abs/2304.10428) | Arxiv | 2023-10 | [GitHub](https://github.com/ShuheWang1998/GPT-NER) | | [GPT Struct Me: Probing GPT Models on Narrative Entity Extraction](https://ieeexplore.ieee.org/abstract/document/10350182) | WI-IAT | 2023-10 | [GitHub](https://github.com/hmosousa/gpt_struct_me) | | [Learning In-context Learning for Named Entity Recognition ](https://aclanthology.org/2023.acl-long.764/) | ACL | 2023-07 | [GitHub](https://github.com/chen700564/metaner-icl) | | [Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors](https://aclanthology.org/2023.findings-acl.50.pdf) | ACL Findings | 2023-07 | [GitHub](https://github.com/OSU-NLP-Group/QA4RE) | | [Code4Struct: Code Generation for Few-Shot Event Structure Prediction](https://arxiv.org/abs/2210.12810) | ACL | 2023-07 | [GitHub](https://github.com/xingyaoww/code4struct) | | [CODEIE: Large Code Generation Models are Better Few-Shot Information Extractors](https://arxiv.org/abs/2305.05711) | ACL | 2023-07 | [GitHub](https://github.com/artpli/CodeIE) | | [How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?](https://arxiv.org/abs/2305.01555) | ACL Workshop | 2023-07 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/UnleashLLMRE) | | [PromptNER : Prompting For Named Entity Recognition](https://arxiv.org/abs/2305.15444) | Arxiv | 2023-06 | [GitHub](https://github.com/tricktreat/PromptNER) | | [Unified Text Structuralization with Instruction-tuned Language Models](https://arxiv.org/abs/2303.14956) | Arxiv | 2023-03 | []() | ## Zero-shot ### Zero-shot Prompting | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis](https://aclanthology.org/2024.acl-long.476/) | ACL | 2024 | [GitHub](https://github.com/OceannTwT/era-cot) | | [Astronomical Knowledge Entity Extraction in Astrophysics Journal Articles via Large Language Models](https://iopscience.iop.org/article/10.1088/1674-4527/ad3d15/meta) | Others | 2024-04 | []() | | [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) | | [Empirical Analysis of Dialogue Relation Extraction with Large Language Models](https://arxiv.org/abs/2404.17802) | IJCAI | 2024-04 | []() | | [Retrieval-Augmented Generation-based Relation Extraction](https://arxiv.org/abs/2404.13397) | Arxiv | 2024-04 | [GitHub](https://github.com/sefeoglu/RAG4RE) | | [Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations](https://arxiv.org/pdf/2404.05415) | Arxiv | 2024-04 | []() | | [Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors](https://arxiv.org/abs/2404.17807) | IJCAI | 2024-04 | []() | | [Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models](https://aclanthology.org/2024.naacl-short.49/) | NAACL Short | 2024 | [GitHub](https://github.com/Emma1066/Self-Improve-Zero-Shot-NER) | | [CodeKGC: Code Language Model for Generative Knowledge Graph Construction](https://dl.acm.org/doi/abs/10.1145/3641850) | ACM TALLIP | 2024-03 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/CodeKGC) | | [On-the-fly Definition Augmentation of LLMs for Biomedical NER](https://arxiv.org/abs/2404.00152) | NAACL | 2024-03 | [GitHub](https://github.com/allenai/beacon) | | [Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study](https://arxiv.org/abs/2402.15663) | EACL | 2024-02 | [GitHub](https://github.com/ZhaoyueSun/phee-with-chatgpt) | | [A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction](https://arxiv.org/abs/2402.13364) | Arxiv | 2024-02 | | [Small Language Model Is a Good Guide for Large Language Model in Chinese Entity Relation Extraction](https://arxiv.org/abs/2402.14373) | Arxiv | 2024-02 | | | [Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering](https://arxiv.org/abs/2303.16416v3) | Arxiv | 2024-01 | [GitHub](https://github.com/BIDS-Xu-Lab/Clinical_Entity_Recognition_Using_GPT_models) | | [Improving Unsupervised Relation Extraction by Augmenting Diverse Sentence Pairs](https://arxiv.org/abs/2312.00552) | EMNLP | 2023-12 | [GitHub](https://github.com/qingwang-isu/AugURE) | | [Prompt-NER: Zero-shot Named Entity Recognition in Astronomy Literature via Large Language Models](https://arxiv.org/abs/2310.17892) | Arxiv | 2023-10 | | | [Revisiting Large Language Models as Zero-shot Relation Extractors](https://arxiv.org/abs/2310.05028) | EMNLP Findings | 2023-10 | []() | | [Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors](https://aclanthology.org/2023.findings-acl.50.pdf) | ACL Findings | 2023-07 | [GitHub](https://github.com/OSU-NLP-Group/QA4RE) | | [Code4Struct: Code Generation for Few-Shot Event Structure Prediction](https://arxiv.org/abs/2210.12810) | ACL | 2023-07 | [GitHub](https://github.com/xingyaoww/code4struct) | | [A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction](https://openreview.net/pdf?id=3xGnOrUqt1) | NeurIPS Workshop | 2023-10 | []() | | [Global Constraints with Prompting for Zero-Shot Event Argument Classification](https://aclanthology.org/2023.findings-eacl.191/) | EACL Findings | 2023-05 | | | [Zero-Shot Information Extraction via Chatting with ChatGPT](https://arxiv.org/abs/2302.10205) | Arxiv | 2023-02 | [GitHub](https://github.com/cocacola-lab/ChatIE) | ### Cross-Domain Learning | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction](https://aclanthology.org/2024.acl-long.475/) | ACL | 2024 | [GitHub](https://ict-goknow.github.io/knowcoder/) | | [VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models](https://aclanthology.org/2024.acl-long.134/) | ACL | 2024 | [GitHub](https://github.com/emseoyk/VerifiNER) | | [Rethinking Negative Instances for Generative Named Entity Recognition](https://aclanthology.org/2024.findings-acl.206/) | ACL Findings | 2024 | [GitHub](https://github.com/yyDing1/GNER) | | [IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus](https://aclanthology.org/2024.acl-short.13/) | ACL Short | 2024 | [GitHub](https://github.com/zjunlp/IEPile?tab=readme-ov-file) | | [Diluie: constructing diverse demonstrations of in-context learning with large language model for unified information extraction](https://link.springer.com/article/10.1007/s00521-024-09728-5) | Others | 2024-04 | [GitHub](https://github.com/Phevos75/DILUIE) | | [Advancing Entity Recognition in Biomedicine via Instruction Tuning of Large Language Models]() | Bioinformatics | 2024-03 | [GitHub](https://github.com/BIDS-Xu-Lab/BioNER-LLaMA) | | [ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models]() | COLING | 2024 | | | [ULTRA: Unleash LLMs' Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Refinement](https://arxiv.org/abs/2401.13218) | Arxiv | 2024-01 | []() | | [YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information Extraction](https://arxiv.org/abs/2312.15548) | Arxiv | 2024-04 | | [GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction](https://openreview.net/pdf?id=Y3wpuxd7u9) | ICLR | 2024 | [GitHub](https://github.com/hitz-zentroa/GoLLIE) | | [UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition](https://openreview.net/pdf?id=r65xfUb76p) | ICLR | 2024 | [GitHub](https://github.com/universal-ner/universal-ner) | | [InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction](https://arxiv.org/abs/2304.08085) | Arxiv | 2023-04 | [GitHub](https://github.com/BeyonderXX/InstructUIE) | | [DEEPSTRUCT: Pretraining of Language Models for Structure Prediction](https://aclanthology.org/2022.findings-acl.67/) | ACL Findings | 2022-05 | [GitHub](https://github.com/wang-research-lab/deepstruct) | | [Multilingual generative language models for zero-shot cross-lingual event argument extraction](https://aclanthology.org/2022.acl-long.317.pdf) | ACL | 2022-05 | [GitHub](https://github.com/PlusLabNLP/X-Gear) | ### Cross-Type Learning | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Document-level event argument extraction by conditional generation](https://aclanthology.org/2021.naacl-main.69.pdf) | NAACL | 2021-06 | [GitHub](https://github.com/raspberryice/gen-arg) | ## Data Augmentation ### Data Annotation | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?](https://arxiv.org/abs/2405.02602) | Arxiv | 2024-05 | []() | | [MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks](https://arxiv.org/abs/2404.00457) | Arxiv | 2024-03 | [GitHub](https://github.com/KomeijiForce/MetaIE) | | [Augmenting NER Datasets with LLMs: Towards Automated and Refined Annotation](https://arxiv.org/abs/2404.01334) | Arxiv | 2024-03 | []() | | [NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data](https://arxiv.org/abs/2402.15343) | Arxiv | 2024-02 | | [Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study](https://arxiv.org/abs/2402.15663) | EACL | 2024-02 | [GitHub](https://github.com/ZhaoyueSun/phee-with-chatgpt) | | [LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition](https://arxiv.org/abs/2402.14568) | Arxiv | 2024-02 | []() | | [LLMaAA: Making Large Language Models as Active Annotators](https://arxiv.org/abs/2310.19596) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/ridiculouz/LLMAAA) | | [Improving Unsupervised Relation Extraction by Augmenting Diverse Sentence Pairs](https://arxiv.org/abs/2312.00552) | EMNLP | 2023-12 | [GitHub](https://github.com/qingwang-isu/AugURE) | | [Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models](https://aclanthology.org/2023.emnlp-main.334.pdf) | EMNLP | 2023-12 | [GitHub](https://github.com/bigai-nlco/DocGNRE) | | [How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?](https://arxiv.org/abs/2305.01555) | ACL Workshop | 2023-07 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/UnleashLLMRE) | | [Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction](https://aclanthology.org/2023.bionlp-1.15/) | bioNLP Workshop | 2023-07 | [GitHub](https://github.com/arkhn/bio-nlp2023) | | [Does Synthetic Data Generation of LLMs Help Clinical Text Mining?](https://arxiv.org/abs/2303.04360) | Arxiv | 2023-04 | []() | | [Unleash GPT-2 Power for Event Detection](https://aclanthology.org/2021.acl-long.490.pdf) | ACL | 2021-08 | []() | ### Knowledge Retrieval | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition](https://aclanthology.org/2024.findings-acl.76/) | ACL Findings | 2024 | [GitHub](https://github.com/JinYuanLi0012/RiVEG) | | [Consistency Guided Knowledge Retrieval and Denoising in LLMs for Zero-shot Document-level Relation Triplet Extraction](https://arxiv.org/abs/2401.13598) | WWW | 2024 | | [Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset](https://arxiv.org/abs/2310.10118) | EMNLP | 2023-12 | [GitHub](https://github.com/CompNet/conivel/tree/gen) | | [Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge](https://arxiv.org/abs/2305.12212) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/JinYuanLi0012/PGIM) | | [Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction](https://arxiv.org/abs/2306.14122) | Arxiv | 2023-08 | []() | ### Inverse Generation | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Distilling Named Entity Recognition Models for Endangered Species from Large Language Models](https://arxiv.org/abs/2403.15430) | Arxiv | 2024-03 | []() | | [Improving Event Definition Following For Zero-Shot Event Detection](https://arxiv.org/abs/2403.02586) | Arxiv | 2024-03 | []() | | [ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models](https://aclanthology.org/2024.findings-acl.947/) | ACL Findings | 2024 | [GitHub](https://github.com/StefanHeng/ProgGen) | | [Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction](https://arxiv.org/abs/2402.11142) | Arxiv | 2024-02 | | [Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction](https://arxiv.org/abs/2303.04132) | EMNLP | 2023-12 | [GitHub](https://github.com/epfl-dlab/SynthIE) | | [Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks](https://aclanthology.org/2023.findings-acl.578/) | ACL Findings | 2023-07 | []() | | [Event Extraction as Question Generation and Answering](https://aclanthology.org/2023.acl-short.143.pdf) | ACL Short | 2023-07 | [GitHub](https://github.com/dataminr-ai/Event-Extraction-as-Question-Generation-and-Answering) | | [STAR: Boosting Low-Resource Event Extraction by Structure-to-Text Data Generation with Large Language Models](https://ojs.aaai.org/index.php/AAAI/article/view/29839) | AAAI | 2024-03 | []() | ### Synthetic Datasets for Instruction-tuning | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Rethinking Negative Instances for Generative Named Entity Recognition](https://aclanthology.org/2024.findings-acl.206/) | ACL Findings | 2024 | [GitHub](https://github.com/yyDing1/GNER) | | [UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition](https://openreview.net/pdf?id=r65xfUb76p) | ICLR | 2024-01 | [GitHub](https://github.com/universal-ner/universal-ner) | | [GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer](https://arxiv.org/abs/2311.08526) | Arxiv | 2023-11 | [GitHub](https://github.com/urchade/GLiNER) | | [Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction](https://arxiv.org/abs/2306.14122) | Arxiv | 2023-08 | | ## Prompts Design ### Question Answer | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [Knowledge-Enriched Prompt for Low-Resource Named Entity Recognition](https://dl.acm.org/doi/abs/10.1145/3659948) | TALLIP | 2024-04 | []() | | [Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models](https://arxiv.org/pdf/2404.05587) | Others | 2024-04 | []() | | [Revisiting Large Language Models as Zero-shot Relation Extractors](https://arxiv.org/abs/2310.05028) | EMNLP Findings | 2023-12 | | | [Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors](https://aclanthology.org/2023.findings-acl.50.pdf) | ACL Findings | 2023-07 | [GitHub](https://github.com/OSU-NLP-Group/QA4RE) | | [Zero-Shot Information Extraction via Chatting with ChatGPT](https://arxiv.org/abs/2302.10205) | Arxiv | 2023-02 | [GitHub](https://github.com/cocacola-lab/ChatIE) | ### Chain of Thought | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition](https://academic.oup.com/jamia/advance-article/doi/10.1093/jamia/ocae095/7665312) | Others | 2024-05 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) | | [Inspire the Large Language Model by External Knowledge on BioMedical Named Entity Recognition](https://arxiv.org/abs/2309.12278) | Arxiv | 2023-09 | | | [Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction](https://arxiv.org/abs/2306.14122) | Arxiv | 2023-08 | | | [Revisiting Relation Extraction in the era of Large Language Models](https://aclanthology.org/2023.acl-long.868.pdf) | ACL | 2023-07 | [GitHub](https://sominw.com/ACL23LLMs) | | [Zero-shot Temporal Relation Extraction with ChatGPT](https://aclanthology.org/2023.bionlp-1.7/) | BioNLP | 2023-07 | | | [PromptNER : Prompting For Named Entity Recognition](https://arxiv.org/abs/2305.15444) | Arxiv | 2023-06 | | ### Self-Improvement | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models](https://aclanthology.org/2024.findings-acl.947/) | ACL Findings | 2024 | [GitHub](https://github.com/StefanHeng/ProgGen) | | [ULTRA: Unleash LLMs' Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Refinement](https://arxiv.org/abs/2401.13218) | Arxiv | 2024-01 | []() | | [Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models](https://aclanthology.org/2024.naacl-short.49/) | NAACL Short | 2024 | [GitHub](https://github.com/Emma1066/Self-Improve-Zero-Shot-NER) | ## Constrained Decoding Generation | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction](https://ojs.aaai.org/index.php/AAAI/article/view/29919) | AAAI | 2024-03 | [GitHub](https://github.com/urchade/ATG) | | [Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning](https://aclanthology.org/2023.emnlp-main.674/) | EMNLP | 2024-01 | [GitHub](https://github.com/epfl-dlab/GCD) | | [DORE: Document Ordered Relation Extraction based on Generative Framework](https://aclanthology.org/2022.findings-emnlp.253/) | EMNLP Findings | 2022-12 | | [Autoregressive Structured Prediction with Language Models](https://aclanthology.org/2022.findings-emnlp.70/) | EMNLP Findings | 2022-12 | [GitHub](https://github.com/lyutyuh/ASP) | | [Unified Structure Generation for Universal Information Extraction](https://aclanthology.org/2022.acl-long.395/) | ACL | 2022-05 | [GitHub](https://github.com/yhcc/BARTABSA) | # Specific Domain | Paper | Domain | Venue | Date | Code | | :----- | :--------------: | :-------: | :---------: |:---------: | | [Granular Entity Mapper: Advancing Fine-grained Multimodal Named Entity Recognition and Grounding](https://aclanthology.org/2024.findings-emnlp.183/) | Multimodal |EMNLP Findings | 2024 | | | [LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition](https://aclanthology.org/2024.findings-acl.76/) | Multimodal | ACL Findings | 2024 | [GitHub](https://github.com/JinYuanLi0012/RiVEG) | | [RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition](https://academic.oup.com/jamia/advance-article/doi/10.1093/jamia/ocae095/7665312) | Medical | Others | 2024-05 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) | | [Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?](https://arxiv.org/abs/2405.02602) | Astronomy | Arxiv | 2024-05 | []() | | [Astronomical Knowledge Entity Extraction in Astrophysics Journal Articles via Large Language Models](https://iopscience.iop.org/article/10.1088/1674-4527/ad3d15/meta) | Astronomy | Others | 2024-04 | []() | | [VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity Recognition](https://arxiv.org/abs/2404.17835) | Biomedical | Arxiv | 2024-04 | [GitHub](https://github.com/Eulring/VANER) | | [LLMs in Biomedicine: A study on clinical Named Entity Recognition](https://arxiv.org/pdf/2404.07376) | Biomedical | Arxiv | 2024-04 | []() | | [Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models](https://arxiv.org/pdf/2404.05587) | Software | Others | 2024-04 | []() | | [Out of Sesame Street: A Study of Portuguese Legal Named Entity Recognition Through In-Context Learning](https://www.researchgate.net/profile/Rafael-Nunes-35/publication/379665297_Out_of_Sesame_Street_A_Study_of_Portuguese_Legal_Named_Entity_Recognition_Through_In-Context_Learning/links/6614701839e7641c0ba6879b/Out-of-Sesame-Street-A-Study-of-Portuguese-Legal-Named-Entity-Recognition-Through-In-Context-Learning.pdf) | Legal | ResearchGate | 2024-04 | []() | | [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Scientific | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) | | [Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations](https://arxiv.org/pdf/2404.05415) | Acupuncture Point | Arxiv | 2024-04 | []() | | [Advancing Entity Recognition in Biomedicine via Instruction Tuning of Large Language Models]() | Biomedical | Bioinformatics | 2024-03 | [GitHub](https://github.com/BIDS-Xu-Lab/BioNER-LLaMA) | | [Distilling Named Entity Recognition Models for Endangered Species from Large Language Models](https://arxiv.org/abs/2403.15430) | Endangered Species | Arxiv | 2024-03 | []() | | [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | Historical | COLING | 2024-03 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) | | [On-the-fly Definition Augmentation of LLMs for Biomedical NER](https://arxiv.org/abs/2404.00152) | Biomedical | NAACL | 2024-03 | [GitHub](https://github.com/allenai/beacon) | | [Improving LLM-Based Health Information Extraction with In-Context Learning](https://link.springer.com/chapter/10.1007/978-981-97-1717-0_4)| Health | Others | 2024-03 | | [Structured information extraction from scientific text with large language models](https://www.nature.com/articles/s41467-024-45563-x) | Scientific | Nat. Commun. | 2024-02 | [GitHub](https://github.com/LBNLP/NERRE) | | [Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study](https://arxiv.org/abs/2402.15663) | Pharmacovigilance | EACL | 2024-02 | [GitHub](https://github.com/ZhaoyueSun/phee-with-chatgpt) | | [Structured information extraction from scientific text with large language models](https://www.nature.com/articles/s41467-024-45563-x) | Scientific | Nat. Commun. | 2024-02 | [GitHub](https://github.com/lbnlp/nerre-llama) | | [Combining prompt‑based language models and weak supervision for labeling named entity recognition on legal documents](https://link.springer.com/article/10.1007/s10506-023-09388-1) | Legal | Others | 2024-02 | | [Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering](https://arxiv.org/abs/2303.16416v3) | Clinical | Arxiv | 2024-01 | [GitHub](https://github.com/BIDS-Xu-Lab/Clinical_Entity_Recognition_Using_GPT_models) | | [Impact of Sample Selection on In-Context Learning for Entity Extraction from Scientific Writing](https://aclanthology.org/2023.findings-emnlp.338/)| Scientific | EMNLP Findings | 2023-12 | [GitHub](https://github.com/adalin16/ICL_EE)| | [Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge](https://arxiv.org/abs/2305.12212) | Multimodal | ENMLP Findings | 2023-12 | [GitHub](https://github.com/JinYuanLi0012/PGIM) | | [In-context Learning for Few-shot Multimodal Named Entity Recognition](https://aclanthology.org/2023.findings-emnlp.196/) | Multimodal | ENMLP Findings | 2023-12 | | | [PolyIE: A Dataset of Information Extraction from Polymer Material Scientific Literature](https://arxiv.org/abs/2311.07715) | Polymer Material | Arxiv | 2023-11 | [GitHub](https://github.com/jerry3027/PolyIE) | | [Prompt-NER: Zero-shot Named Entity Recognition in Astronomy Literature via Large Language Models](https://arxiv.org/abs/2310.17892) | Astronomical | Arxiv | 2023-10 | | | [Inspire the Large Language Model by External Knowledge on BioMedical Named Entity Recognition](https://arxiv.org/abs/2309.12278) | Biomedical | Arxiv | 2023-09 | | | [Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction](https://arxiv.org/abs/2306.14122) | Multimodal | Arxiv | 2023-08 | | | [DICE: Data-Efficient Clinical Event Extraction with Generative Models](https://aclanthology.org/2023.acl-long.886.pdf) | Clinical | ACL | 2023-07 | [GitHub](https://github.com/derekmma/DICE) | | [How far is Language Model from 100% Few-shot Named Entity Recognition in Medical Domain](https://arxiv.org/abs/2307.00186) | Medical | Arxiv | 2023-07 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) | | [Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction](https://aclanthology.org/2023.bionlp-1.15/) | Multilingual / Clinical | BioNLP | 2023-07 | [GitHub](https://github.com/arkhn/bio-nlp2023) | | [Does Synthetic Data Generation of LLMs Help Clinical Text Mining?](https://arxiv.org/abs/2303.04360) | Clinical | Arxiv | 2023-04 | | | [Yes but.. Can ChatGPT Identify Entities in Historical Documents](https://arxiv.org/abs/2303.17322) | Historical | JCDL | 2023-03 | | | [Zero-shot Clinical Entity Recognition using ChatGPT](https://arxiv.org/abs/2303.16416) | Clinical | Arxiv | 2023-03 | | | [Structured information extraction from complex scientific text with fine-tuned large language models](https://arxiv.org/abs/2212.05238) | Scientific | Arxiv | 2022-12 | [Demo](http://www.matscholar.com/info-extraction) | | [Multilingual generative language models for zero-shot cross-lingual event argument extraction](https://aclanthology.org/2022.acl-long.317.pdf) | Multilingual | ACL | 2022-05 | [GitHub](https://github.com/PlusLabNLP/X-Gear) | # Evaluation and Analysis | Paper | Venue | Date | Code | | :----- | :--------------: | :------- | :---------: | | [TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction](https://aclanthology.org/2024.findings-acl.760/) | ACL Findings | 2024 | [GitHub](https://github.com/ej0cl6/TextEE) | | [IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus](https://aclanthology.org/2024.acl-short.13/) | ACL Short | 2024 | [GitHub](https://github.com/zjunlp/IEPile?tab=readme-ov-file) | | [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | COLING | 2024 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) | | [GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models](https://aclanthology.org/2024.naacl-long.155/) | NAACL | 2024 | [GitHub](https://github.com/pat-jj/GenRES) | | [Empirical Analysis of Dialogue Relation Extraction with Large Language Models](https://arxiv.org/abs/2404.17802) | IJCAI | 2024 | []() | | [Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?](https://arxiv.org/abs/2405.02602) | Arxiv | 2024-05 | []() | | [Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations](https://arxiv.org/pdf/2404.05415) | Arxiv | 2024-04 | []() | | [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) | | [Distilling Named Entity Recognition Models for Endangered Species from Large Language Models](https://arxiv.org/abs/2403.15430) | Arxiv | 2024-03 | []() | | [LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities](https://arxiv.org/abs/2305.13168) | Arxiv | 2024-02 | [GitHub](https://github.com/zjunlp/AutoKG)| | [Few shot clinical entity recognition in three languages: Masked language models outperform LLM prompting](https://arxiv.org/abs/2402.12801) | Arxiv | 2024-02 | | [Information Extraction from Legal Wills: How Well Does GPT-4 Do?](https://aclanthology.org/2023.findings-emnlp.287/) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/ml4ai/ie4wills/) | | [Information Extraction in Low-Resource Scenarios: Survey and Perspective](https://arxiv.org/abs/2202.08063)| Arxiv | 2023-12 | [GitHub](https://github.com/zjunlp/Low-resource-KEPapers) | | [Empirical Study of Zero-Shot NER with ChatGPT](https://arxiv.org/abs/2310.10035) | EMNLP | 2023-12 | [GitHub](https://github.com/Emma1066/Zero-Shot-NER-with-ChatGPT) | | [NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval](https://arxiv.org/abs/2310.14282) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/katzurik/NERetrieve) | | [Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction](https://aclanthology.org/2023.emnlp-main.360/) | EMNLP | 2023-12 | [GitHub](https://github.com/qijimrc/ROBUST) | | [PolyIE: A Dataset of Information Extraction from Polymer Material Scientific Literature](https://arxiv.org/abs/2311.07715) | Arxiv | 2023-11 | [GitHub](https://github.com/jerry3027/PolyIE) | | [XNLP: An Interactive Demonstration System for Universal Structured NLP](https://arxiv.org/abs/2308.01846) | Arxiv | 2023-08 | [Demo](http://xnlp.haofei.vip/) | | [A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks](https://arxiv.org/abs/2307.12114) | Arxiv | 2023-07 | | | [How far is Language Model from 100% Few-shot Named Entity Recognition in Medical Domain](https://arxiv.org/abs/2307.00186) | Arxiv | 2023-07 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) | | [Revisiting Relation Extraction in the era of Large Language Models](https://aclanthology.org/2023.acl-long.868.pdf) | ACL | 2023-07 | [GitHub](https://sominw.com/ACL23LLMs) | | [Zero-shot Temporal Relation Extraction with ChatGPT](https://aclanthology.org/2023.bionlp-1.7/) | BioNLP | 2023-07 | | | [InstructIE: A Chinese Instruction-based Information Extraction Dataset](https://arxiv.org/abs/2305.11527) | Arxiv | 2023-05 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm) | | [Is Information Extraction Solved by ChatGPT? An Analysis of Performance, Evaluation Criteria, Robustness and Errors](https://arxiv.org/abs/2305.14450) | Arxiv | 2023-05 | [GitHub](https://github.com/FreedomIntelligence/Evaluation-of-ChatGPT-on-Information-Extraction) | | [Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness](https://arxiv.org/abs/2304.11633) | Arxiv | 2023-04 | [GitHub](https://github.com/pkuserc/ChatGPT_for_IE) | | [Exploring the Feasibility of ChatGPT for Event Extraction](https://arxiv.org/abs/2303.03836) | Arxiv | 2023-03 | | | [Yes but.. Can ChatGPT Identify Entities in Historical Documents](https://arxiv.org/abs/2303.17322) | JCDL | 2023-03 | | | [Zero-shot Clinical Entity Recognition using ChatGPT](https://arxiv.org/abs/2303.16416) | Arxiv | 2023-03 | | | [Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again](https://aclanthology.org/2022.findings-emnlp.329/) | EMNLP Findings | 2022-12 | [GitHub](https://github.com/dki-lab/few-shot-bioIE) | | [Large Language Models are Few-Shot Clinical Information Extractors](https://arxiv.org/abs/2205.12689) | EMNLP | 2022-12 | [Huggingface](https://huggingface.co/datasets/mitclinicalml/clinical-ie) | # Project and Toolkit | Paper | Type | Venue | Date | Link | | :------ | :------: | :------: | :------: | :------: | | ONEKE | Project | - | - | [Link](http://oneke.openkg.cn/) | | [TechGPT-2.0: A Large Language Model Project to Solve the Task of Knowledge Graph Construction](https://arxiv.org/abs/2401.04507) | Project | Arxiv | 2024-01 | [Link](https://github.com/neukg/TechGPT-2.0) | | [CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction](https://arxiv.org/abs/2307.00769) | Toolkit | Arxiv | 2023-07 | [Link](https://github.com/cocacola-lab/CollabKG) | # Recently Updated Papers ## 2024/09/04 | Paper | Venue | Date | Code | | :------ | :------: | :------: | :------: | | [Timeline-based Sentence Decomposition with In-Context Learning for Temporal Fact Extraction](https://aclanthology.org/2024.acl-long.187/) | ACL | 2024-08 | [GitHub](https://github.com/JianhaoChen-nju/TSDRE) | | [Epidemic Information Extraction for Event-Based Surveillance using Large Language Models](https://link.springer.com/chapter/10.1007/978-981-97-4581-4_17) | ICICT | 2024-08 | | | [SpeechEE: A Novel Benchmark for Speech Event Extraction](https://arxiv.org/abs/2408.09462) | ACM MM | 2024-08 | [GitHub](https://speechee.github.io/) | | [HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction](https://arxiv.org/abs/2408.04948)| Arxiv | 2024-08 | | | [Knowledge AI: Fine-tuning NLP Models for Facilitating Scientific Knowledge Extraction and Understanding](https://arxiv.org/abs/2408.04651)| Arxiv | 2024-08 | | | [Target Prompting for Information Extraction with Vision Language Model](https://arxiv.org/abs/2408.03834)| Arxiv | 2024-08 | | | [Evaluating Named Entity Recognition Using Few-Shot Prompting with Large Language Models](https://arxiv.org/abs/2408.15796) | Arxiv | 2024-08 | [GitHub](https://github.com/GEODE-project/ner-llm)| | [Utilizing Large Language Models for Named Entity Recognition in Traditional Chinese Medicine against COVID-19 Literature: Comparative Study](https://arxiv.org/abs/2408.13501) | Arxiv | 2024-08 | | | [CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition](https://arxiv.org/abs/2408.12834) | ECAI | 2024-08 | | | [LLMs are not Zero-Shot Reasoners for Biomedical Information Extraction](https://arxiv.org/abs/2408.12249) | Arxiv | 2024-08 | | | [Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition](https://www.arxiv.org/abs/2407.17344) | Arxiv | 2024-07 | | | [MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models](https://arxiv.org/abs/2407.10953) | Arxiv | 2024-08 | [GitHub](https://ganchengguang.github.io/MRE/) | | [FsPONER: Few-shot Prompt Optimization for Named Entity Recognition in Domain-specific Scenarios](https://arxiv.org/abs/2407.08035) | ECAI | 2024-07 | [GitHub](https://github.com/markustyj/FsPONER_ECAI2024) | | [Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters](https://arxiv.org/abs/2407.01406) | KaLLM workshop | 2024-07 | [GitHub](https://github.com/d-gurgurov/Injecting-Commonsense-Knowledge-into-LLMs) | | [Show Less, Instruct More: Enriching Prompts with Definitions and Guidelines for Zero-Shot NER](https://arxiv.org/abs/2407.01272) | Arxiv | 2024-07 | | | [Large Language Models Struggle in Token-Level Clinical Named Entity Recognition](https://arxiv.org/abs/2407.00731) | AMIA | 2024-08 | | | [GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks]() | Arxiv | 2024-08 | | | [Retrieval Augmented Instruction Tuning for Open NER with Large Language Models](https://arxiv.org/abs/2406.17305) | Arxiv | 2024-06 | [GitHub](https://github.com/Emma1066/Retrieval-Augmented-IT-OpenNER) | | [Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition](https://arxiv.org/abs/2406.11192) | Arxiv | 2024-06 | [GitHub](https://github.com/UmeanNever/B2NER) | | [Fighting Against the Repetitive Training and Sample Dependency Problem in Few-shot Named Entity Recognition](https://ieeexplore.ieee.org/abstract/document/10463035) | IEEE Access | 2024-06 | [GitHub](https://github.com/changtianluckyforever/SMCS_project) | | [llmNER: (Zero\|Few)-Shot Named Entity Recognition, Exploiting the Power of Large Language Models](https://arxiv.org/abs/2406.04528) | Arxiv | 2024-06 | [GitHub](https://github.com/plncmm/llmner) | | [Assessing the Performance of Chinese Open Source Large Language Models in Information Extraction Tasks](https://arxiv.org/abs/2406.02079) | Arxiv | 2024-06 | | # Datasets \* denotes the dataset is multimodal. # refers to the number of categories or sentences.
Task Dataset Domain #Class #Train #Val #Test Link
NER ACE04 News 7 6202 745 812 Link
ACE05 News 7 7299 971 1060 Link
BC5CDR Biomedical 2 4560 4581 4797 Link
Broad Twitter Corpus Social Media 3 6338 1001 2000 Link
CADEC Biomedical 1 5340 1097 1160 Link
CoNLL03 News 4 14041 3250 3453 Link
CoNLLpp News 4 14041 3250 3453 Link
CrossNER-AI Artificial Intelligence 14 100 350 431 Link
CrossNER-Literature Literary 12 100 400 416
CrossNER-Music Musical 13 100 380 465
CrossNER-Politics Political 9 199 540 650
CrossNER-Science Scientific 17 200 450 543
FabNER Scientific 12 9435 2182 2064 Link
Few-NERD General 66 131767 18824 37468 Link
FindVehicle Traffic 21 21565 20777 20777 Link
GENIA Biomedical 5 15023 1669 1854 Link
HarveyNER Social Media 4 3967 1301 1303 Link
MIT-Movie Social Media 12 9774 2442 2442 Link
MIT-Restaurant Social Media 8 7659 1520 1520 Link
MultiNERD Wikipedia 16 134144 10000 10000 Link
NCBI Biomedical 4 5432 923 940 Link
OntoNotes 5.0 General 18 59924 8528 8262 Link
ShARe13 Biomedical 1 8508 12050 9009 Link
ShARe14 Biomedical 1 17404 1360 15850 Link
SNAP* Social Media 4 4290 1432 1459 Link
Temporal Twitter Corpus (TTC) Social Meida 3 10000 500 1500 Link
Tweebank-NER Social Media 4 1639 710 1201 Link
Twitter2015* Social Media 4 4000 1000 3357 Link
Twitter2017* Social Media 4 3373 723 723 Link
TwitterNER7 Social Media 7 7111 886 576 Link
WikiDiverse* News 13 6312 755 757 Link
WNUT2017 Social Media 6 3394 1009 1287 Link
RE ACE05 News 7 10051 2420 2050 Link
ADE Biomedical 1 3417 427 428 Link
CoNLL04 News 5 922 231 288 Link
DocRED Wikipedia 96 3008 300 700 Link
MNRE* Social Media 23 12247 1624 1614 Link
NYT News 24 56196 5000 5000 Link
Re-TACRED News 40 58465 19584 13418 Link
SciERC Scientific 7 1366 187 397 Link
SemEval2010 General 19 6507 1493 2717 Link
TACRED News 42 68124 22631 15509 Link
TACREV News 42 68124 22631 15509 Link
EE ACE05 News 33/22 17172 923 832 Link
CASIE Cybersecurity 5/26 11189 1778 3208 Link
GENIA11 Biomedical 9/11 8730 1091 1092 Link
GENIA13 Biomedical 13/7 4000 500 500 Link
PHEE Biomedical 2/16 2898 961 968 Link
RAMS News 139/65 7329 924 871 Link
WikiEvents Wikipedia 50/59 5262 378 492 Link
## Star History [![Star History Chart](https://api.star-history.com/svg?repos=quqxui/Awesome-LLM4IE-Papers&type=Date)](https://star-history.com/#quqxui/Awesome-LLM4IE-Papers&Date)