# Awesome-LLM4IE-Papers **Repository Path**: orangego/Awesome-LLM4IE-Papers ## Basic Information - **Project Name**: Awesome-LLM4IE-Papers - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-10-27 - **Last Updated**: 2025-10-27 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Awesome-LLM4IE-Papers 🔥🔥🔥 **The article has been accepted by Frontiers of Computer Science (FCS)**. --- Awesome papers about generative Information extraction using LLMs
./update_new_papers_list.
- **2024/09/04** Add 22 papers
- **2024/06/06** Add 41 papers
- **2024/03/30** Add 27 papers
- **2024/03/29** Add 20 papers
# Information Extraction tasks
A taxonomy by various tasks.
## Named Entity Recognition
Models targeting only ner tasks.
### Entity Typing
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing](https://aclanthology.org/2023.findings-emnlp.1040/) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/yanlinf/CASENT) |
| [Generative Entity Typing with Curriculum Learning](https://arxiv.org/abs/2210.02914) | EMNLP | 2022-12 | [GitHub](https://github.com/siyuyuan/GET) |
### Entity Identification & Typing
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Granular Entity Mapper: Advancing Fine-grained Multimodal Named Entity Recognition and Grounding](https://aclanthology.org/2024.findings-emnlp.183/) | EMNLP Findings | 2024 | |
| [Double-Checker: Large Language Model as a Checker for Few-shot Named Entity Recognition](https://aclanthology.org/2024.findings-emnlp.180/) | EMNLP Findings | 2024 | [GitHub](https://github.com/fanshu6hao/Double-Checker) |
| [VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models](https://aclanthology.org/2024.acl-long.134/) | ACL | 2024 | [GitHub](https://github.com/emseoyk/VerifiNER) |
| [ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models](https://aclanthology.org/2024.findings-acl.947/) | ACL Findings | 2024 | [GitHub](https://github.com/StefanHeng/ProgGen) |
| [Rethinking Negative Instances for Generative Named Entity Recognition](https://aclanthology.org/2024.findings-acl.206/) | ACL Findings | 2024 | [GitHub](https://github.com/yyDing1/GNER) |
| [LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition](https://aclanthology.org/2024.findings-acl.76/) | ACL Findings | 2024 | [GitHub](https://github.com/JinYuanLi0012/RiVEG) |
| [RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition](https://academic.oup.com/jamia/advance-article/doi/10.1093/jamia/ocae095/7665312) | Others | 2024-05 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) |
| [P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models](https://arxiv.org/abs/2405.04960) | Arxiv | 2024-06 | [GitHub](https://github.com/jiangguochaoGG/P-ICL) |
| [Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?](https://arxiv.org/abs/2405.02602) | Arxiv | 2024-05 | []() |
| [Know-Adapter: Towards Knowledge-Aware Parameter-Efficient Transfer Learning for Few-shot Named Entity Recognition](https://aclanthology.org/2024.lrec-main.854/) | COLING | 2024 | []() |
| [ToNER: Type-oriented Named Entity Recognition with Generative Language Model](https://aclanthology.org/2024.lrec-main.1412.pdf) | COLING | 2024 | []() |
| [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | COLING | 2024 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) |
| [Astronomical Knowledge Entity Extraction in Astrophysics Journal Articles via Large Language Models](https://iopscience.iop.org/article/10.1088/1674-4527/ad3d15/meta) | Others | 2024-04 | []() |
| [LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking](https://arxiv.org/abs/2404.05624) | Arxiv | 2024-04 | [GitHub](https://github.com/YFR718/LTNER) |
| [Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models](https://arxiv.org/pdf/2404.05587) | Others | 2024-04 | []() |
| [Knowledge-Enriched Prompt for Low-Resource Named Entity Recognition](https://dl.acm.org/doi/abs/10.1145/3659948) | TALLIP | 2024-04 | []() |
| [VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity Recognition](https://arxiv.org/abs/2404.17835) | Arxiv | 2024-04 | [GitHub](https://github.com/Eulring/VANER) |
| [LLMs in Biomedicine: A study on clinical Named Entity Recognition](https://arxiv.org/pdf/2404.07376) | Arxiv | 2024-04 | []() |
| [Out of Sesame Street: A Study of Portuguese Legal Named Entity Recognition Through In-Context Learning](https://www.researchgate.net/profile/Rafael-Nunes-35/publication/379665297_Out_of_Sesame_Street_A_Study_of_Portuguese_Legal_Named_Entity_Recognition_Through_In-Context_Learning/links/6614701839e7641c0ba6879b/Out-of-Sesame-Street-A-Study-of-Portuguese-Legal-Named-Entity-Recognition-Through-In-Context-Learning.pdf) | ResearchGate | 2024-04 | []() |
| [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) |
| [LinkNER: Linking Local Named Entity Recognition Models to Large Language Models using Uncertainty](https://arxiv.org/abs/2402.10573) | WWW | 2024 |
| [Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models](https://aclanthology.org/2024.naacl-short.49/) | NAACL Short | 2024 | [GitHub](https://github.com/Emma1066/Self-Improve-Zero-Shot-NER) |
| [On-the-fly Definition Augmentation of LLMs for Biomedical NER](https://arxiv.org/abs/2404.00152) | NAACL | 2024 | [GitHub](https://github.com/allenai/beacon) |
| [MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks](https://arxiv.org/abs/2404.00457) | Arxiv | 2024-03 | [GitHub](https://github.com/KomeijiForce/MetaIE) |
| [Distilling Named Entity Recognition Models for Endangered Species from Large Language Models](https://arxiv.org/abs/2403.15430) | Arxiv | 2024-03 | []() |
| [Augmenting NER Datasets with LLMs: Towards Automated and Refined Annotation](https://arxiv.org/abs/2404.01334) | Arxiv | 2024-03 | []() |
| [ConsistNER: Towards Instructive NER Demonstrations for LLMs with the Consistency of Ontology and Context](https://ojs.aaai.org/index.php/AAAI/article/view/29892)| AAAI | 2024 |
| [Embedded Named Entity Recognition using Probing Classifiers](https://arxiv.org/abs/2403.11747) | Arxiv | 2024-03 | [GitHub](https://github.com/nicpopovic/EMBER) |
| [In-Context Learning for Few-Shot Nested Named Entity Recognition](https://arxiv.org/abs/2402.01182) | Arxiv | 2024-02 | []() |
| [LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition](https://arxiv.org/abs/2402.14568) | Arxiv | 2024-02 | []() |
| [Structured information extraction from scientific text with large language models](https://www.nature.com/articles/s41467-024-45563-x) | Nature Communications | 2024-02 | [GitHub](https://github.com/lbnlp/nerre-llama) |
| [NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data](https://arxiv.org/abs/2402.15343) | Arxiv | 2024-02 |
| [A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction](https://arxiv.org/abs/2402.13364) | Arxiv | 2024-02 |
| [PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition](https://arxiv.org/abs/2402.04838) | Arxiv | 2024-02 |
| [Small Language Model Is a Good Guide for Large Language Model in Chinese Entity Relation Extraction](https://arxiv.org/abs/2402.14373) | Arxiv | 2024-02 | |
| [C-ICL: Contrastive In-context Learning for Information Extraction](https://arxiv.org/abs/2402.11254) | Arxiv | 2024-02 |
| [UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition](https://openreview.net/pdf?id=r65xfUb76p) | ICLR | 2024 | [GitHub](https://github.com/universal-ner/universal-ner) |
| [Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering](https://arxiv.org/abs/2303.16416v3) | Arxiv | 2024-01 | [GitHub](https://github.com/BIDS-Xu-Lab/Clinical_Entity_Recognition_Using_GPT_models) |
| [2INER: Instructive and In-Context Learning on Few-Shot Named Entity Recognition](https://aclanthology.org/2023.findings-emnlp.259/) | EMNLP Findings | 2023-12 | |
| [In-context Learning for Few-shot Multimodal Named Entity Recognition](https://aclanthology.org/2023.findings-emnlp.196/) | EMNLP Findings | 2023-12 | |
| [Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!](https://arxiv.org/abs/2303.08559) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/mayubo2333/LLM-IE) |
| [Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset](https://arxiv.org/abs/2310.10118) | EMNLP | 2023-12 | [GitHub](https://github.com/CompNet/conivel/tree/gen) |
| [LLMaAA: Making Large Language Models as Active Annotators](https://arxiv.org/abs/2310.19596) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/ridiculouz/LLMAAA) |
| [Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge](https://arxiv.org/abs/2305.12212) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/JinYuanLi0012/PGIM) |
| [GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer](https://arxiv.org/abs/2311.08526) | Arxiv | 2023-11 | [GitHub](https://github.com/urchade/GLiNER) |
| [GPT Struct Me: Probing GPT Models on Narrative Entity Extraction](https://ieeexplore.ieee.org/abstract/document/10350182) | WI-IAT | 2023-10 | [GitHub](https://github.com/hmosousa/gpt_struct_me) |
| [GPT-NER: Named Entity Recognition via Large Language Models](https://arxiv.org/abs/2304.10428) | Arxiv | 2023-10 | [GitHub](https://github.com/ShuheWang1998/GPT-NER) |
| [Prompt-NER: Zero-shot Named Entity Recognition in Astronomy Literature via Large Language Models](https://arxiv.org/abs/2310.17892) | Arxiv | 2023-10 | |
| [Inspire the Large Language Model by External Knowledge on BioMedical Named Entity Recognition](https://arxiv.org/abs/2309.12278) | Arxiv | 2023-09 | |
| [One Model for All Domains: Collaborative Domain-Prefx Tuning for Cross-Domain NER](https://arxiv.org/abs/2301.10410) | IJCAI | 2023-09 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/ner/cross) |
| [Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction](https://arxiv.org/abs/2306.14122) | Arxiv | 2023-08 | |
| [Learning In-context Learning for Named Entity Recognition ](https://aclanthology.org/2023.acl-long.764/) | ACL | 2023-07 | [GitHub](https://github.com/chen700564/metaner-icl) |
| [Debiasing Generative Named Entity Recognition by Calibrating Sequence Likelihood](https://aclanthology.org/2023.acl-short.98/) | ACL Short | 2023-07 | |
| [Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks](https://aclanthology.org/2023.findings-acl.578/) | ACL Findings | 2023-07 | |
| [Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction](https://aclanthology.org/2023.bionlp-1.15/) | BioNLP | 2023-07 | [GitHub](https://github.com/arkhn/bio-nlp2023) |
| [NAG-NER: a Unified Non-Autoregressive Generation Framework for Various NER Tasks](https://aclanthology.org/2023.acl-industry.65/) | ACL Industry | 2023-07 |
| [Unified Named Entity Recognition as Multi-Label Sequence Generation](https://ieeexplore.ieee.org/abstract/document/10191921) | IJCNN | 2023-06 |
| [PromptNER : Prompting For Named Entity Recognition](https://arxiv.org/abs/2305.15444) | Arxiv | 2023-06 | |
| [Does Synthetic Data Generation of LLMs Help Clinical Text Mining?](https://arxiv.org/abs/2303.04360) | Arxiv | 2023-04 | |
| [Unified Text Structuralization with Instruction-tuned Language Models](https://arxiv.org/abs/2303.14956) | Arxiv | 2023-03 | []() |
| [Structured information extraction from complex scientific text with fine-tuned large language models](https://arxiv.org/abs/2212.05238) | Arxiv | 2022-12 | [Demo](http://www.matscholar.com/info-extraction) |
| [LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting](https://aclanthology.org/2022.coling-1.209/) | COLING | 2022-10 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/ner/few-shot) |
| [De-bias for generative extraction in unified NER task](https://aclanthology.org/2022.acl-long.59.pdf) | ACL | 2022-05 | |
| [InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER](https://arxiv.org/abs/2203.03903) | Arxiv | 2022-03 | |
| [Document-level Entity-based Extraction as Template Generation](https://aclanthology.org/2021.emnlp-main.426/) | EMNLP | 2021-11 | [GitHub](https://github.com/PlusLabNLP/TempGen) |
| [A Unified Generative Framework for Various NER Subtasks](https://arxiv.org/abs/2106.01223) | ACL | 2021-08 | [GitHub](https://github.com/yhcc/BARTNER) |
| [Template-Based Named Entity Recognition Using BART](https://aclanthology.org/2021.findings-acl.161.pdf) | ACL Findings | 2021-08 | [GitHub](https://github.com/Nealcly/templateNER) |
## Relation Extraction
Models targeting only RE tasks.
### Relation Classification
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models](https://arxiv.org/pdf/2404.05587) | Others | 2024-04 | []() |
| [CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model](https://arxiv.org/abs/2404.18085) | Arxiv | 2024-04 | [GitHub](https://github.com/SkyuForever/CRE-LLM) |
| [Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction](https://arxiv.org/abs/2404.17809) | IJCAI | 2024-04 | []() |
| [Empirical Analysis of Dialogue Relation Extraction with Large Language Models](https://arxiv.org/abs/2404.17802) | IJCAI | 2024-04 | []() |
| [Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors](https://arxiv.org/abs/2404.17807) | IJCAI | 2024-04 | []() |
| [Retrieval-Augmented Generation-based Relation Extraction](https://arxiv.org/abs/2404.13397) | Arxiv | 2024-04 | [GitHub](https://github.com/sefeoglu/RAG4RE) |
| [Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations](https://arxiv.org/pdf/2404.05415) | Arxiv | 2024-04 | []() |
| [STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models](https://ojs.aaai.org/index.php/AAAI/article/view/29839) | AAAI | 2024-03 | |
| [Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction](https://arxiv.org/abs/2402.11142) | Arxiv | 2024-02 |
| [Chain of Thought with Explicit Evidence Reasoning for Few-shot Relation Extraction](https://aclanthology.org/2023.findings-emnlp.153/) | EMNLP Findings | 2023-12 | |
| [GPT-RE: In-context Learning for Relation Extraction using Large Language Models](https://arxiv.org/abs/2305.02105) | EMNLP | 2023-12 | [GitHub](https://github.com/YukinoWan/GPT-RE) |
| [Guideline Learning for In-context Information Extraction](https://arxiv.org/abs/2310.05066) | EMNLP | 2023-12 | |
| [Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!](https://arxiv.org/abs/2303.08559) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/mayubo2333/LLM-IE) |
| [LLMaAA: Making Large Language Models as Active Annotators](https://arxiv.org/abs/2310.19596) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/ridiculouz/LLMAAA) |
| [Improving Unsupervised Relation Extraction by Augmenting Diverse Sentence Pairs](https://arxiv.org/abs/2312.00552) | EMNLP | 2023-12 | [GitHub](https://github.com/qingwang-isu/AugURE) |
| [Revisiting Large Language Models as Zero-shot Relation Extractors](https://arxiv.org/abs/2310.05028) | EMNLP Findings | 2023-12 | |
| [Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment](https://arxiv.org/abs/2310.10590) | Arxiv | 2023-10 |
| [Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors](https://aclanthology.org/2023.findings-acl.50.pdf) | ACL Findings | 2023-07 | [GitHub](https://github.com/OSU-NLP-Group/QA4RE) |
| [How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?](https://arxiv.org/abs/2305.01555) | ACL Workshop | 2023-07 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/UnleashLLMRE) |
| [Sequence generation with label augmentation for relation extraction](https://ojs.aaai.org/index.php/AAAI/article/view/26532)| AAAI | 2023-06 | [GitHub](https://github.com/pkuserc/RELA) |
| [Does Synthetic Data Generation of LLMs Help Clinical Text Mining?](https://arxiv.org/abs/2303.04360) | Arxiv | 2023-04 | |
| [DORE: Document Ordered Relation Extraction based on Generative Framework](https://aclanthology.org/2022.findings-emnlp.253/) | EMNLP Findings | 2022-12 |
| [REBEL: Relation Extraction By End-to-end Language generation](https://aclanthology.org/2021.findings-emnlp.204/) | EMNLP Findings | 2021-11 | [GitHub](https://github.com/babelscape/rebel) |
### Relation Triplet
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis](https://aclanthology.org/2024.acl-long.476/) | ACL | 2024 | [GitHub](https://github.com/OceannTwT/era-cot) |
| [AutoRE: Document-Level Relation Extraction with Large Language Models](https://aclanthology.org/2024.acl-demos.20/) | ACL Demos | 2024 | [GitHub](https://github.com/bigdante/AutoRE) |
| [Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors](https://arxiv.org/abs/2404.17807) | IJCAI | 2024-04 | []() |
| [Consistency Guided Knowledge Retrieval and Denoising in LLMs for Zero-shot Document-level Relation Triplet Extraction](https://arxiv.org/abs/2401.13598) | WWW | 2024 |
| [Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction](https://aclanthology.org/2024.lrec-main.778/) | COLING | 2024 | [GitHub](https://github.com/Ding-Papa/Evaluating-filtering-coling24) |
| [Unlocking Instructive In-Context Learning with Tabular Prompting for Relational Triple Extraction](https://aclanthology.org/2024.lrec-main.1488/) | COLING | 2024 | |
| [A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction](https://arxiv.org/abs/2402.13364) | Arxiv | 2024-02 |
| [Structured information extraction from scientific text with large language models](https://www.nature.com/articles/s41467-024-45563-x) | Nature Communications | 2024-02 | [GitHub](https://github.com/lbnlp/nerre-llama) |
| [Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models](https://arxiv.org/abs/2310.11085) | Arxiv | 2024-02 | [GitHub](https://github.com/oezyurty/REPLM) |
| [Small Language Model Is a Good Guide for Large Language Model in Chinese Entity Relation Extraction](https://arxiv.org/abs/2402.14373) | Arxiv | 2024-02 | |
| [Efficient Data Learning for Open Information Extraction with Pre-trained Language Models](https://aclanthology.org/2023.findings-emnlp.869/) | EMNLP Findings | 2023-12 |
| [Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment](https://arxiv.org/abs/2310.10590) | Arxiv | 2023-10 |
| [Unified Text Structuralization with Instruction-tuned Language Models](https://arxiv.org/abs/2303.14956) | Arxiv | 2023-03 | []() |
| [Document-level Entity-based Extraction as Template Generation](https://aclanthology.org/2021.emnlp-main.426/) | EMNLP | 2021-11 | [GitHub](https://github.com/PlusLabNLP/TempGen) |
### Relation Strict
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks](https://arxiv.org/abs/2404.00457) | Arxiv | 2024-03 | [GitHub](https://github.com/KomeijiForce/MetaIE) |
| [Distilling Named Entity Recognition Models for Endangered Species from Large Language Models](https://arxiv.org/abs/2403.15430) | Arxiv | 2024-03 | []() |
| [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | COLING | 2024-03 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) |
| [An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction](https://ojs.aaai.org/index.php/AAAI/article/view/29919) | AAAI | 2024-03 | [GitHub](https://github.com/urchade/ATG) |
| [C-ICL: Contrastive In-context Learning for Information Extraction](https://arxiv.org/abs/2402.11254) | Arxiv | 2024-02 |
| [REBEL: Relation Extraction By End-to-end Language generation](https://aclanthology.org/2021.findings-emnlp.204/) | EMNLP Findings | 2021-11 | [GitHub](https://github.com/babelscape/rebel) |
## Event Extraction
Models targeting only EE tasks.
### Event Detection
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Improving Event Definition Following For Zero-Shot Event Detection](https://arxiv.org/abs/2403.02586) | Arxiv | 2024-03 | []() |
| [Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment](https://arxiv.org/abs/2310.10590) | Arxiv | 2023-10 |
| [Unified Text Structuralization with Instruction-tuned Language Models](https://arxiv.org/abs/2303.14956) | Arxiv | 2023-03 | []() |
| [Unleash GPT-2 Power for Event Detection](https://aclanthology.org/2021.acl-long.490.pdf) | ACL | 2021-08 | []() |
### Event Argument Extraction
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [LLMs Learn Task Heuristics from Demonstrations: A Heuristic-Driven Prompting Strategy for Document-Level Event Argument Extraction](https://aclanthology.org/2024.acl-long.647/) | ACL | 2024 | [GitHub](https://github.com/hzzhou01/HD-LoA-Prompting) |
| [Beyond Single-Event Extraction: Towards Efficient Document-Level Multi-Event Argument Extraction](https://aclanthology.org/2024.findings-acl.564/) | ACL Findings | 2024 | [GitHub](https://github.com/LWL-cpu/DEEIA) |
| [KeyEE: Enhancing Low-Resource Generative Event Extraction with Auxiliary Keyword Sub-Prompt](https://ieeexplore.ieee.org/abstract/document/10506770) | Others | 2024-04 | [GitHub](https://github.com/OStars/KeyEE) |
| [MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks](https://arxiv.org/abs/2404.00457) | Arxiv | 2024-03 | [GitHub](https://github.com/KomeijiForce/MetaIE) |
| [Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study](https://arxiv.org/abs/2402.15663) | EACL | 2024-02 | [GitHub](https://github.com/ZhaoyueSun/phee-with-chatgpt) |
| [ULTRA: Unleash LLMs' Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Refinement](https://arxiv.org/abs/2401.13218) | Arxiv | 2024-01 | []() |
| [Context-Aware Prompt for Generation-based Event Argument Extraction with Diffusion Models](https://dl.acm.org/doi/10.1145/3583780.3614820) | CIKM | 2023-10 | []() |
| [Contextualized Soft Prompts for Extraction of Event Arguments](https://aclanthology.org/2023.findings-acl.266/) | ACL Findings | 2023-07 | |
| [AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model](https://aclanthology.org/2023.acl-long.615/) | ACL | 2023-07 | [GitHub](https://github.com/PlusLabNLP/AMPERE) |
| [Code4Struct: Code Generation for Few-Shot Event Structure Prediction](https://arxiv.org/abs/2210.12810) | ACL | 2023-07 | [GitHub](https://github.com/xingyaoww/code4struct) |
| [Event Extraction as Question Generation and Answering](https://aclanthology.org/2023.acl-short.143.pdf) | ACL short | 2023-07 | [GitHub](https://github.com/dataminr-ai/Event-Extraction-as-Question-Generation-and-Answering) |
| [Global Constraints with Prompting for Zero-Shot Event Argument Classification](https://aclanthology.org/2023.findings-eacl.191/) | EACL Findings | 2023-05 | |
| [Prompt for extraction? PAIE: prompting argument interaction for event argument extraction](https://aclanthology.org/2022.acl-long.466/) | ACL | 2022-05 | [GitHub](https://github.com/mayubo2333/PAIE) |
### Event Detection & Argument Extraction
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction](https://aclanthology.org/2024.findings-acl.760/) | ACL Findings | 2024 | [GitHub](https://github.com/ej0cl6/TextEE) |
| [EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models](https://arxiv.org/abs/2402.11430) | Arxiv | 2024-02 |
| [Guideline Learning for In-context Information Extraction](https://arxiv.org/abs/2310.05066) | EMNLP | 2023-12 | |
| [DemoSG: Demonstration-enhanced Schema-guided Generation for Low-resource Event Extraction](https://aclanthology.org/2023.findings-emnlp.121) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/GangZhao98/DemoSG) |
| [Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!](https://arxiv.org/abs/2303.08559) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/mayubo2333/LLM-IE) |
| [DICE: Data-Efficient Clinical Event Extraction with Generative Models](https://aclanthology.org/2023.acl-long.886.pdf) | ACL | 2023-07 | [GitHub](https://github.com/derekmma/DICE) |
| [A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction](https://openreview.net/pdf?id=3xGnOrUqt1) | NeurIPS Workshop | 2023-10 | []() |
| [STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models](https://ojs.aaai.org/index.php/AAAI/article/view/29839) | AAAI | 2024-03 | []() |
| [DEGREE: A Data-Efficient Generative Event Extraction Model](https://aclanthology.org/2022.naacl-main.138/) | NAACL | 2022-07 | [GitHub](https://github.com/PlusLabNLP/DEGREE) |
| [ClarET: Pre-training a correlation-aware context-to-event transformer for event-centric generation and classification](https://aclanthology.org/2022.acl-long.183/) | ACL | 2022-05 | [GitHub](https://github.com/yczhou001/ClarET) |
| [Dynamic prefix-tuning for generative template-based event extraction](https://aclanthology.org/2022.acl-long.358.pdf) | ACL | 2022-05 | []() |
| [Text2event: Controllable sequence-to- structure generation for end-to-end event extraction](https://arxiv.org/abs/2106.09232) | ACL | 2021-08 | [GitHub](https://github.com/luyaojie/text2event) |
| [Document-level event argument extraction by conditional generation](https://aclanthology.org/2021.naacl-main.69.pdf) | NAACL | 2021-06 | [GitHub](https://github.com/raspberryice/gen-arg) |
## Universal Information Extraction
Unified models targeting multiple IE tasks.
### NL-LLMs based
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Diluie: constructing diverse demonstrations of in-context learning with large language model for unified information extraction](https://link.springer.com/article/10.1007/s00521-024-09728-5) | Others | 2024-04 | [GitHub](https://github.com/Phevos75/DILUIE) |
| [ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models]() | COLING | 2024 | |
| [YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information Extraction](https://arxiv.org/abs/2312.15548) | Arxiv | 2024-04 |
| [Set Learning for Generative Information Extraction](https://aclanthology.org/2023.emnlp-main.806.pdf) | EMNLP | 2023-12 | []() |
| [GIELLM: Japanese General Information Extraction Large Language Model Utilizing Mutual Reinforcement Effect](https://arxiv.org/abs/2311.06838) | Arxiv | 2023-11 | []() |
| [InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction](https://arxiv.org/abs/2304.08085) | Arxiv | 2023-04 | [GitHub](https://github.com/BeyonderXX/InstructUIE) |
| [Zero-Shot Information Extraction via Chatting with ChatGPT](https://arxiv.org/abs/2302.10205) | Arxiv | 2023-02 | [GitHub](https://github.com/cocacola-lab/ChatIE) |
| [GenIE: Generative Information Extraction](https://aclanthology.org/2022.naacl-main.342.pdf) | NAACL | 2022-07 | [GitHub](https://github.com/epfl-dlab/GenIE) |
| [DEEPSTRUCT: Pretraining of Language Models for Structure Prediction](https://aclanthology.org/2022.findings-acl.67/) | ACL Findings | 2022-05 | [GitHub](https://github.com/wang-research-lab/deepstruct) |
| [Unified Structure Generation for Universal Information Extraction](https://aclanthology.org/2022.acl-long.395/) | ACL | 2022-05 | [GitHub](https://github.com/yhcc/BARTABSA) |
| [Structured prediction as translation between augmented natural languages](https://arxiv.org/abs/2101.05779) | ICLR | 2021-01 | [GitHub](https://github.com/amazon-science/tanl) |
### Code-LLMs based
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction](https://aclanthology.org/2024.acl-long.475/) | ACL | 2024 | [GitHub](https://ict-goknow.github.io/knowcoder/) |
| [GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction](https://openreview.net/pdf?id=Y3wpuxd7u9) | ICLR | 2024 | [GitHub](https://github.com/hitz-zentroa/GoLLIE) |
| [Retrieval-Augmented Code Generation for Universal Information Extraction](https://arxiv.org/abs/2311.02962) | Arxiv | 2023-11 | []() |
| [CODEIE: Large Code Generation Models are Better Few-Shot Information Extractors](https://arxiv.org/abs/2305.05711) | ACL | 2023-07 | [GitHub](https://github.com/artpli/CodeIE) |
| [CodeKGC: Code Language Model for Generative Knowledge Graph Construction](https://dl.acm.org/doi/abs/10.1145/3641850) | ACM TALLIP | 2024-03 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/CodeKGC) |
# Information Extraction Techniques
A taxonomy by techniques.
## Supervised Fine-tuning
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Rethinking Negative Instances for Generative Named Entity Recognition](https://aclanthology.org/2024.findings-acl.206/) | ACL Findings | 2024 | [GitHub](https://github.com/yyDing1/GNER) |
| [Beyond Single-Event Extraction: Towards Efficient Document-Level Multi-Event Argument Extraction](https://aclanthology.org/2024.findings-acl.564/) | ACL Findings | 2024 | [GitHub](https://github.com/LWL-cpu/DEEIA) |
| [AutoRE: Document-Level Relation Extraction with Large Language Models](https://aclanthology.org/2024.acl-demos.20/) | ACL Demos | 2024 | [GitHub](https://github.com/bigdante/AutoRE) |
| [Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction](https://arxiv.org/abs/2404.17809) | IJCAI | 2024-04 | []() |
| [Empirical Analysis of Dialogue Relation Extraction with Large Language Models](https://arxiv.org/abs/2404.17802) | IJCAI | 2024-04 | []() |
| [An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction](https://ojs.aaai.org/index.php/AAAI/article/view/29919) | AAAI | 2024 | [GitHub](https://github.com/urchade/ATG) |
| [Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction](https://aclanthology.org/2024.lrec-main.778/) | COLING | 2024 | [GitHub](https://github.com/Ding-Papa/Evaluating-filtering-coling24) |
| [ToNER: Type-oriented Named Entity Recognition with Generative Language Model](https://aclanthology.org/2024.lrec-main.1412.pdf) | COLING | 2024 | []() |
| [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | COLING | 2024 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) |
| [KeyEE: Enhancing Low-Resource Generative Event Extraction with Auxiliary Keyword Sub-Prompt](https://ieeexplore.ieee.org/abstract/document/10506770) | Others | 2024-04 | [GitHub](https://github.com/OStars/KeyEE) |
| [VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity Recognition](https://arxiv.org/abs/2404.17835) | Arxiv | 2024-04 | [GitHub](https://github.com/Eulring/VANER) |
| [LLMs in Biomedicine: A study on clinical Named Entity Recognition](https://arxiv.org/pdf/2404.07376) | Arxiv | 2024-04 | []() |
| [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) |
| [CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model](https://arxiv.org/abs/2404.18085) | Arxiv | 2024-04 | [GitHub](https://github.com/SkyuForever/CRE-LLM) |
| [Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations](https://arxiv.org/pdf/2404.05415) | Arxiv | 2024-04 | []() |
| [Improving Event Definition Following For Zero-Shot Event Detection](https://arxiv.org/abs/2403.02586) | Arxiv | 2024-03 | []() |
| [Embedded Named Entity Recognition using Probing Classifiers](https://arxiv.org/abs/2403.11747) | Arxiv | 2024-03 | [GitHub](https://github.com/nicpopovic/EMBER) |
| [EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models](https://arxiv.org/abs/2402.11430) | Arxiv | 2024-02 |
| [Structured information extraction from scientific text with large language models](https://www.nature.com/articles/s41467-024-45563-x) | Nature Communications | 2024-02 | [GitHub](https://github.com/lbnlp/nerre-llama) |
| [PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition](https://arxiv.org/abs/2402.04838) | Arxiv | 2024-02 |
| [UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition](https://openreview.net/pdf?id=r65xfUb76p) | ICLR | 2024 | [GitHub](https://github.com/universal-ner/universal-ner) |
| [GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction](https://openreview.net/pdf?id=Y3wpuxd7u9) | ICLR | 2024 | [GitHub](https://github.com/hitz-zentroa/GoLLIE) |
| [Set Learning for Generative Information Extraction](https://aclanthology.org/2023.emnlp-main.806.pdf) | EMNLP | 2023-12 | []() |
| [Efficient Data Learning for Open Information Extraction with Pre-trained Language Models](https://aclanthology.org/2023.findings-emnlp.869/) | EMNLP Findings | 2023-12 |
| [DemoSG: Demonstration-enhanced Schema-guided Generation for Low-resource Event Extraction](https://aclanthology.org/2023.findings-emnlp.121) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/GangZhao98/DemoSG) |
| [Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing](https://aclanthology.org/2023.findings-emnlp.1040/) | EMNLP Findings | 2023-12 | []() |
| [GIELLM: Japanese General Information Extraction Large Language Model Utilizing Mutual Reinforcement Effect](https://arxiv.org/abs/2311.06838) | Arxiv | 2023-11 | []() |
| [GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer](https://arxiv.org/abs/2311.08526) | Arxiv | 2023-11 | [GitHub](https://github.com/urchade/GLiNER) |
| [Context-Aware Prompt for Generation-based Event Argument Extraction with Diffusion Models](https://dl.acm.org/doi/10.1145/3583780.3614820) | CIKM | 2023-10 | []() |
| [Contextualized Soft Prompts for Extraction of Event Arguments](https://aclanthology.org/2023.findings-acl.266/) | ACL Findings | 2023-07 | |
| [AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model](https://aclanthology.org/2023.acl-long.615/) | ACL | 2023-07 | [GitHub](https://github.com/PlusLabNLP/AMPERE) |
| [Debiasing Generative Named Entity Recognition by Calibrating Sequence Likelihood](https://aclanthology.org/2023.acl-short.98/) | ACL short | 2023-07 | []() |
| [DICE: Data-Efficient Clinical Event Extraction with Generative Models](https://aclanthology.org/2023.acl-long.886.pdf) | ACL | 2023-07 | [GitHub](https://github.com/derekmma/DICE) |
| [Event Extraction as Question Generation and Answering](https://aclanthology.org/2023.acl-short.143.pdf) | ACL short | 2023-07 | [GitHub](https://github.com/dataminr-ai/Event-Extraction-as-Question-Generation-and-Answering) |
| [NAG-NER: a Unified Non-Autoregressive Generation Framework for Various NER Tasks](https://aclanthology.org/2023.acl-industry.65/) | ACL Industry | 2023-07 |
| [Sequence generation with label augmentation for relation extraction](https://ojs.aaai.org/index.php/AAAI/article/view/26532)| AAAI | 2023-06 | [GitHub](https://github.com/pkuserc/RELA) |
| [Unified Named Entity Recognition as Multi-Label Sequence Generation](https://ieeexplore.ieee.org/abstract/document/10191921) | IJCNN | 2023-06 |
| [InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction](https://arxiv.org/abs/2304.08085) | Arxiv | 2023-04 | [GitHub](https://github.com/BeyonderXX/InstructUIE) |
| [Structured information extraction from complex scientific text with fine-tuned large language models](https://arxiv.org/abs/2212.05238) | Arxiv | 2022-12 | [Demo](http://www.matscholar.com/info-extraction) |
| [Generative Entity Typing with Curriculum Learning](https://arxiv.org/abs/2210.02914) | EMNLP | 2022-12 | [GitHub](https://github.com/siyuyuan/GET) |
| [DORE: Document Ordered Relation Extraction based on Generative Framework](https://aclanthology.org/2022.findings-emnlp.253/) | EMNLP Findings | 2022-12 |
| [LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model](https://openreview.net/pdf?id=a8qX5RG36jd) | NeurIPS | 2022-10 | [GitHub](https://github.com/ChocoWu/LasUIE) |
| [LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting](https://aclanthology.org/2022.coling-1.209/) | COLING | 2022-10 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/ner/few-shot) |
| [GenIE: Generative Information Extraction](https://aclanthology.org/2022.naacl-main.342.pdf) | NAACL | 2022-07 | [GitHub](https://github.com/epfl-dlab/GenIE) |
| [DEGREE: A Data-Efficient Generative Event Extraction Model](https://aclanthology.org/2022.naacl-main.138/) | NAACL | 2022-07 | [GitHub](https://github.com/PlusLabNLP/DEGREE) |
| [ClarET: Pre-training a correlation-aware context-to-event transformer for event-centric generation and classification](https://aclanthology.org/2022.acl-long.183/) | ACL | 2022-05 | [GitHub](https://github.com/yczhou001/ClarET) |
| [DEEPSTRUCT: Pretraining of Language Models for Structure Prediction](https://aclanthology.org/2022.findings-acl.67/) | ACL Findings | 2022-05 | [GitHub](https://github.com/wang-research-lab/deepstruct) |
| [Dynamic prefix-tuning for generative template-based event extraction](https://aclanthology.org/2022.acl-long.358.pdf) | ACL | 2022-05 | []() |
| [Prompt for extraction? PAIE: prompting argument interaction for event argument extraction](https://aclanthology.org/2022.acl-long.466/) | ACL | 2022-05 | [GitHub](https://github.com/mayubo2333/PAIE) |
| [Unified Structure Generation for Universal Information Extraction](https://aclanthology.org/2022.acl-long.395/) | ACL | 2022-05 | [GitHub](https://github.com/yhcc/BARTABSA) |
| [De-bias for generative extraction in unified NER task](https://aclanthology.org/2022.acl-long.59.pdf) | ACL | 2022-05 | []() |
| [Document-level Entity-based Extraction as Template Generation](https://aclanthology.org/2021.emnlp-main.426/) | EMNLP | 2021-11 | [GitHub](https://github.com/PlusLabNLP/TempGen) |
| [REBEL: Relation Extraction By End-to-end Language generation](https://aclanthology.org/2021.findings-emnlp.204/) | EMNLP Findings | 2021-11 | [GitHub](https://github.com/babelscape/rebel) |
| [A Unified Generative Framework for Various NER Subtasks](https://arxiv.org/abs/2106.01223) | ACL | 2021-08 | [GitHub](https://github.com/yhcc/BARTNER) |
| [Template-Based Named Entity Recognition Using BART](https://aclanthology.org/2021.findings-acl.161.pdf) | ACL Findings | 2021-08 | [GitHub](https://github.com/Nealcly/templateNER) |
| [Text2event: Controllable sequence-to- structure generation for end-to-end event extraction](https://arxiv.org/abs/2106.09232) | ACL | 2021-08 | [GitHub](https://github.com/luyaojie/text2event) |
| [Document-level event argument extraction by conditional generation](https://aclanthology.org/2021.naacl-main.69.pdf) | NAACL | 2021-06 | [GitHub](https://github.com/raspberryice/gen-arg) |
| [Structured prediction as translation between augmented natural languages](https://arxiv.org/abs/2101.05779) | ICLR | 2021-01 | [GitHub](https://github.com/amazon-science/tanl) |
## Few-shot
### Few-shot Fine-tuning
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Diluie: constructing diverse demonstrations of in-context learning with large language model for unified information extraction](https://link.springer.com/article/10.1007/s00521-024-09728-5) | Others | 2024-04 | [GitHub](https://github.com/Phevos75/DILUIE) |
| [KeyEE: Enhancing Low-Resource Generative Event Extraction with Auxiliary Keyword Sub-Prompt](https://ieeexplore.ieee.org/abstract/document/10506770) | Others | 2024-04 | [GitHub](https://github.com/OStars/KeyEE) |
| [Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors](https://arxiv.org/abs/2404.17807) | IJCAI | 2024-04 | []() |
| [On-the-fly Definition Augmentation of LLMs for Biomedical NER](https://arxiv.org/abs/2404.00152) | NAACL | 2024-03 | [GitHub](https://github.com/allenai/beacon) |
| [DemoSG: Demonstration-enhanced Schema-guided Generation for Low-resource Event Extraction](https://aclanthology.org/2023.findings-emnlp.121) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/GangZhao98/DemoSG) |
| [One Model for All Domains: Collaborative Domain-Prefx Tuning for Cross-Domain NER](https://arxiv.org/abs/2301.10410) | IJCAI | 2023-09 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/ner/cross) |
| [LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting](https://aclanthology.org/2022.coling-1.209/) | COLING | 2022-10 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/ner/few-shot) |
| [Unified Structure Generation for Universal Information Extraction](https://aclanthology.org/2022.acl-long.395/) | ACL | 2022-05 | [GitHub](https://github.com/yhcc/BARTABSA) |
| [InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER](https://arxiv.org/abs/2203.03903) | Arxiv | 2022-03 | |
| [Template-Based Named Entity Recognition Using BART](https://aclanthology.org/2021.findings-acl.161.pdf) | ACL Findings | 2021-08 | [GitHub](https://github.com/Nealcly/templateNER) |
| [Structured prediction as translation between augmented natural languages](https://arxiv.org/abs/2101.05779) | ICLR | 2021-01 | [GitHub](https://github.com/amazon-science/tanl) |
### In-Context Learning
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction](https://aclanthology.org/2024.findings-acl.760/) | ACL Findings | 2024 | [GitHub](https://github.com/ej0cl6/TextEE) |
| [RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition](https://academic.oup.com/jamia/advance-article/doi/10.1093/jamia/ocae095/7665312) | Others | 2024-05 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) |
| [P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models](https://arxiv.org/abs/2405.04960) | Arxiv | 2024-06 | [GitHub](https://github.com/jiangguochaoGG/P-ICL) |
| [LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking](https://arxiv.org/abs/2404.05624) | Arxiv | 2024-04 | [GitHub](https://github.com/YFR718/LTNER) |
| [Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models](https://arxiv.org/pdf/2404.05587) | Others | 2024-04 | []() |
| [LLMs in Biomedicine: A study on clinical Named Entity Recognition](https://arxiv.org/pdf/2404.07376) | Arxiv | 2024-04 | []() |
| [Out of Sesame Street: A Study of Portuguese Legal Named Entity Recognition Through In-Context Learning](https://www.researchgate.net/profile/Rafael-Nunes-35/publication/379665297_Out_of_Sesame_Street_A_Study_of_Portuguese_Legal_Named_Entity_Recognition_Through_In-Context_Learning/links/6614701839e7641c0ba6879b/Out-of-Sesame-Street-A-Study-of-Portuguese-Legal-Named-Entity-Recognition-Through-In-Context-Learning.pdf) | ResearchGate | 2024-04 | []() |
| [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) |
| [Empirical Analysis of Dialogue Relation Extraction with Large Language Models](https://arxiv.org/abs/2404.17802) | IJCAI | 2024-04 | []() |
| [Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models](https://aclanthology.org/2024.naacl-short.49/) | NAACL Short | 2024 | [GitHub](https://github.com/Emma1066/Self-Improve-Zero-Shot-NER) |
| [ConsistNER: Towards Instructive NER Demonstrations for LLMs with the Consistency of Ontology and Context](https://ojs.aaai.org/index.php/AAAI/article/view/29892)| AAAI | 2024 |
| [On-the-fly Definition Augmentation of LLMs for Biomedical NER](https://arxiv.org/abs/2404.00152) | NAACL | 2024 | [GitHub](https://github.com/allenai/beacon) |
| [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | COLING | 2024 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) |
| [Unlocking Instructive In-Context Learning with Tabular Prompting for Relational Triple Extraction](https://aclanthology.org/2024.lrec-main.1488/) | COLING | 2024 | |
| [CodeKGC: Code Language Model for Generative Knowledge Graph Construction](https://dl.acm.org/doi/abs/10.1145/3641850) | ACM TALLIP | 2024-03 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/CodeKGC) |
| [Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models](https://arxiv.org/abs/2310.11085) | Arxiv | 2024-02 | [GitHub](https://github.com/oezyurty/REPLM) |
| [In-Context Learning for Few-Shot Nested Named Entity Recognition](https://arxiv.org/abs/2402.01182) | Arxiv | 2024-02 | []() |
| [Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study](https://arxiv.org/abs/2402.15663) | EACL | 2024-02 | [GitHub](https://github.com/ZhaoyueSun/phee-with-chatgpt) |
| [Heuristic-Driven Link-of-Analogy Prompting: Enhancing Large Language Models for Document-Level Event Argument Extraction](https://arxiv.org/abs/2311.06555v2) | Arxiv | 2024-02 | []() |
| [LinkNER: Linking Local Named Entity Recognition Models to Large Language Models using Uncertainty](https://arxiv.org/abs/2402.10573) | WWW | 2024 |
| [Small Language Model Is a Good Guide for Large Language Model in Chinese Entity Relation Extraction](https://arxiv.org/abs/2402.14373) | Arxiv | 2024-02 | |
| [C-ICL: Contrastive In-context Learning for Information Extraction](https://arxiv.org/abs/2402.11254) | Arxiv | 2024-02 |
| [Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering](https://arxiv.org/abs/2303.16416v3) | Arxiv | 2024-01 | [GitHub](https://github.com/BIDS-Xu-Lab/Clinical_Entity_Recognition_Using_GPT_models) |
| [Chain of Thought with Explicit Evidence Reasoning for Few-shot Relation Extraction](https://aclanthology.org/2023.findings-emnlp.153/) | EMNLP Findings | 2023-12 | |
| [GPT-RE: In-context Learning for Relation Extraction using Large Language Models](https://arxiv.org/abs/2305.02105) | EMNLP | 2023-12 | [GitHub](https://github.com/YukinoWan/GPT-RE) |
| [Guideline Learning for In-context Information Extraction](https://arxiv.org/abs/2310.05066) | EMNLP | 2023-12 | []() |
| [Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!](https://arxiv.org/abs/2303.08559) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/mayubo2333/LLM-IE) |
| [Retrieval-Augmented Code Generation for Universal Information Extraction](https://arxiv.org/abs/2311.02962) | Arxiv | 2023-11 | []() |
| [Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment](https://arxiv.org/abs/2310.10590) | Arxiv | 2023-10 |
| [GPT-NER: Named Entity Recognition via Large Language Models](https://arxiv.org/abs/2304.10428) | Arxiv | 2023-10 | [GitHub](https://github.com/ShuheWang1998/GPT-NER) |
| [GPT Struct Me: Probing GPT Models on Narrative Entity Extraction](https://ieeexplore.ieee.org/abstract/document/10350182) | WI-IAT | 2023-10 | [GitHub](https://github.com/hmosousa/gpt_struct_me) |
| [Learning In-context Learning for Named Entity Recognition ](https://aclanthology.org/2023.acl-long.764/) | ACL | 2023-07 | [GitHub](https://github.com/chen700564/metaner-icl) |
| [Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors](https://aclanthology.org/2023.findings-acl.50.pdf) | ACL Findings | 2023-07 | [GitHub](https://github.com/OSU-NLP-Group/QA4RE) |
| [Code4Struct: Code Generation for Few-Shot Event Structure Prediction](https://arxiv.org/abs/2210.12810) | ACL | 2023-07 | [GitHub](https://github.com/xingyaoww/code4struct) |
| [CODEIE: Large Code Generation Models are Better Few-Shot Information Extractors](https://arxiv.org/abs/2305.05711) | ACL | 2023-07 | [GitHub](https://github.com/artpli/CodeIE) |
| [How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?](https://arxiv.org/abs/2305.01555) | ACL Workshop | 2023-07 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/UnleashLLMRE) |
| [PromptNER : Prompting For Named Entity Recognition](https://arxiv.org/abs/2305.15444) | Arxiv | 2023-06 | [GitHub](https://github.com/tricktreat/PromptNER) |
| [Unified Text Structuralization with Instruction-tuned Language Models](https://arxiv.org/abs/2303.14956) | Arxiv | 2023-03 | []() |
## Zero-shot
### Zero-shot Prompting
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis](https://aclanthology.org/2024.acl-long.476/) | ACL | 2024 | [GitHub](https://github.com/OceannTwT/era-cot) |
| [Astronomical Knowledge Entity Extraction in Astrophysics Journal Articles via Large Language Models](https://iopscience.iop.org/article/10.1088/1674-4527/ad3d15/meta) | Others | 2024-04 | []() |
| [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) |
| [Empirical Analysis of Dialogue Relation Extraction with Large Language Models](https://arxiv.org/abs/2404.17802) | IJCAI | 2024-04 | []() |
| [Retrieval-Augmented Generation-based Relation Extraction](https://arxiv.org/abs/2404.13397) | Arxiv | 2024-04 | [GitHub](https://github.com/sefeoglu/RAG4RE) |
| [Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations](https://arxiv.org/pdf/2404.05415) | Arxiv | 2024-04 | []() |
| [Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors](https://arxiv.org/abs/2404.17807) | IJCAI | 2024-04 | []() |
| [Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models](https://aclanthology.org/2024.naacl-short.49/) | NAACL Short | 2024 | [GitHub](https://github.com/Emma1066/Self-Improve-Zero-Shot-NER) |
| [CodeKGC: Code Language Model for Generative Knowledge Graph Construction](https://dl.acm.org/doi/abs/10.1145/3641850) | ACM TALLIP | 2024-03 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/CodeKGC) |
| [On-the-fly Definition Augmentation of LLMs for Biomedical NER](https://arxiv.org/abs/2404.00152) | NAACL | 2024-03 | [GitHub](https://github.com/allenai/beacon) |
| [Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study](https://arxiv.org/abs/2402.15663) | EACL | 2024-02 | [GitHub](https://github.com/ZhaoyueSun/phee-with-chatgpt) |
| [A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction](https://arxiv.org/abs/2402.13364) | Arxiv | 2024-02 |
| [Small Language Model Is a Good Guide for Large Language Model in Chinese Entity Relation Extraction](https://arxiv.org/abs/2402.14373) | Arxiv | 2024-02 | |
| [Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering](https://arxiv.org/abs/2303.16416v3) | Arxiv | 2024-01 | [GitHub](https://github.com/BIDS-Xu-Lab/Clinical_Entity_Recognition_Using_GPT_models) |
| [Improving Unsupervised Relation Extraction by Augmenting Diverse Sentence Pairs](https://arxiv.org/abs/2312.00552) | EMNLP | 2023-12 | [GitHub](https://github.com/qingwang-isu/AugURE) |
| [Prompt-NER: Zero-shot Named Entity Recognition in Astronomy Literature via Large Language Models](https://arxiv.org/abs/2310.17892) | Arxiv | 2023-10 | |
| [Revisiting Large Language Models as Zero-shot Relation Extractors](https://arxiv.org/abs/2310.05028) | EMNLP Findings | 2023-10 | []() |
| [Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors](https://aclanthology.org/2023.findings-acl.50.pdf) | ACL Findings | 2023-07 | [GitHub](https://github.com/OSU-NLP-Group/QA4RE) |
| [Code4Struct: Code Generation for Few-Shot Event Structure Prediction](https://arxiv.org/abs/2210.12810) | ACL | 2023-07 | [GitHub](https://github.com/xingyaoww/code4struct) |
| [A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction](https://openreview.net/pdf?id=3xGnOrUqt1) | NeurIPS Workshop | 2023-10 | []() |
| [Global Constraints with Prompting for Zero-Shot Event Argument Classification](https://aclanthology.org/2023.findings-eacl.191/) | EACL Findings | 2023-05 | |
| [Zero-Shot Information Extraction via Chatting with ChatGPT](https://arxiv.org/abs/2302.10205) | Arxiv | 2023-02 | [GitHub](https://github.com/cocacola-lab/ChatIE) |
### Cross-Domain Learning
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction](https://aclanthology.org/2024.acl-long.475/) | ACL | 2024 | [GitHub](https://ict-goknow.github.io/knowcoder/) |
| [VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models](https://aclanthology.org/2024.acl-long.134/) | ACL | 2024 | [GitHub](https://github.com/emseoyk/VerifiNER) |
| [Rethinking Negative Instances for Generative Named Entity Recognition](https://aclanthology.org/2024.findings-acl.206/) | ACL Findings | 2024 | [GitHub](https://github.com/yyDing1/GNER) |
| [IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus](https://aclanthology.org/2024.acl-short.13/) | ACL Short | 2024 | [GitHub](https://github.com/zjunlp/IEPile?tab=readme-ov-file) |
| [Diluie: constructing diverse demonstrations of in-context learning with large language model for unified information extraction](https://link.springer.com/article/10.1007/s00521-024-09728-5) | Others | 2024-04 | [GitHub](https://github.com/Phevos75/DILUIE) |
| [Advancing Entity Recognition in Biomedicine via Instruction Tuning of Large Language Models]() | Bioinformatics | 2024-03 | [GitHub](https://github.com/BIDS-Xu-Lab/BioNER-LLaMA) |
| [ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models]() | COLING | 2024 | |
| [ULTRA: Unleash LLMs' Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Refinement](https://arxiv.org/abs/2401.13218) | Arxiv | 2024-01 | []() |
| [YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information Extraction](https://arxiv.org/abs/2312.15548) | Arxiv | 2024-04 |
| [GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction](https://openreview.net/pdf?id=Y3wpuxd7u9) | ICLR | 2024 | [GitHub](https://github.com/hitz-zentroa/GoLLIE) |
| [UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition](https://openreview.net/pdf?id=r65xfUb76p) | ICLR | 2024 | [GitHub](https://github.com/universal-ner/universal-ner) |
| [InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction](https://arxiv.org/abs/2304.08085) | Arxiv | 2023-04 | [GitHub](https://github.com/BeyonderXX/InstructUIE) |
| [DEEPSTRUCT: Pretraining of Language Models for Structure Prediction](https://aclanthology.org/2022.findings-acl.67/) | ACL Findings | 2022-05 | [GitHub](https://github.com/wang-research-lab/deepstruct) |
| [Multilingual generative language models for zero-shot cross-lingual event argument extraction](https://aclanthology.org/2022.acl-long.317.pdf) | ACL | 2022-05 | [GitHub](https://github.com/PlusLabNLP/X-Gear) |
### Cross-Type Learning
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Document-level event argument extraction by conditional generation](https://aclanthology.org/2021.naacl-main.69.pdf) | NAACL | 2021-06 | [GitHub](https://github.com/raspberryice/gen-arg) |
## Data Augmentation
### Data Annotation
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?](https://arxiv.org/abs/2405.02602) | Arxiv | 2024-05 | []() |
| [MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks](https://arxiv.org/abs/2404.00457) | Arxiv | 2024-03 | [GitHub](https://github.com/KomeijiForce/MetaIE) |
| [Augmenting NER Datasets with LLMs: Towards Automated and Refined Annotation](https://arxiv.org/abs/2404.01334) | Arxiv | 2024-03 | []() |
| [NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data](https://arxiv.org/abs/2402.15343) | Arxiv | 2024-02 |
| [Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study](https://arxiv.org/abs/2402.15663) | EACL | 2024-02 | [GitHub](https://github.com/ZhaoyueSun/phee-with-chatgpt) |
| [LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition](https://arxiv.org/abs/2402.14568) | Arxiv | 2024-02 | []() |
| [LLMaAA: Making Large Language Models as Active Annotators](https://arxiv.org/abs/2310.19596) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/ridiculouz/LLMAAA) |
| [Improving Unsupervised Relation Extraction by Augmenting Diverse Sentence Pairs](https://arxiv.org/abs/2312.00552) | EMNLP | 2023-12 | [GitHub](https://github.com/qingwang-isu/AugURE) |
| [Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models](https://aclanthology.org/2023.emnlp-main.334.pdf) | EMNLP | 2023-12 | [GitHub](https://github.com/bigai-nlco/DocGNRE) |
| [How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?](https://arxiv.org/abs/2305.01555) | ACL Workshop | 2023-07 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm/UnleashLLMRE) |
| [Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction](https://aclanthology.org/2023.bionlp-1.15/) | bioNLP Workshop | 2023-07 | [GitHub](https://github.com/arkhn/bio-nlp2023) |
| [Does Synthetic Data Generation of LLMs Help Clinical Text Mining?](https://arxiv.org/abs/2303.04360) | Arxiv | 2023-04 | []() |
| [Unleash GPT-2 Power for Event Detection](https://aclanthology.org/2021.acl-long.490.pdf) | ACL | 2021-08 | []() |
### Knowledge Retrieval
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition](https://aclanthology.org/2024.findings-acl.76/) | ACL Findings | 2024 | [GitHub](https://github.com/JinYuanLi0012/RiVEG) |
| [Consistency Guided Knowledge Retrieval and Denoising in LLMs for Zero-shot Document-level Relation Triplet Extraction](https://arxiv.org/abs/2401.13598) | WWW | 2024 |
| [Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset](https://arxiv.org/abs/2310.10118) | EMNLP | 2023-12 | [GitHub](https://github.com/CompNet/conivel/tree/gen) |
| [Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge](https://arxiv.org/abs/2305.12212) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/JinYuanLi0012/PGIM) |
| [Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction](https://arxiv.org/abs/2306.14122) | Arxiv | 2023-08 | []() |
### Inverse Generation
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Distilling Named Entity Recognition Models for Endangered Species from Large Language Models](https://arxiv.org/abs/2403.15430) | Arxiv | 2024-03 | []() |
| [Improving Event Definition Following For Zero-Shot Event Detection](https://arxiv.org/abs/2403.02586) | Arxiv | 2024-03 | []() |
| [ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models](https://aclanthology.org/2024.findings-acl.947/) | ACL Findings | 2024 | [GitHub](https://github.com/StefanHeng/ProgGen) |
| [Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction](https://arxiv.org/abs/2402.11142) | Arxiv | 2024-02 |
| [Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction](https://arxiv.org/abs/2303.04132) | EMNLP | 2023-12 | [GitHub](https://github.com/epfl-dlab/SynthIE) |
| [Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks](https://aclanthology.org/2023.findings-acl.578/) | ACL Findings | 2023-07 | []() |
| [Event Extraction as Question Generation and Answering](https://aclanthology.org/2023.acl-short.143.pdf) | ACL Short | 2023-07 | [GitHub](https://github.com/dataminr-ai/Event-Extraction-as-Question-Generation-and-Answering) |
| [STAR: Boosting Low-Resource Event Extraction by Structure-to-Text Data Generation with Large Language Models](https://ojs.aaai.org/index.php/AAAI/article/view/29839) | AAAI | 2024-03 | []() |
### Synthetic Datasets for Instruction-tuning
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Rethinking Negative Instances for Generative Named Entity Recognition](https://aclanthology.org/2024.findings-acl.206/) | ACL Findings | 2024 | [GitHub](https://github.com/yyDing1/GNER) |
| [UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition](https://openreview.net/pdf?id=r65xfUb76p) | ICLR | 2024-01 | [GitHub](https://github.com/universal-ner/universal-ner) |
| [GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer](https://arxiv.org/abs/2311.08526) | Arxiv | 2023-11 | [GitHub](https://github.com/urchade/GLiNER) |
| [Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction](https://arxiv.org/abs/2306.14122) | Arxiv | 2023-08 | |
## Prompts Design
### Question Answer
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [Knowledge-Enriched Prompt for Low-Resource Named Entity Recognition](https://dl.acm.org/doi/abs/10.1145/3659948) | TALLIP | 2024-04 | []() |
| [Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models](https://arxiv.org/pdf/2404.05587) | Others | 2024-04 | []() |
| [Revisiting Large Language Models as Zero-shot Relation Extractors](https://arxiv.org/abs/2310.05028) | EMNLP Findings | 2023-12 | |
| [Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors](https://aclanthology.org/2023.findings-acl.50.pdf) | ACL Findings | 2023-07 | [GitHub](https://github.com/OSU-NLP-Group/QA4RE) |
| [Zero-Shot Information Extraction via Chatting with ChatGPT](https://arxiv.org/abs/2302.10205) | Arxiv | 2023-02 | [GitHub](https://github.com/cocacola-lab/ChatIE) |
### Chain of Thought
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition](https://academic.oup.com/jamia/advance-article/doi/10.1093/jamia/ocae095/7665312) | Others | 2024-05 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) |
| [Inspire the Large Language Model by External Knowledge on BioMedical Named Entity Recognition](https://arxiv.org/abs/2309.12278) | Arxiv | 2023-09 | |
| [Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction](https://arxiv.org/abs/2306.14122) | Arxiv | 2023-08 | |
| [Revisiting Relation Extraction in the era of Large Language Models](https://aclanthology.org/2023.acl-long.868.pdf) | ACL | 2023-07 | [GitHub](https://sominw.com/ACL23LLMs) |
| [Zero-shot Temporal Relation Extraction with ChatGPT](https://aclanthology.org/2023.bionlp-1.7/) | BioNLP | 2023-07 | |
| [PromptNER : Prompting For Named Entity Recognition](https://arxiv.org/abs/2305.15444) | Arxiv | 2023-06 | |
### Self-Improvement
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models](https://aclanthology.org/2024.findings-acl.947/) | ACL Findings | 2024 | [GitHub](https://github.com/StefanHeng/ProgGen) |
| [ULTRA: Unleash LLMs' Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Refinement](https://arxiv.org/abs/2401.13218) | Arxiv | 2024-01 | []() |
| [Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models](https://aclanthology.org/2024.naacl-short.49/) | NAACL Short | 2024 | [GitHub](https://github.com/Emma1066/Self-Improve-Zero-Shot-NER) |
## Constrained Decoding Generation
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction](https://ojs.aaai.org/index.php/AAAI/article/view/29919) | AAAI | 2024-03 | [GitHub](https://github.com/urchade/ATG) |
| [Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning](https://aclanthology.org/2023.emnlp-main.674/) | EMNLP | 2024-01 | [GitHub](https://github.com/epfl-dlab/GCD) |
| [DORE: Document Ordered Relation Extraction based on Generative Framework](https://aclanthology.org/2022.findings-emnlp.253/) | EMNLP Findings | 2022-12 |
| [Autoregressive Structured Prediction with Language Models](https://aclanthology.org/2022.findings-emnlp.70/) | EMNLP Findings | 2022-12 | [GitHub](https://github.com/lyutyuh/ASP) |
| [Unified Structure Generation for Universal Information Extraction](https://aclanthology.org/2022.acl-long.395/) | ACL | 2022-05 | [GitHub](https://github.com/yhcc/BARTABSA) |
# Specific Domain
| Paper | Domain | Venue | Date | Code |
| :----- | :--------------: | :-------: | :---------: |:---------: |
| [Granular Entity Mapper: Advancing Fine-grained Multimodal Named Entity Recognition and Grounding](https://aclanthology.org/2024.findings-emnlp.183/) | Multimodal |EMNLP Findings | 2024 | |
| [LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition](https://aclanthology.org/2024.findings-acl.76/) | Multimodal | ACL Findings | 2024 | [GitHub](https://github.com/JinYuanLi0012/RiVEG) |
| [RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition](https://academic.oup.com/jamia/advance-article/doi/10.1093/jamia/ocae095/7665312) | Medical | Others | 2024-05 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) |
| [Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?](https://arxiv.org/abs/2405.02602) | Astronomy | Arxiv | 2024-05 | []() |
| [Astronomical Knowledge Entity Extraction in Astrophysics Journal Articles via Large Language Models](https://iopscience.iop.org/article/10.1088/1674-4527/ad3d15/meta) | Astronomy | Others | 2024-04 | []() |
| [VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity Recognition](https://arxiv.org/abs/2404.17835) | Biomedical | Arxiv | 2024-04 | [GitHub](https://github.com/Eulring/VANER) |
| [LLMs in Biomedicine: A study on clinical Named Entity Recognition](https://arxiv.org/pdf/2404.07376) | Biomedical | Arxiv | 2024-04 | []() |
| [Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models](https://arxiv.org/pdf/2404.05587) | Software | Others | 2024-04 | []() |
| [Out of Sesame Street: A Study of Portuguese Legal Named Entity Recognition Through In-Context Learning](https://www.researchgate.net/profile/Rafael-Nunes-35/publication/379665297_Out_of_Sesame_Street_A_Study_of_Portuguese_Legal_Named_Entity_Recognition_Through_In-Context_Learning/links/6614701839e7641c0ba6879b/Out-of-Sesame-Street-A-Study-of-Portuguese-Legal-Named-Entity-Recognition-Through-In-Context-Learning.pdf) | Legal | ResearchGate | 2024-04 | []() |
| [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Scientific | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) |
| [Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations](https://arxiv.org/pdf/2404.05415) | Acupuncture Point | Arxiv | 2024-04 | []() |
| [Advancing Entity Recognition in Biomedicine via Instruction Tuning of Large Language Models]() | Biomedical | Bioinformatics | 2024-03 | [GitHub](https://github.com/BIDS-Xu-Lab/BioNER-LLaMA) |
| [Distilling Named Entity Recognition Models for Endangered Species from Large Language Models](https://arxiv.org/abs/2403.15430) | Endangered Species | Arxiv | 2024-03 | []() |
| [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | Historical | COLING | 2024-03 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) |
| [On-the-fly Definition Augmentation of LLMs for Biomedical NER](https://arxiv.org/abs/2404.00152) | Biomedical | NAACL | 2024-03 | [GitHub](https://github.com/allenai/beacon) |
| [Improving LLM-Based Health Information Extraction with In-Context Learning](https://link.springer.com/chapter/10.1007/978-981-97-1717-0_4)| Health | Others | 2024-03 |
| [Structured information extraction from scientific text with large language models](https://www.nature.com/articles/s41467-024-45563-x) | Scientific | Nat. Commun. | 2024-02 | [GitHub](https://github.com/LBNLP/NERRE) |
| [Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study](https://arxiv.org/abs/2402.15663) | Pharmacovigilance | EACL | 2024-02 | [GitHub](https://github.com/ZhaoyueSun/phee-with-chatgpt) |
| [Structured information extraction from scientific text with large language models](https://www.nature.com/articles/s41467-024-45563-x) | Scientific | Nat. Commun. | 2024-02 | [GitHub](https://github.com/lbnlp/nerre-llama) |
| [Combining prompt‑based language models and weak supervision for labeling named entity recognition on legal documents](https://link.springer.com/article/10.1007/s10506-023-09388-1) | Legal | Others | 2024-02 |
| [Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering](https://arxiv.org/abs/2303.16416v3) | Clinical | Arxiv | 2024-01 | [GitHub](https://github.com/BIDS-Xu-Lab/Clinical_Entity_Recognition_Using_GPT_models) |
| [Impact of Sample Selection on In-Context Learning for Entity Extraction from Scientific Writing](https://aclanthology.org/2023.findings-emnlp.338/)| Scientific | EMNLP Findings | 2023-12 | [GitHub](https://github.com/adalin16/ICL_EE)|
| [Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge](https://arxiv.org/abs/2305.12212) | Multimodal | ENMLP Findings | 2023-12 | [GitHub](https://github.com/JinYuanLi0012/PGIM) |
| [In-context Learning for Few-shot Multimodal Named Entity Recognition](https://aclanthology.org/2023.findings-emnlp.196/) | Multimodal | ENMLP Findings | 2023-12 | |
| [PolyIE: A Dataset of Information Extraction from Polymer Material Scientific Literature](https://arxiv.org/abs/2311.07715) | Polymer Material | Arxiv | 2023-11 | [GitHub](https://github.com/jerry3027/PolyIE) |
| [Prompt-NER: Zero-shot Named Entity Recognition in Astronomy Literature via Large Language Models](https://arxiv.org/abs/2310.17892) | Astronomical | Arxiv | 2023-10 | |
| [Inspire the Large Language Model by External Knowledge on BioMedical Named Entity Recognition](https://arxiv.org/abs/2309.12278) | Biomedical | Arxiv | 2023-09 | |
| [Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction](https://arxiv.org/abs/2306.14122) | Multimodal | Arxiv | 2023-08 | |
| [DICE: Data-Efficient Clinical Event Extraction with Generative Models](https://aclanthology.org/2023.acl-long.886.pdf) | Clinical | ACL | 2023-07 | [GitHub](https://github.com/derekmma/DICE) |
| [How far is Language Model from 100% Few-shot Named Entity Recognition in Medical Domain](https://arxiv.org/abs/2307.00186) | Medical | Arxiv | 2023-07 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) |
| [Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction](https://aclanthology.org/2023.bionlp-1.15/) | Multilingual / Clinical | BioNLP | 2023-07 | [GitHub](https://github.com/arkhn/bio-nlp2023) |
| [Does Synthetic Data Generation of LLMs Help Clinical Text Mining?](https://arxiv.org/abs/2303.04360) | Clinical | Arxiv | 2023-04 | |
| [Yes but.. Can ChatGPT Identify Entities in Historical Documents](https://arxiv.org/abs/2303.17322) | Historical | JCDL | 2023-03 | |
| [Zero-shot Clinical Entity Recognition using ChatGPT](https://arxiv.org/abs/2303.16416) | Clinical | Arxiv | 2023-03 | |
| [Structured information extraction from complex scientific text with fine-tuned large language models](https://arxiv.org/abs/2212.05238) | Scientific | Arxiv | 2022-12 | [Demo](http://www.matscholar.com/info-extraction) |
| [Multilingual generative language models for zero-shot cross-lingual event argument extraction](https://aclanthology.org/2022.acl-long.317.pdf) | Multilingual | ACL | 2022-05 | [GitHub](https://github.com/PlusLabNLP/X-Gear) |
# Evaluation and Analysis
| Paper | Venue | Date | Code |
| :----- | :--------------: | :------- | :---------: |
| [TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction](https://aclanthology.org/2024.findings-acl.760/) | ACL Findings | 2024 | [GitHub](https://github.com/ej0cl6/TextEE) |
| [IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus](https://aclanthology.org/2024.acl-short.13/) | ACL Short | 2024 | [GitHub](https://github.com/zjunlp/IEPile?tab=readme-ov-file) |
| [CHisIEC: An Information Extraction Corpus for Ancient Chinese History](https://aclanthology.org/2024.lrec-main.283/) | COLING | 2024 | [GitHub](https://github.com/tangxuemei1995/CHisIEC) |
| [GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models](https://aclanthology.org/2024.naacl-long.155/) | NAACL | 2024 | [GitHub](https://github.com/pat-jj/GenRES) |
| [Empirical Analysis of Dialogue Relation Extraction with Large Language Models](https://arxiv.org/abs/2404.17802) | IJCAI | 2024 | []() |
| [Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?](https://arxiv.org/abs/2405.02602) | Arxiv | 2024-05 | []() |
| [Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations](https://arxiv.org/pdf/2404.05415) | Arxiv | 2024-04 | []() |
| [Mining experimental data from Materials Science literature with Large Language Models: an evaluation study](https://arxiv.org/abs/2401.11052) | Arxiv | 2024-04 | [GitHub](https://github.com/lfoppiano/MatSci-LumEn) |
| [Distilling Named Entity Recognition Models for Endangered Species from Large Language Models](https://arxiv.org/abs/2403.15430) | Arxiv | 2024-03 | []() |
| [LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities](https://arxiv.org/abs/2305.13168) | Arxiv | 2024-02 | [GitHub](https://github.com/zjunlp/AutoKG)|
| [Few shot clinical entity recognition in three languages: Masked language models outperform LLM prompting](https://arxiv.org/abs/2402.12801) | Arxiv | 2024-02 |
| [Information Extraction from Legal Wills: How Well Does GPT-4 Do?](https://aclanthology.org/2023.findings-emnlp.287/) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/ml4ai/ie4wills/) |
| [Information Extraction in Low-Resource Scenarios: Survey and Perspective](https://arxiv.org/abs/2202.08063)| Arxiv | 2023-12 | [GitHub](https://github.com/zjunlp/Low-resource-KEPapers) |
| [Empirical Study of Zero-Shot NER with ChatGPT](https://arxiv.org/abs/2310.10035) | EMNLP | 2023-12 | [GitHub](https://github.com/Emma1066/Zero-Shot-NER-with-ChatGPT) |
| [NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval](https://arxiv.org/abs/2310.14282) | EMNLP Findings | 2023-12 | [GitHub](https://github.com/katzurik/NERetrieve) |
| [Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction](https://aclanthology.org/2023.emnlp-main.360/) | EMNLP | 2023-12 | [GitHub](https://github.com/qijimrc/ROBUST) |
| [PolyIE: A Dataset of Information Extraction from Polymer Material Scientific Literature](https://arxiv.org/abs/2311.07715) | Arxiv | 2023-11 | [GitHub](https://github.com/jerry3027/PolyIE) |
| [XNLP: An Interactive Demonstration System for Universal Structured NLP](https://arxiv.org/abs/2308.01846) | Arxiv | 2023-08 | [Demo](http://xnlp.haofei.vip/) |
| [A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks](https://arxiv.org/abs/2307.12114) | Arxiv | 2023-07 | |
| [How far is Language Model from 100% Few-shot Named Entity Recognition in Medical Domain](https://arxiv.org/abs/2307.00186) | Arxiv | 2023-07 | [GitHub](https://github.com/ToneLi/RT-Retrieving-and-Thinking) |
| [Revisiting Relation Extraction in the era of Large Language Models](https://aclanthology.org/2023.acl-long.868.pdf) | ACL | 2023-07 | [GitHub](https://sominw.com/ACL23LLMs) |
| [Zero-shot Temporal Relation Extraction with ChatGPT](https://aclanthology.org/2023.bionlp-1.7/) | BioNLP | 2023-07 | |
| [InstructIE: A Chinese Instruction-based Information Extraction Dataset](https://arxiv.org/abs/2305.11527) | Arxiv | 2023-05 | [GitHub](https://github.com/zjunlp/DeepKE/tree/main/example/llm) |
| [Is Information Extraction Solved by ChatGPT? An Analysis of Performance, Evaluation Criteria, Robustness and Errors](https://arxiv.org/abs/2305.14450) | Arxiv | 2023-05 | [GitHub](https://github.com/FreedomIntelligence/Evaluation-of-ChatGPT-on-Information-Extraction) |
| [Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness](https://arxiv.org/abs/2304.11633) | Arxiv | 2023-04 | [GitHub](https://github.com/pkuserc/ChatGPT_for_IE) |
| [Exploring the Feasibility of ChatGPT for Event Extraction](https://arxiv.org/abs/2303.03836) | Arxiv | 2023-03 | |
| [Yes but.. Can ChatGPT Identify Entities in Historical Documents](https://arxiv.org/abs/2303.17322) | JCDL | 2023-03 | |
| [Zero-shot Clinical Entity Recognition using ChatGPT](https://arxiv.org/abs/2303.16416) | Arxiv | 2023-03 | |
| [Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again](https://aclanthology.org/2022.findings-emnlp.329/) | EMNLP Findings | 2022-12 | [GitHub](https://github.com/dki-lab/few-shot-bioIE) |
| [Large Language Models are Few-Shot Clinical Information Extractors](https://arxiv.org/abs/2205.12689) | EMNLP | 2022-12 | [Huggingface](https://huggingface.co/datasets/mitclinicalml/clinical-ie) |
# Project and Toolkit
| Paper | Type | Venue | Date | Link |
| :------ | :------: | :------: | :------: | :------: |
| ONEKE | Project | - | - | [Link](http://oneke.openkg.cn/) |
| [TechGPT-2.0: A Large Language Model Project to Solve the Task of Knowledge Graph Construction](https://arxiv.org/abs/2401.04507) | Project | Arxiv | 2024-01 | [Link](https://github.com/neukg/TechGPT-2.0) |
| [CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction](https://arxiv.org/abs/2307.00769) | Toolkit | Arxiv | 2023-07 | [Link](https://github.com/cocacola-lab/CollabKG) |
# Recently Updated Papers
## 2024/09/04
| Paper | Venue | Date | Code |
| :------ | :------: | :------: | :------: |
| [Timeline-based Sentence Decomposition with In-Context Learning for Temporal Fact Extraction](https://aclanthology.org/2024.acl-long.187/) | ACL | 2024-08 | [GitHub](https://github.com/JianhaoChen-nju/TSDRE) |
| [Epidemic Information Extraction for Event-Based Surveillance using Large Language Models](https://link.springer.com/chapter/10.1007/978-981-97-4581-4_17) | ICICT | 2024-08 | |
| [SpeechEE: A Novel Benchmark for Speech Event Extraction](https://arxiv.org/abs/2408.09462) | ACM MM | 2024-08 | [GitHub](https://speechee.github.io/) |
| [HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction](https://arxiv.org/abs/2408.04948)| Arxiv | 2024-08 | |
| [Knowledge AI: Fine-tuning NLP Models for Facilitating Scientific Knowledge Extraction and Understanding](https://arxiv.org/abs/2408.04651)| Arxiv | 2024-08 | |
| [Target Prompting for Information Extraction with Vision Language Model](https://arxiv.org/abs/2408.03834)| Arxiv | 2024-08 | |
| [Evaluating Named Entity Recognition Using Few-Shot Prompting with Large Language Models](https://arxiv.org/abs/2408.15796) | Arxiv | 2024-08 | [GitHub](https://github.com/GEODE-project/ner-llm)|
| [Utilizing Large Language Models for Named Entity Recognition in Traditional Chinese Medicine against COVID-19 Literature: Comparative Study](https://arxiv.org/abs/2408.13501) | Arxiv | 2024-08 | |
| [CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition](https://arxiv.org/abs/2408.12834) | ECAI | 2024-08 | |
| [LLMs are not Zero-Shot Reasoners for Biomedical Information Extraction](https://arxiv.org/abs/2408.12249) | Arxiv | 2024-08 | |
| [Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition](https://www.arxiv.org/abs/2407.17344) | Arxiv | 2024-07 | |
| [MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models](https://arxiv.org/abs/2407.10953) | Arxiv | 2024-08 | [GitHub](https://ganchengguang.github.io/MRE/) |
| [FsPONER: Few-shot Prompt Optimization for Named Entity Recognition in Domain-specific Scenarios](https://arxiv.org/abs/2407.08035) | ECAI | 2024-07 | [GitHub](https://github.com/markustyj/FsPONER_ECAI2024) |
| [Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters](https://arxiv.org/abs/2407.01406) | KaLLM workshop | 2024-07 | [GitHub](https://github.com/d-gurgurov/Injecting-Commonsense-Knowledge-into-LLMs) |
| [Show Less, Instruct More: Enriching Prompts with Definitions and Guidelines for Zero-Shot NER](https://arxiv.org/abs/2407.01272) | Arxiv | 2024-07 | |
| [Large Language Models Struggle in Token-Level Clinical Named Entity Recognition](https://arxiv.org/abs/2407.00731) | AMIA | 2024-08 | |
| [GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks]() | Arxiv | 2024-08 | |
| [Retrieval Augmented Instruction Tuning for Open NER with Large Language Models](https://arxiv.org/abs/2406.17305) | Arxiv | 2024-06 | [GitHub](https://github.com/Emma1066/Retrieval-Augmented-IT-OpenNER) |
| [Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition](https://arxiv.org/abs/2406.11192) | Arxiv | 2024-06 | [GitHub](https://github.com/UmeanNever/B2NER) |
| [Fighting Against the Repetitive Training and Sample Dependency Problem in Few-shot Named Entity Recognition](https://ieeexplore.ieee.org/abstract/document/10463035) | IEEE Access | 2024-06 | [GitHub](https://github.com/changtianluckyforever/SMCS_project) |
| [llmNER: (Zero\|Few)-Shot Named Entity Recognition, Exploiting the Power of Large Language Models](https://arxiv.org/abs/2406.04528) | Arxiv | 2024-06 | [GitHub](https://github.com/plncmm/llmner) |
| [Assessing the Performance of Chinese Open Source Large Language Models in Information Extraction Tasks](https://arxiv.org/abs/2406.02079) | Arxiv | 2024-06 | |
# Datasets
\* denotes the dataset is multimodal. # refers to the number of categories or sentences.
| Task | Dataset | Domain | #Class | #Train | #Val | #Test | Link |
|---|---|---|---|---|---|---|---|
| NER | ACE04 | News | 7 | 6202 | 745 | 812 | Link |
| ACE05 | News | 7 | 7299 | 971 | 1060 | Link | |
| BC5CDR | Biomedical | 2 | 4560 | 4581 | 4797 | Link | |
| Broad Twitter Corpus | Social Media | 3 | 6338 | 1001 | 2000 | Link | |
| CADEC | Biomedical | 1 | 5340 | 1097 | 1160 | Link | |
| CoNLL03 | News | 4 | 14041 | 3250 | 3453 | Link | |
| CoNLLpp | News | 4 | 14041 | 3250 | 3453 | Link | |
| CrossNER-AI | Artificial Intelligence | 14 | 100 | 350 | 431 | Link | |
| CrossNER-Literature | Literary | 12 | 100 | 400 | 416 | ||
| CrossNER-Music | Musical | 13 | 100 | 380 | 465 | ||
| CrossNER-Politics | Political | 9 | 199 | 540 | 650 | ||
| CrossNER-Science | Scientific | 17 | 200 | 450 | 543 | ||
| FabNER | Scientific | 12 | 9435 | 2182 | 2064 | Link | |
| Few-NERD | General | 66 | 131767 | 18824 | 37468 | Link | |
| FindVehicle | Traffic | 21 | 21565 | 20777 | 20777 | Link | |
| GENIA | Biomedical | 5 | 15023 | 1669 | 1854 | Link | |
| HarveyNER | Social Media | 4 | 3967 | 1301 | 1303 | Link | |
| MIT-Movie | Social Media | 12 | 9774 | 2442 | 2442 | Link | |
| MIT-Restaurant | Social Media | 8 | 7659 | 1520 | 1520 | Link | |
| MultiNERD | Wikipedia | 16 | 134144 | 10000 | 10000 | Link | |
| NCBI | Biomedical | 4 | 5432 | 923 | 940 | Link | |
| OntoNotes 5.0 | General | 18 | 59924 | 8528 | 8262 | Link | |
| ShARe13 | Biomedical | 1 | 8508 | 12050 | 9009 | Link | |
| ShARe14 | Biomedical | 1 | 17404 | 1360 | 15850 | Link | |
| SNAP* | Social Media | 4 | 4290 | 1432 | 1459 | Link | |
| Temporal Twitter Corpus (TTC) | Social Meida | 3 | 10000 | 500 | 1500 | Link | |
| Tweebank-NER | Social Media | 4 | 1639 | 710 | 1201 | Link | |
| Twitter2015* | Social Media | 4 | 4000 | 1000 | 3357 | Link | |
| Twitter2017* | Social Media | 4 | 3373 | 723 | 723 | Link | |
| TwitterNER7 | Social Media | 7 | 7111 | 886 | 576 | Link | |
| WikiDiverse* | News | 13 | 6312 | 755 | 757 | Link | |
| WNUT2017 | Social Media | 6 | 3394 | 1009 | 1287 | Link | |
| RE | ACE05 | News | 7 | 10051 | 2420 | 2050 | Link |
| ADE | Biomedical | 1 | 3417 | 427 | 428 | Link | |
| CoNLL04 | News | 5 | 922 | 231 | 288 | Link | |
| DocRED | Wikipedia | 96 | 3008 | 300 | 700 | Link | |
| MNRE* | Social Media | 23 | 12247 | 1624 | 1614 | Link | |
| NYT | News | 24 | 56196 | 5000 | 5000 | Link | |
| Re-TACRED | News | 40 | 58465 | 19584 | 13418 | Link | |
| SciERC | Scientific | 7 | 1366 | 187 | 397 | Link | |
| SemEval2010 | General | 19 | 6507 | 1493 | 2717 | Link | |
| TACRED | News | 42 | 68124 | 22631 | 15509 | Link | |
| TACREV | News | 42 | 68124 | 22631 | 15509 | Link | |
| EE | ACE05 | News | 33/22 | 17172 | 923 | 832 | Link |
| CASIE | Cybersecurity | 5/26 | 11189 | 1778 | 3208 | Link | |
| GENIA11 | Biomedical | 9/11 | 8730 | 1091 | 1092 | Link | |
| GENIA13 | Biomedical | 13/7 | 4000 | 500 | 500 | Link | |
| PHEE | Biomedical | 2/16 | 2898 | 961 | 968 | Link | |
| RAMS | News | 139/65 | 7329 | 924 | 871 | Link | |
| WikiEvents | Wikipedia | 50/59 | 5262 | 378 | 492 | Link |