GithubHelp home page GithubHelp logo

xianghuisun / some-ner-models Goto Github PK

View Code? Open in Web Editor NEW
0.0 2.0 0.0 16.62 MB

简要总结下依存分析(Dependency Parsing)的一些数据集和相关工作

Python 82.17% Jupyter Notebook 17.12% Shell 0.71%

some-ner-models's Introduction

DP+NER-Relatedwork-BrieflySummarize

简要总结下依存分析和实体识别的一些数据集以及相关工作

all_models文件夹下包含所有下列所有提及的模型

数据集

数据集 地址 来源 备注
PTB-3.3.0 PTB-3.3.0 链接
ontonotes englishChinese 链接 训练、验证、测试的划分以Pradhan et al.为标准,因此可以认为是有专用的训练集和测试集的
其中训练集:59924、测试集:8262、验证集:8528
GENIA GENIA 链接 没有专用的训练集和测试集
CoNLL03 CoNLL03 链接1,链接2 训练集:14041、测试集:3453、验证集:3250
实体类型有四个:PER、LOC、ORG、MISC
BioNLP
NCBI
BC5CDR
医疗数据集 链接 这类数据集大多用于Cross-domain和Multi-task NER
W-NUT 官网 链接 社交媒体类型的NER数据

各个数据集上论文给出的实验结果(取最高得分)

模型分类这一列将所有模型分为3类:

模型分类 说明
DP+NER 这一类的模型既用了DP标签,也用了NER标签
NER 这一类的模型仅仅用了NER标签
Multi-task 这一类的模型仅仅使用NER标签,额外的任务采用不需要标签的无监督任务

Ontonotes 5.0

Model 模型分类 P R F1
A Joint Model for Named Entity Recognition With Sentence-Level Entity Type Attentions Multi-task 89.85 89.22 89.53
Dependency-Guided LSTM-CRF Model for Named Entity Recognition DP+NER 88.59 90.17 89.88
Better Feature Integration for Named Entity Recognition DP+NER 90.14 91.58 90.85
Named Entity Recognition as Dependency Parsing NER 91.1 91.5 91.3

CoNLL03

Model 模型分类 P R F1
Semi-supervised Multitask Learning for Sequence Labeling Multi-task - - 86.26
Empower Sequence Labeling with Task-Aware Neural Language Model Multi-task - - 91.85
Dependency-Guided LSTM-CRF Model for Named Entity Recognition DP+NER 92.2 92.5 92.4
A Joint Model for Named Entity Recognition With Sentence-Level Entity Type Attentions Multi-task 92.17 92.51 92.73
Named Entity Recognition as Dependency Parsing NER 93.7 93.3 93.5
A Supervised Multi-Head Self-Attention Network for Nested Named Entity Recognition Multi-task - - 93.6

GENIA

Model 模型分类 P R F1
A Boundary-aware Neural Model for Nested Named Entity Recognition Multi-task 75.9 73.6 74.7
A Supervised Multi-Head Self-Attention Network for Nested Named Entity Recognition Multi-task 80.03 78.9 79.6
Named Entity Recognition as Dependency Parsing NER 81.8 79.3 80.5

注:因为GENIA是嵌套实体数据集,所以该数据集上的实验模型通常不会考虑依存分析、跨领域。

实验结果简要分析:

  1. 每一篇论文尽量都避过了与其他论文相比
  2. 各个模型的实验配置大不相同,大多采用ELMo、BERT、char-level embedding以及依存分析依赖的embedding,然后拼接在一起。(表格中列出的是最优结果,论文中的实验结果表明,如果不用BERT向量,效果下降2~3个点)
  3. 每一个模型所用的具体实验参数配置在后面详细介绍
  4. 综合来看,Named Entity Recognition as Dependency Parsing 这篇论文在所有数据集上效果都是最好的。

论文简要总结

DP+NER

DP+NER这里有三篇论文,其中两篇属于DP+NER,即:既需要DP标签,也需要NER标签。

论文 会议 实验设置 备注
Dependency-Guided LSTM-CRF Model for Named Entity Recognition EMNLP 2019 输入向量为字符向量+依存分析向量+EMLo的拼接 用与不用ELMo向量相差1~2个点。
Better Feature Integration for Named Entity Recognition NAACL 2021 输入向量为字符向量+依存分析向量+BERT+POS向量的拼接 用与不用BERT相差1~2个点
Named Entity Recognition as Dependency Parsing ACL 2020 输入向量为字符向量+fasttext+BERT的拼接 使用了BERT-large,去掉后下降2.4个点

多任务

多任务这里有3个模型,均不需要额外的数据来实现多任务。其中:

  • 有两篇是采用语言模型作为辅助任务。
  • 有一篇是让模型额外预测句子中有哪些实体标签。
论文 会议 任务1 任务2 实验设置 备注
Semi-supervised Multitask Learning for Sequence Labeling ACL 2017 NER LM 输入向量就是随机初始化的词向量,编码器是BiLSTM 利用LM任务辅助NER。
Empower Sequence Labeling with Task-Aware Neural Language Model AAAI 2018 NER LM 输入向量Glove,编码器是BiLSTM 也是利用LM辅助NER,与上一篇类似。代码2](https://github.com/sgrvinod/a-PyTorch-Tutorial-to-Sequence-Labeling)
A Joint Model for Named Entity Recognition With Sentence-Level Entity Type Attentions IEEE 2021 NER 预测句子中有哪些实体类型 输入向量是词向量+字符向量+BERT拼接 任务2是论文提出的,即:除了预测句子中每一个单词的ner标签,额外预测这个句子中有哪些实体标签,即预测这个句子中是否存在NER、LOC还是ORG等。这个任务的设计不需要额外数据标注

CrossDomain

论文 会议 source target 实验设置 备注
CrossNER: Evaluating Cross-Domain Named Entity Recognition AAAI 2021 CoNLL03 论文给出的5个数据集 以政治领域为例:首先在搜集的政治领域语料库上采用掩码语言模型预训练,然后在政治领域的有标注NER数据上微调,来实现Cross-domain。 论文收集公布了5个专业领域的NER数据集,而且也提供了对应的领域相关的预训练语料
Cross-Domain NER using Cross-Domain Language Modeling ACL 2019 CoNLL03 医疗数据集

其它

地址 说明
https://github.com/yzhangcs/parser 支持很多parsing的功能,可以直接转成Conll形式。貌似支持将成分分析转换为依存分析的功能。
https://github.com/ShannonAI/mrc-for-flat-nested-ner/blob/master/ner2mrc/download.md NER数据集 (包括ACE04、ACE05、GENIA、CoNLL2003以及Ontonotes),只不过是处理成MRC形式的
https://github.com/juand-r/entity-recognition-datasets NER数据集,不局限于英文
https://arxiv.org/pdf/1812.09449.pdf A Survey on Deep Learning for Named Entity Recognition

some-ner-models's People

Contributors

xianghuisun avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.