GithubHelp home page GithubHelp logo

Comments (7)

ljynlp avatar ljynlp commented on August 13, 2024 1

可以考虑Named Entity Recognition as Dependency Parsing中的处理方法,对出现嵌套的实体的头尾关系概率进行比较,选取最大的那一个

from w2ner.

yrn1126 avatar yrn1126 commented on August 13, 2024

分词用的是jieba精确模式,我看您公开的处理后的resume数据集是有‘word‘这项的,想请问您分词用的是哪种方式

from w2ner.

ljynlp avatar ljynlp commented on August 13, 2024

虽然resume数据中我们进行了分词,但实际上代码并没有用到,本质上还是在字符级别上进行抽取。
由于我们使用的是macro f1,所以没有考虑不同标签的数量,从你的结果中可以看出test上头尾关系的预测结果有一部分其实比val上的结果差,应该不是解码的问题,可能是test和val之间存在差异,比如同一类型实体数目在两个数据上相差较大。
val: [0.99797655 0.98461538 0.94179894 0.90293454 0.85561497 0.99166667
0.8 0.98181818]
test: [0.99777767 0.985705 0.91578947 0.90581162 0.81385281 0.98876404
0.85964912 1. ]

from w2ner.

yrn1126 avatar yrn1126 commented on August 13, 2024

在flat数据集中,predict有嵌套结果,例如m17文化广场实体被预测为 m17、文化广场、m17文化广场,在训练数据中没有标注嵌套实体,如果希望只把模型当作flat ner模型,我应该怎样做呢?

from w2ner.

lzf00 avatar lzf00 commented on August 13, 2024

虽然resume数据中我们进行了分词,但实际上代码并没有用到,本质上还是在字符级别上进行抽取。 由于我们使用的是macro f1,所以没有考虑不同标签的数量,从你的结果中可以看出test上头尾关系的预测结果有一部分其实比val上的结果差,应该不是解码的问题,可能是test和val之间存在差异,比如同一类型实体数目在两个数据上相差较大。 val: [0.99797655 0.98461538 0.94179894 0.90293454 0.85561497 0.99166667 0.8 0.98181818] test: [0.99777767 0.985705 0.91578947 0.90581162 0.81385281 0.98876404 0.85964912 1. ]
根据你的问题,中文数据集中没有用到句子的分词word这一数据,如果我想应用这一部分的数据,该如何加入到现有的模型中呢,或者是否可以讲base-bert改为全词覆盖的中文 BERT 预训练模型BERT-wwm?

from w2ner.

lzf00 avatar lzf00 commented on August 13, 2024

作者您好,首先感谢您的分享。有一个问题想要请教,在我自己构建的数据集(中文,flat)上进行实验时,test的label和entity的准召相差还比较大,val时相差不是很大,请问这是decode时出现了什么问题呢 2022-04-01 17:47:44 - INFO: Epoch: 9 2022-04-01 17:48:01 - INFO: +---------+--------+--------+-----------+--------+ | Train 9 | Loss | F1 | Precision | Recall | +---------+--------+--------+-----------+--------+ | Label | 0.0061 | 0.9698 | 0.9694 | 0.9703 | +---------+--------+--------+-----------+--------+ 2022-04-01 17:48:02 - INFO: EVAL Label F1 [0.99797655 0.98461538 0.94179894 0.90293454 0.85561497 0.99166667 0.8 0.98181818] 2022-04-01 17:48:02 - INFO: +--------+--------+-----------+--------+ | EVAL 9 | F1 | Precision | Recall | +--------+--------+-----------+--------+ | Label | 0.9321 | 0.9258 | 0.9389 | | Entity | 0.9207 | 0.9187 | 0.9226 | +--------+--------+-----------+--------+ 2022-04-01 17:48:03 - INFO: TEST Label F1 [0.99777767 0.985705 0.91578947 0.90581162 0.81385281 0.98876404 0.85964912 1. ] 2022-04-01 17:48:03 - INFO: +--------+--------+-----------+--------+ | TEST 9 | F1 | Precision | Recall | +--------+--------+-----------+--------+ | Label | 0.9334 | 0.9176 | 0.9513 | | Entity | 0.8928 | 0.8799 | 0.9061 | +--------+--------+-----------+--------+ 2022-04-01 17:48:03 - INFO: Best DEV F1: 0.9230 2022-04-01 17:48:03 - INFO: Best TEST F1: 0.8848 2022-04-01 17:48:08 - INFO: TEST Label F1 [0.99751797 0.98505523 0.9197861 0.904 0.79831933 0.98876404 0.84581498 1. ] 2022-04-01 17:48:08 - INFO: +------------+--------+-----------+--------+ | TEST Final | F1 | Precision | Recall | +------------+--------+-----------+--------+ | Label | 0.9299 | 0.9131 | 0.9486 | | Entity | 0.8848 | 0.8688 | 0.9014 | +------------+--------+-----------+------

请问你自己构建的中文数据集,超参数是这么设置的,还是和resume-zh的一样吗

from w2ner.

ljynlp avatar ljynlp commented on August 13, 2024

根据你的问题,中文数据集中没有用到句子的分词word这一数据,如果我想应用这一部分的数据,该如何加入到现有的模型中呢,或者是否可以讲base-bert改为全词覆盖的中文 BERT 预训练模型BERT-wwm?

将BERT-base替换为BERT-wwm是可行的,实验做起来也很简单。另外可以尝试将word信息融入卷积模块中。

请问你自己构建的中文数据集,超参数是这么设置的,还是和resume-zh的一样吗

不同的数据需要的参数可能不太一样,需要根据实验结果进行相对应的调试。

from w2ner.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.