Comments (11)

zxlzr commented on June 16, 2024

You could try setting epoch to 1 and train_batch_size to 64 to see whether the results look normal.

from deepke.

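Tubggg commented on June 16, 2024

You could try setting epoch to 1 and train_batch_size to 64 to see whether the results look normal.

Because of limited compute, I could only run with epoch set to 1 and train_batch_size set to 32. The results are as follows: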
[2024-01-19 23:06:14,854][main][INFO] - ***** Eval results *****
[2024-01-19 23:06:14,854][main][INFO] -
              precision    recall  f1-score   support

        好转       0.00      0.00      0.00        69
    无法判断       0.00      0.00      0.00        17
        未愈       0.00      0.00      0.00        22
        痊愈       0.00      0.00      0.00         1

   micro avg       0.00      0.00      0.00       109
   macro avg       0.00      0.00      0.00       109
weighted avg       0.00      0.00      0.00       109

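xxupiano commented on June 16, 2024

Hello. Judging from the results, the dataset seems to contain only about 100 examples, which is too few. Increase epoch and train for more epochs (the GPU memory limit only affects batch_size). We recommend using few-shot NER.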

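Tubggg commented on June 16, 2024

Hello. Judging from the results, the dataset seems to contain only about 100 examples, which is too few. Increase epoch and train for more epochs (the GPU memory limit only affects batch_size). We recommend using few-shot NER.

Those results are on the validation set. The training set has about 300 examples, and I set epoch to 200, which I think should be enough. With the other script, run_lstmcrf.py, I get normal results.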

zxlzr commented on June 16, 2024

For BERT, this may be too few samples. We suggest collecting more data and trying again.

zxlzr commented on June 16, 2024

Has your problem been resolved?

Tubggg commented on June 16, 2024

Has your problem been resolved?

Not yet. What puzzles me is that the F1 on the training set is also all 0, and every predicted label is 'O'.
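To illustrate why all-'O' predictions produce a report of zeros, here is a minimal sketch of entity-level precision, recall, and F1. It assumes BIO-style tags such as `B-好转` / `I-好转`; the thread does not state the tagging scheme, and `extract_entities` and `entity_prf` are hypothetical helpers, not DeepKE functions.

```python
def extract_entities(tags):
    """Return a set of (type, start, end) spans from a BIO tag sequence."""
    entities = set()
    start, etype = None, None
    # A trailing "O" sentinel flushes an entity that ends at the last token.
    for i, tag in enumerate(list(tags) + ["O"]):
        inside = tag.startswith("I-") and etype == tag[2:]
        if not inside:
            if etype is not None:
                entities.add((etype, start, i))
            if tag.startswith("B-"):
                start, etype = i, tag[2:]
            else:
                start, etype = None, None
    return entities


def entity_prf(gold_tags, pred_tags):
    """Entity-level precision, recall, and F1 for one tag sequence."""
    gold = extract_entities(gold_tags)
    pred = extract_entities(pred_tags)
    tp = len(gold & pred)
    precision = tp / len(pred) if pred else 0.0
    recall = tp / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Assumed BIO tags built from two of the labels in the eval report.
gold = ["B-好转", "I-好转", "O", "B-未愈", "I-未愈"]
all_o = ["O"] * len(gold)
print(entity_prf(gold, all_o))  # (0.0, 0.0, 0.0)
```

With no predicted entities there are no true positives, so every score is 0 regardless of how many 'O' tokens are labelled correctly.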

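zxlzr commented on June 16, 2024

Please pull the latest code, enlarge the dataset, run a few more epochs, and then check the results on the training set again. If they are still all 0, please contact us.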

zxlzr commented on June 16, 2024

Has your problem been resolved?

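Tubggg commented on June 16, 2024

Has your problem been resolved?

The results are still all 0. I will check carefully whether the problem is in my data and ask again if I run into further issues. Thanks for your help!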
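One possible way to start such a data check is to count the tags in the training file and flag malformed lines. This is only a sketch: it assumes a CoNLL-style layout with one `token tag` pair per line and blank lines between sentences, which the thread does not confirm, and `count_tags` is a hypothetical helper.

```python
from collections import Counter


def count_tags(lines):
    """Count tags and collect malformed lines from 'token tag' formatted data."""
    counts = Counter()
    malformed = []
    for lineno, line in enumerate(lines, start=1):
        line = line.rstrip("\n")
        if not line.strip():
            continue  # a blank line marks a sentence boundary
        parts = line.split()
        if len(parts) != 2:
            malformed.append((lineno, line))
            continue
        counts[parts[1]] += 1
    return counts, malformed


# Tiny in-memory sample; for a real file (path is hypothetical) one could use:
#   with open("train.txt", encoding="utf-8") as f:
#       counts, bad = count_tags(f)
sample = ["好 B-好转\n", "转 I-好转\n", "\n", "了 O\n", "broken\n"]
counts, bad = count_tags(sample)
print(counts)
print(bad)
```

If nearly every tag is 'O', or if the tag names in the file differ from the label list the model is configured with, that would be worth ruling out before collecting more data.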

zxlzr commented on June 16, 2024

Sure, feel free to ask whenever you have questions.
