
Not enough GPU memory (deepke issue, CLOSED)

whwususu avatar whwususu commented on September 25, 2024
Not enough GPU memory.

from deepke.

Comments (12)

guihonghao avatar guihonghao commented on September 25, 2024

Hello, I suggest adding the --bits 4 argument to quantize the model.

from deepke.
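To see why a lower bit width helps, here is a rough back-of-the-envelope sketch of how much memory the weights alone take at different precisions. The parameter count is a hypothetical value (the thread does not say which checkpoint is loaded), and the estimate ignores activations, the KV cache, and quantization overhead.

```python
def weight_memory_gib(n_params: float, bits: int) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    return n_params * bits / 8 / 1024**3


# Hypothetical model size; the thread does not name the checkpoint.
N_PARAMS = 13e9

if __name__ == "__main__":
    for bits in (16, 8, 4):
        gib = weight_memory_gib(N_PARAMS, bits)
        print(f"{bits:>2}-bit weights: ~{gib:.1f} GiB")
```

Going from 16-bit to 4-bit weights cuts this part of the footprint to a quarter, but everything else that lives on the GPU is unaffected.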

whwususu avatar whwususu commented on September 25, 2024

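Hello, I suggest adding the --bits 4 argument to quantize the model.

Which file should I change that in? I have not done any fine-tuning; I am only using this model for extraction, and it raised this error. Thanks!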

from deepke.

guihonghao avatar guihonghao commented on September 25, 2024

You are already using 4-bit quantization. I suggest reducing the input length: max_length=1024, max_new_tokens=512.

from deepke.
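One reason shorter sequences help even after quantization is that the key/value cache grows linearly with the number of tokens. The sketch below estimates that cache size. The layer count, head count, and head dimension are assumptions (typical of a LLaMA-style 13B model, not confirmed for the model in this thread), the cache is assumed to be stored in 16-bit values, and the prompt is assumed to fill max_length before max_new_tokens are generated.

```python
def kv_cache_gib(
    tokens: int,
    n_layers: int = 40,        # assumed, not taken from the thread
    n_kv_heads: int = 40,      # assumed
    head_dim: int = 128,       # assumed
    bytes_per_value: int = 2,  # assumed 16-bit cache entries
    batch: int = 1,
) -> float:
    """Approximate key/value cache size in GiB for a decoder-only model."""
    # Each token stores one key and one value vector per layer and per head.
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_value
    return batch * tokens * per_token / 1024**3


if __name__ == "__main__":
    for max_length, max_new_tokens in ((4096, 1024), (1024, 512)):
        total = max_length + max_new_tokens
        print(f"{total} tokens: ~{kv_cache_gib(total):.2f} GiB of KV cache")
```

Activation memory while the prompt is processed also grows with the input length, so the real saving can be larger than this estimate suggests.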

whwususu avatar whwususu commented on September 25, 2024

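You are already using 4-bit quantization. I suggest reducing the input length: max_length=1024, max_new_tokens=512.

It starts up now, thanks!
No matter how I run the extraction, it only extracts the first item. Am I phrasing the prompt incorrectly?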

from deepke.

guihonghao avatar guihonghao commented on September 25, 2024

The texts in our training set are fairly short, so in practice we recommend not using overly long texts for extraction.

from deepke.

whwususu avatar whwususu commented on September 25, 2024

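The texts in our training set are fairly short, so in practice we recommend not using overly long texts for extraction.

I have no choice; the input arrives one whole document at a time. I was only running a test. If I really need to use this later, I will probably have to study it properly.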

from deepke.

zxlzr avatar zxlzr commented on September 25, 2024

For now, you can split the document into sliding windows and run the extraction model multiple times.

from deepke.
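The sliding-window suggestion could look roughly like the sketch below. It is only an illustration: `extract` is a hypothetical callable standing in for whatever function runs the model on one piece of text and returns a list of hashable results (for example tuples), and the window and stride sizes are arbitrary character counts, not values recommended by the maintainers.

```python
from typing import Callable, Hashable, Iterable


def sliding_windows(text: str, window: int = 400, stride: int = 300) -> list[str]:
    """Split text into overlapping character windows that cover all of it."""
    if window <= 0 or stride <= 0 or stride > window:
        raise ValueError("need 0 < stride <= window")
    if not text:
        return []
    windows = []
    start = 0
    while True:
        windows.append(text[start:start + window])
        if start + window >= len(text):
            break
        start += stride
    return windows


def extract_document(
    text: str,
    extract: Callable[[str], Iterable[Hashable]],  # hypothetical model wrapper
    window: int = 400,
    stride: int = 300,
) -> list:
    """Run extract on every window and merge the results without duplicates."""
    seen = set()
    merged = []
    for chunk in sliding_windows(text, window, stride):
        for item in extract(chunk):
            # Overlapping windows can yield the same item twice.
            if item not in seen:
                seen.add(item)
                merged.append(item)
    return merged
```

Choosing a stride smaller than the window makes neighbouring windows overlap, so an entity or relation that straddles a window boundary still appears whole in at least one window, provided it is shorter than the overlap.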

whwususu avatar whwususu commented on September 25, 2024

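For now, you can split the document into sliding windows and run the extraction model multiple times.

But the context changes from window to window, so accuracy will probably drop. Also, can it read PDFs and similar files directly? Right now I read the PDF with my own program and convert it to a string.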

from deepke.
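One way to limit the loss of context is to cut the string only at sentence boundaries, so that no window starts or ends mid-sentence. The following is a minimal sketch under that assumption; the punctuation set and the character budget are illustrative choices, and it does not address relations that span several sentences.

```python
import re

# A sentence is any run of text ending in Chinese or English terminal
# punctuation, or a trailing run without any. English periods are left out
# on purpose so that decimals such as 3.14 are not split.
_SENTENCE = re.compile(r"[^。！？!?]*[。！？!?]+|[^。！？!?]+")


def split_sentences(text: str) -> list[str]:
    """Split text into sentences, keeping the punctuation attached."""
    return _SENTENCE.findall(text)


def pack_sentences(text: str, max_chars: int = 400) -> list[str]:
    """Group whole sentences into chunks of at most max_chars characters.

    A single sentence longer than max_chars is kept as its own oversized chunk.
    """
    chunks = []
    current = ""
    for sentence in split_sentences(text):
        if current and len(current) + len(sentence) > max_chars:
            chunks.append(current)
            current = ""
        current += sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be passed to the extraction model separately, in the same way as a fixed-size window.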

guihonghao avatar guihonghao commented on September 25, 2024

The current model may not yet handle document-level extraction well. We will strengthen this capability in the next version of the model.

from deepke.

whwususu avatar whwususu commented on September 25, 2024

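The current model may not yet handle document-level extraction well. We will strengthen this capability in the next version of the model.

Looking forward to it.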

from deepke.

gangqing avatar gangqing commented on September 25, 2024

@whwususu How did you solve it? I get the same error, and it persists even after I reduced the input and output lengths.

from deepke.

1223243 avatar 1223243 commented on September 25, 2024

Quick question: do you know how to debug this code with VS Code?

from deepke.
