GithubHelp home page GithubHelp logo

Comments (5)

mymusise avatar mymusise commented on August 23, 2024

你的语料很大吗,很大的话可以考虑用TFRecord, 最近也有计划把数据预处理迁移到TFRecord

from gpt2-quickly.

850886470 avatar 850886470 commented on August 23, 2024

你的语料很大吗,很大的话可以考虑用TFRecord, 最近也有计划把数据预处理迁移到TFRecord

就是用的git项目里原有的诗歌内容。。没做什么修改。大佬如果直接用诗歌训练正常会占用多少内存呀。上午我把batch_size减少成1了,可以运行一会了,但运行到一半还是会超过32G。

我原以为诗歌语料32G足够了。。 我再把诗歌减少一半试试

from gpt2-quickly.

mymusise avatar mymusise commented on August 23, 2024

好像上次换成sentenceprice后导致了一些bug,我修复下,稍等一会

from gpt2-quickly.

mymusise avatar mymusise commented on August 23, 2024

修复了构建字典的bug,不过好像你这个问题不是这个导致的,应该是上次我改动了测试的参数配置,不过我本地和colab上运行了好像都不会占用32G内存这么多,你有用GPU吗?
或者你试试现在的测试参数配置

from gpt2-quickly.

850886470 avatar 850886470 commented on August 23, 2024

现在参数可以了。我后来也把层数,token减少了。

感谢大佬,感觉开始入门了。

from gpt2-quickly.

Related Issues (8)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.