Comments (10)

skarl-api commented on July 18, 2024

@yanwii

from dynamic-seq2seq.

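yanwii commented on July 18, 2024

Set it according to the vocabulary size of your encoder. The sizes are fixed inside the saved model file, so changing them on a second run raises an error.
In your case, delete the model and retrain.
You can set this value somewhat higher than you currently need.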

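skarl-api commented on July 18, 2024

@yanwii How should I write the code for that? The number of questions and the number of answers are not the same. If I just read the data and set a value, I am afraid the next training run will still have the same problem. Please advise.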

yanwii commented on July 18, 2024

For example, if the vocabulary size of your questions is 7000, then encoder_vocab_size=7000.
If the vocabulary size of your answers is 6000, then decoder_vocab_size=6000.
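
A minimal sketch of deriving both values from the vocabulary files, assuming each file holds one token per line. The helper names `count_vocab_entries` and `vocab_sizes` are hypothetical and not part of dynamic-seq2seq.

```python
def count_vocab_entries(path):
    """Return the number of non-empty lines in a vocabulary file.

    Assumes one token per line, UTF-8 encoded.
    """
    with open(path, encoding="utf-8") as f:
        return sum(1 for line in f if line.strip())


def vocab_sizes(encoder_vocab_path, decoder_vocab_path):
    """Return (encoder_vocab_size, decoder_vocab_size) for two vocabulary files."""
    return (
        count_vocab_entries(encoder_vocab_path),
        count_vocab_entries(decoder_vocab_path),
    )
```

The two counts are computed independently, so the question and answer vocabularies do not need to be the same size.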

skarl-api commented on July 18, 2024

@yanwii That is exactly how I set encoder_vocab_size: I take the line counts of encoder_vocabulary and decoder_vocabulary and assign them to the two values. Suppose both were 2818 on the first run, which produced 10000.ckpt.
On the second run, when I want to train on more data, I cannot continue training from 10000.ckpt; doing so raises an error.

@yanwii UnimplementedError (see above for traceback): TensorArray has size zero, but element shape [?,20] is not fully defined. Currently only static shapes are supported when packing zero-size TensorArrays.

Also, what is causing this error?

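yanwii commented on July 18, 2024

Because of how TensorFlow works, the saved model cannot be modified afterwards. You can therefore set the values somewhat larger than needed, so that the next run can continue training. Otherwise the only option is to retrain from scratch.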
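
One way to follow this advice is to pick padded sizes once, store them next to the checkpoint, and reuse them on every later run. The sketch below assumes a small JSON file for that purpose. The file, the helper names, and the 25% headroom rounded up to a multiple of 1000 are all illustrative choices, not something dynamic-seq2seq provides.

```python
import json
import os


def padded_vocab_size(actual_size, headroom_percent=25, multiple=1000):
    """Grow actual_size by headroom_percent, then round up to a multiple."""
    grown = -(-actual_size * (100 + headroom_percent) // 100)  # ceiling division
    return -(-grown // multiple) * multiple


def load_or_create_sizes(config_path, encoder_actual, decoder_actual):
    """Reuse the sizes saved by the first run, or choose padded sizes and save them."""
    if os.path.exists(config_path):
        with open(config_path, encoding="utf-8") as f:
            sizes = json.load(f)
    else:
        sizes = {
            "encoder_vocab_size": padded_vocab_size(encoder_actual),
            "decoder_vocab_size": padded_vocab_size(decoder_actual),
        }
        with open(config_path, "w", encoding="utf-8") as f:
            json.dump(sizes, f)
    # The saved model cannot grow, so an outgrown vocabulary means retraining.
    if (encoder_actual > sizes["encoder_vocab_size"]
            or decoder_actual > sizes["decoder_vocab_size"]):
        raise ValueError(
            "vocabulary outgrew the sizes fixed at first training; "
            "delete the model and retrain"
        )
    return sizes
```

With the 2818-entry vocabularies mentioned above, the first run would fix both sizes at 4000, and later runs keep loading the same checkpoint as long as neither vocabulary exceeds that.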

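skarl-api commented on July 18, 2024

@yanwii I think I understand now. On the first run I will fix the size at a certain number, and then keep using that same number for all later training. That should work, right?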

yanwii commented on July 18, 2024

Yes, that works.

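skarl-api commented on July 18, 2024

@yanwii How do I continue training on top of an existing ckpt? The result of my first training run was 8.1M, and after a great deal more training it is still 8.1M, so it does not seem to have grown. If so, it will never get any smarter no matter how much I train. This is very strange. Any help would be appreciated.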

yanwii commented on July 18, 2024

This project was written a long time ago. You may want to follow my other project instead, a PyTorch-based Chinese chatbot.
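
On the file-size question above: a checkpoint stores the values of the model's variables, so its size follows from the variable shapes and data type, not from how long training has run. A constant 8.1M is therefore expected and does not by itself mean that training has stalled. A rough illustration, using hypothetical shapes that are not taken from dynamic-seq2seq:

```python
def checkpoint_bytes(shapes, bytes_per_value=4):
    """Approximate bytes needed to store variables of the given shapes.

    bytes_per_value=4 corresponds to float32.
    """
    total = 0
    for shape in shapes:
        count = 1
        for dim in shape:
            count *= dim
        total += count * bytes_per_value
    return total


# Hypothetical model: two embedding tables and one output projection,
# with a vocabulary of 2818 and a hidden size of 20.
shapes = [(2818, 20), (2818, 20), (20, 2818)]

size_after_first_run = checkpoint_bytes(shapes)
# More training changes the stored values, not the shapes.
size_after_more_training = checkpoint_bytes(shapes)

# Only a change in shape, such as a larger vocabulary, changes the size.
size_with_larger_vocab = checkpoint_bytes([(4000, 20), (4000, 20), (20, 4000)])
```

Training progress is better judged from the loss or from sample replies than from the size of the file.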
