GithubHelp home page GithubHelp logo

请问,执行的embedding_convert.py过程出现AssertionError: The sum of weights (-3801.82958984375) is wrong怎么解决? about billa HOT 8 CLOSED

chk4991 avatar chk4991 commented on August 23, 2024
请问,执行的embedding_convert.py过程出现AssertionError: The sum of weights (-3801.82958984375) is wrong怎么解决?

from billa.

Comments (8)

Neutralzz avatar Neutralzz commented on August 23, 2024

重新检查你原始llama的权重

from billa.

chk4991 avatar chk4991 commented on August 23, 2024

重新检查你原始llama的权重

用的是原始llama7b的consolidated.00.pth吗

from billa.

hopeforus avatar hopeforus commented on August 23, 2024

最后程序运行完成出现这个数字是不是就代表权重合作成功?

from billa.

Neutralzz avatar Neutralzz commented on August 23, 2024

参考issue#8

from billa.

ShadowPower avatar ShadowPower commented on August 23, 2024

我有相同的问题,CPU是6800H,没有独显

from billa.

wwngh1233 avatar wwngh1233 commented on August 23, 2024

我遇到了相同的问题,可以推荐下哪里下到作者用的原始的llama权重吗

from billa.

FloatingIsland2 avatar FloatingIsland2 commented on August 23, 2024

重新检查你原始llama的权重

用以下llama初始权重https://huggingface.co/nyanko7/LLaMA-7B/tree/main,同样出错
AssertionError: The sum of weights (-3801.82958984375) is wrong.
有推荐的正确初始权重吗?

另外6.1发现作者更新了embedding_convert.py中的求和部分,现在可以跳过求和部分直接转换,这样是否正确?

from billa.

Neutralzz avatar Neutralzz commented on August 23, 2024

重新检查你原始llama的权重

用以下llama初始权重https://huggingface.co/nyanko7/LLaMA-7B/tree/main,同样出错 AssertionError: The sum of weights (-3801.82958984375) is wrong. 有推荐的正确初始权重吗?

另外6.1发现作者更新了embedding_convert.py中的求和部分,现在可以跳过求和部分直接转换,这样是否正确?

否,如果是转换SFT模型,求和结果必须等于代码中EMBED_SUM的结果。

from billa.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.