GithubHelp home page GithubHelp logo

Comments (10)

DifferentialityDevelopment avatar DifferentialityDevelopment commented on June 16, 2024 1

It's busy uploading, you can find it here:
https://huggingface.co/Azamorn/Meta-Llama-3-8B-Instruct-Distributed

from distributed-llama.

DifferentialityDevelopment avatar DifferentialityDevelopment commented on June 16, 2024

You need to point it at the original folder of Meta-Llama-3-8B-Instruct
not the one with safetensors but with the pth files, I already converted mine earlier today.
I'll upload it to huggingface for you.

from distributed-llama.

unclemusclez avatar unclemusclez commented on June 16, 2024

i used download.sh and the key they provide. that was the error i got. I would like to be able to convert for the future reference.
Tried this on 3 different machines.

Thank you for the converted model. I will post my results with 8x RPi 3b

from distributed-llama.

unclemusclez avatar unclemusclez commented on June 16, 2024
Model name: Meta-Llama-3-8B-Instruct
Target float type: q40
Target file: dllama_meta-llama-3-8b-instruct_q40.bin

I assumed this script would work for all of the LLama3 models, but it does not. Any chance we could get some instruct and or codellama conversion scripts?

from distributed-llama.

b4rtaz avatar b4rtaz commented on June 16, 2024

Could you try to run convert-llama.py from the directory where you have that script?

distributed-llama/converter % python convert-llama.py <modelPath> <targetFloatType>

Btw: in your logs I see JSONDecodeError. Are you pointing a correct directory?

from distributed-llama.

unclemusclez avatar unclemusclez commented on June 16, 2024

Yes.

The script DOES NOT work with Instruct Models or CodeLLama Models.

However, It DOES WORK for 8B, non-instruct, non-code models.

from distributed-llama.

DifferentialityDevelopment avatar DifferentialityDevelopment commented on June 16, 2024

Could you give more information, what repo, what folder you pointing it at etc?

I should probably say it again, it doesn't work with safetensor files, but with .pth files.
Ideally the script would work with safetensor models as well but not right now.

from distributed-llama.

unclemusclez avatar unclemusclez commented on June 16, 2024

i am downloading models directly from the meta repository. In order to download these models, you need a key mailed to you. I run the download.sh script as provided. when executing the converter script on models that are designed for Instruct/Code, the script fails.

https://i.imgur.com/XjvNKLl.png

When executing the Tokenizer script for instruct models:

https://i.imgur.com/Q4akLbO.png

from distributed-llama.

DifferentialityDevelopment avatar DifferentialityDevelopment commented on June 16, 2024

I downloaded them straight from huggingface
https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
then I used the files in the original folder in the model repo.

from distributed-llama.

unclemusclez avatar unclemusclez commented on June 16, 2024

i'll try this maybe i downloaded something wrong

from distributed-llama.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.