Comments (10)
It's busy uploading, you can find it here:
https://huggingface.co/Azamorn/Meta-Llama-3-8B-Instruct-Distributed
from distributed-llama.
You need to point it at the original folder of Meta-Llama-3-8B-Instruct
not the one with safetensors but with the pth files, I already converted mine earlier today.
I'll upload it to huggingface for you.
from distributed-llama.
i used download.sh and the key they provide. that was the error i got. I would like to be able to convert for the future reference.
Tried this on 3 different machines.
Thank you for the converted model. I will post my results with 8x RPi 3b
from distributed-llama.
Model name: Meta-Llama-3-8B-Instruct
Target float type: q40
Target file: dllama_meta-llama-3-8b-instruct_q40.bin
I assumed this script would work for all of the LLama3 models, but it does not. Any chance we could get some instruct and or codellama conversion scripts?
from distributed-llama.
Could you try to run convert-llama.py
from the directory where you have that script?
distributed-llama/converter % python convert-llama.py <modelPath> <targetFloatType>
Btw: in your logs I see JSONDecodeError
. Are you pointing a correct directory?
from distributed-llama.
Yes.
The script DOES NOT work with Instruct Models or CodeLLama Models.
However, It DOES WORK for 8B, non-instruct, non-code models.
from distributed-llama.
Could you give more information, what repo, what folder you pointing it at etc?
I should probably say it again, it doesn't work with safetensor files, but with .pth files.
Ideally the script would work with safetensor models as well but not right now.
from distributed-llama.
i am downloading models directly from the meta repository. In order to download these models, you need a key mailed to you. I run the download.sh script as provided. when executing the converter script on models that are designed for Instruct/Code, the script fails.
https://i.imgur.com/XjvNKLl.png
When executing the Tokenizer script for instruct models:
https://i.imgur.com/Q4akLbO.png
from distributed-llama.
I downloaded them straight from huggingface
https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
then I used the files in the original folder in the model repo.
from distributed-llama.
i'll try this maybe i downloaded something wrong
from distributed-llama.
Related Issues (20)
- terminate called after throwing an instance of 'ReadSocketException' HOT 35
- Vulkan Acceleration HOT 32
- How To Add Suppoerted Model HOT 2
- network utilization HOT 3
- (Crashing on Low Memory SBC) main invoked oom-killer: gfp_mask=0x1100dca(GFP_HIGHUSER_MOVABLE|__GFP_ZERO), order=0, oom_score_adj=0 HOT 51
- [Setup] Multiple Apple Silicon Macs: Questions HOT 1
- Support for another models (ollama models) HOT 3
- [Feature Suggest] From All-Reduce to Ring-All-Reduce HOT 1
- Support nSlices > nKvHeads
- [Feature Suggest] Config File alternative to Command Line Arguments HOT 2
- Request: Community Discord? HOT 1
- convert into .bin HOT 2
- support multi nvidia jetson agx orin? HOT 3
- master and worker started but with problems HOT 5
- float-type f32 will not start HOT 2
- dllama-api hosted on 127.0.0.1 HOT 2
- dllama-api invokes "what(): Invalid tokenizer file " HOT 2
- what(): Cannot create socket HOT 1
- What about mobile phones? HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from distributed-llama.