... would need vocab_type bpe, see here for illustration <a href="https://colab.re

support for llama3 in autoquant about llm-course HOT 3 OPEN

mlabonne commented on July 20, 2024

support for llama3 in autoquant

from llm-course.

Comments (3)

CrispStrobe commented on July 20, 2024 1

indeed might be better to wait with regard to the pre-tokenizer. i am not completely sure i understood the procedure for new models like say llama3 merges. but my current understanding is illustrated by this updated kaggle script.
there is also now a problem with older models: there are some models, like phi2, which need convert-hf-to-gguf.py and not convert.py. and after the new pre-tokenizer-fix, some of these will not easily work now.
i wonder why the script not simply falls back on default in such cases. my workaround is to just use an older version for such cases.
so atm we have at least 3 number of cases afaik:

old models like phi2 ==> older convert-hf-to-gguf.py
new bpe models like llama3 ==> newer convert-hf-to-gguf.py with complicated pre-tokenizer-handling
others ==> convert.py

from llm-course.

CrispStrobe commented on July 20, 2024

in the meanwhile, there is also a fix for the pretokenizer. i have included it in this Kaggle notebook. of course you can adapt it if you wish.

from llm-course.

mlabonne commented on July 20, 2024

Sorry for the slow response, thanks a lot for opening this issue. I saw a lot of comments about issues with the tokenization in GGUF, so I don't know if it's the right time to update AutoQuant.

I like your improvements in the first notebook. Do you think I should transfer them or should I wait until the situation is fixed?

from llm-course.

Recommend Projects

support for llama3 in autoquant about llm-course HOT 3 OPEN

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs