Comments (3)
Check the 20th line in tokenize_dataset_rows.py and see if the path to the model is correct. If the path is correct, then check if the model weights are correct.
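A rough sketch of that check, assuming the model path points to a local directory in the usual Hugging Face layout (a config.json plus *.bin or *.safetensors weight files). The helper name check_model_dir is hypothetical and not part of chatglm3-finetune, and it only catches missing or empty files, not corrupted weights.

```python
from pathlib import Path


def check_model_dir(model_path):
    """Return a list of problems found in a local model directory.

    Hypothetical helper: assumes a Hugging Face style layout with a
    config.json and at least one *.bin or *.safetensors weight file.
    An empty list means no obvious problem was found.
    """
    root = Path(model_path)
    if not root.is_dir():
        return [f"{str(model_path)!r} is not an existing directory"]

    problems = []
    if not (root / "config.json").is_file():
        problems.append("config.json is missing")

    weights = list(root.glob("*.bin")) + list(root.glob("*.safetensors"))
    if not weights:
        problems.append("no *.bin or *.safetensors weight files found")
    elif any(w.stat().st_size == 0 for w in weights):
        # A zero-byte file usually means an interrupted download.
        problems.append("at least one weight file is empty")
    return problems
```

Calling check_model_dir with the same path that is set in tokenize_dataset_rows.py and printing the returned list should show quickly whether the directory is the problem.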
from chatglm3-finetune.
If the model has no issues, check the version of the datasets package: datasets==2.10.1
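One way to check the installed version from Python, as a small sketch using only the standard library. The function names installed_version and version_report are made up for illustration; the pin 2.10.1 is the one mentioned in this comment.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string of a package, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def version_report(package, expected):
    """Describe whether the installed version matches the expected pin."""
    found = installed_version(package)
    if found is None:
        return f"{package} is not installed (expected {expected})"
    if found != expected:
        return f"{package} {found} is installed, expected {expected}"
    return f"{package} {found} matches the expected version"
```

Running print(version_report("datasets", "2.10.1")) in the environment used for fine-tuning shows whether the pin is satisfied; if it is not, pip install datasets==2.10.1 installs that version.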
from chatglm3-finetune.
Solved! That's great. Thank you for your quick response!
Following your advice, I changed the model value in tokenize_dataset_rows.py, and it worked. I also changed the model value in finetune.py, and the fine-tuning process succeeded too.
This is great work.
Do you have a plan to support QLoRA?
By the way, it would help to add instructions to the README.md for changing the model value in tokenize_dataset_rows.py and finetune.py.
from chatglm3-finetune.
Related Issues (19)
- from model.modeling_chatglm import ChatGLMForConditionalGeneration not found HOT 1
- FineTune CUDA out of memory HOT 10
- No module named 'model' HOT 1
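- Could you provide a script for multi-GPU training?
- Could you provide reference code for int4 quantized fine-tuning? HOT 1
- How can I run inference on the fine-tuned model through this OpenAI API style interface? HOT 1
- Inference results after fine-tuning are incomplete HOT 2
- How do I add tools so that the model can recognize them? HOT 1
- How should the results of fine-tuning be tested? HOT 1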
- UnicodeDecodeError: 'gbk' codec can't decode byte 0xa4 in position 64: illegal multibyte sequence HOT 3
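- Training time multiplies when wrapped with SFTTrainner HOT 2
- Problem saving checkpoints with LoRA HOT 2
- Hello, when I run your code, neither multi nor the default mode is deployed across multiple GPUs automatically. Does the code need to be adjusted to fix this? The GPUs are Tesla T4*4 HOT 3
- GPU memory usage spikes during evaluate HOT 2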
- It is recommended to support multiple GPU cards HOT 2
- huggingface_hub.utils._validators.HFValidationError: Repo id must use alphanumeric chars or '-', '_', '.', '--' and '..' are forbidden, '-' and '.' cannot start or end the name, max length is 96: './bge-large-zh'. HOT 1
- RuntimeError: probability tensor contains either `inf`, `nan` or element < 0 HOT 2
- FileNotFoundError: [Errno 2] No such file or directory: 'data/npc_data.csv'