Comments (7)
设置peft_path 就可以恢复训练。
from medicalgpt.
感谢,我测试了加了peft_path这个参数还是有问题,报错
File "/home/xxx/miniconda3/lib/python3.8/site-packages/transformers/trainer.py", line 2128, in _load_from_checkpoint
raise ValueError(f"Can't find a valid checkpoint at {resume_from_checkpoint}")
ValueError: Can't find a valid checkpoint at outputs-pt-v1/checkpoint-8000
已经配置了
--peft_path ~/MedicalGPT/scripts/outputs-pt-v1/checkpoint-8000,这个目录下文件如下
adapter_config.json
adapter_model.bin
optimizer.pt
rng_state_0.pth
rng_state_1.pth
rng_state_2.pth
rng_state_3.pth
scaler.pt
scheduler.pt
trainer_state.json
training_args.bin
from medicalgpt.
加了peft_path,就可以把resume_from_checkpoint的逻辑注释掉,我一会儿改下。
from medicalgpt.
lora的恢复训练用参数peft_path,全参的恢复训练用resume_from_checkpoint
from medicalgpt.
lora的恢复训练用参数peft_path,全参的恢复训练用resume_from_checkpoint
使用peft_path 后,日志如下:
Peft from pre-trained model: /root/autodl-tmp/finetune-sft/outputs-sft-v2/checkpoint-32500
{'loss': 2.1882, 'learning_rate': 1.1111111111111112e-08, 'epoch': 0.0}
{'loss': 0.9015, 'learning_rate': 1.1111111111111112e-07, 'epoch': 0.0}
{'loss': 1.0434, 'learning_rate': 2.2222222222222224e-07, 'epoch': 0.0}
0%| | 22/36000 [02:20<63:07:57, 6.32s/it]
这个看上去还是重新开始呢
from medicalgpt.
是继续训练,看loss可知。
from medicalgpt.
lora的恢复训练用参数peft_path,全参的恢复训练用resume_from_checkpoint
使用peft_path 后,日志如下: Peft from pre-trained model: /root/autodl-tmp/finetune-sft/outputs-sft-v2/checkpoint-32500
{'loss': 2.1882, 'learning_rate': 1.1111111111111112e-08, 'epoch': 0.0} {'loss': 0.9015, 'learning_rate': 1.1111111111111112e-07, 'epoch': 0.0} {'loss': 1.0434, 'learning_rate': 2.2222222222222224e-07, 'epoch': 0.0} 0%| | 22/36000 [02:20<63:07:57, 6.32s/it]
这个看上去还是重新开始呢
you can pass resume_from_checkpoint=True
to the trainer to skip previous steps. See huggingface/transformers#24274
from medicalgpt.
Related Issues (20)
- vocab扩展后的模型合并问题 HOT 1
- ppo训练时出现问题:UserWarning: KL divergence is starting to become negative: -233.50 HOT 2
- DPO训练,报错:“IndexError: Invalid key: 0 is out of bounds for size 0” HOT 2
- 运行pretraining.py时报错:RuntimeError: CUDA error: device-side assert triggered HOT 4
- 医学大模型全流程体验 HOT 2
- 关于llama3的权重转换 HOT 1
- ValueError: Please specify target_modules in peft_config HOT 1
- PPO和SFT阶段数据集 HOT 2
- 大佬,DPO训练报错 HOT 4
- 大佬,DPO可以改成inputIds和attention_mask 输入吗 HOT 1
- 支持GLM4微调 HOT 1
- notebook报错 HOT 1
- 增量预训练PT与有监督微调SFT的疑问 HOT 1
- RuntimeError: "nll_loss_out_frame" not implemented for 'Half' HOT 2
- 关于本地训练问题 HOT 1
- 增量预训练,这样的input_ids的格式是不是有问题,帮忙看看 HOT 1
- 从头开始训练 HOT 2
- 运行sh ./run_ppo.sh时遇到错误ValueError: Target modules q_proj,v_proj not found in the base model. Please check the target modules and try again错误复现过程
- 请问是否支持最新的InternLM 2.5? HOT 1
- 训练数据集切分一次,多次重复使用的问题 HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from medicalgpt.