Comments (4)
用bert就行
from medicalgpt.
用bert就行
直接改成bert就报错了:
The tokenizer class you load from this checkpoint is not the same type as the class this function is called from. It may result in unexpected tokenization.
The tokenizer class you load from this checkpoint is 'ChatGLMTokenizer'.
The class this function is called from is 'BertTokenizer'.
tokenizer = tokenizer_class.from_pretrained(tokenizer_name_or_path, **tokenizer_kwargs)
File "/usr/local/anaconda3/envs/hj-glm6b2/lib/python3.10/site-packages/transformers/tokenization_utils_base.py", line 1825, in from_pretrained
return cls._from_pretrained(
File "/usr/local/anaconda3/envs/hj-glm6b2/lib/python3.10/site-packages/transformers/tokenization_utils_base.py", line 1988, in _from_pretrained
tokenizer = cls(*init_inputs, **init_kwargs)
File "/usr/local/anaconda3/envs/hj-glm6b2/lib/python3.10/site-packages/transformers/models/bert/tokenization_bert.py", line 213, in init
if not os.path.isfile(vocab_file):
File "/usr/local/anaconda3/envs/hj-glm6b2/lib/python3.10/genericpath.py", line 30, in isfile
Traceback (most recent call last):
File "/home/haojing/code/js/Guarantee_Intelligence/train/MedicalGPT/reward_modeling.py", line 645, in
main()
File "/home/haojing/code/js/Guarantee_Intelligence/train/MedicalGPT/reward_modeling.py", line 416, in main
tokenizer = tokenizer_class.from_pretrained(tokenizer_name_or_path, **tokenizer_kwargs)
File "/usr/local/anaconda3/envs/hj-glm6b2/lib/python3.10/site-packages/transformers/tokenization_utils_base.py", line 1825, in from_pretrained
st = os.stat(path)
TypeError: stat: path should be string, bytes, os.PathLike or integer, not NoneType
return cls._from_pretrained(
File "/usr/local/anaconda3/envs/hj-glm6b2/lib/python3.10/site-packages/transformers/tokenization_utils_base.py", line 1988, in _from_pretrained
tokenizer = cls(*init_inputs, **init_kwargs)
File "/usr/local/anaconda3/envs/hj-glm6b2/lib/python3.10/site-packages/transformers/models/bert/tokenization_bert.py", line 213, in init
if not os.path.isfile(vocab_file):
File "/usr/local/anaconda3/envs/hj-glm6b2/lib/python3.10/genericpath.py", line 30, in isfile
st = os.stat(path)
TypeError: stat: path should be string, bytes, os.PathLike or integer, not NoneType
from medicalgpt.
model_type = bert
from medicalgpt.
same to #28
from medicalgpt.
Related Issues (20)
- AMD 执行 run_pt.sh失败 HOT 1
- 有没有人能分享下自己微调后的模型id,我懒得弄,只想吃现成的 HOT 1
- vocab扩展后的模型合并问题 HOT 1
- ppo训练时出现问题:UserWarning: KL divergence is starting to become negative: -233.50 HOT 2
- DPO训练,报错:“IndexError: Invalid key: 0 is out of bounds for size 0” HOT 2
- 运行pretraining.py时报错:RuntimeError: CUDA error: device-side assert triggered HOT 4
- 医学大模型全流程体验 HOT 2
- 关于llama3的权重转换 HOT 1
- ValueError: Please specify target_modules in peft_config HOT 1
- PPO和SFT阶段数据集 HOT 2
- 大佬,DPO训练报错 HOT 4
- 大佬,DPO可以改成inputIds和attention_mask 输入吗 HOT 1
- 支持GLM4微调 HOT 1
- notebook报错 HOT 1
- 增量预训练PT与有监督微调SFT的疑问 HOT 1
- RuntimeError: "nll_loss_out_frame" not implemented for 'Half' HOT 2
- 关于本地训练问题 HOT 1
- 增量预训练,这样的input_ids的格式是不是有问题,帮忙看看 HOT 1
- 从头开始训练 HOT 2
- 运行sh ./run_ppo.sh时遇到错误ValueError: Target modules q_proj,v_proj not found in the base model. Please check the target modules and try again错误复现过程
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from medicalgpt.