Comments (4)
ChatGLM does not officially support SequenceClassification, and I have not implemented it either.
from medicalgpt.
Hi, I trained a reward model (RM) and found that it gives every sentence a negative score?
Did you train the reward model yourself? What do the dataset labels look like, and which sentences get negative scores?
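There are no labels, are there? Isn't it one chosen response and one rejected response, with a single loss computed over the pair?
rewards_chosen = model(input_ids=chose["input_ids"].to(device),
                       attention_mask=chose["attention_mask"].to(device))[0]
My rewards_chosen values are all negative.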
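For context on the question above, here is a minimal sketch of why all-negative scores need not indicate a problem. It assumes the reward model was trained with the common pairwise loss -log(sigmoid(r_chosen - r_rejected)); the thread does not confirm which loss was used. The function names and the sample scores are hypothetical and written in plain Python rather than PyTorch. Under that loss only the difference between the two scores matters, so shifting every score by a constant leaves both the loss and the ranking unchanged.

```python
import math


def pairwise_loss(chosen, rejected):
    """Mean of -log(sigmoid(r_chosen - r_rejected)) over all pairs.

    Assumption: this is the usual pairwise ranking loss for reward models;
    it is not taken from the thread itself.
    """
    losses = []
    for rc, rr in zip(chosen, rejected):
        diff = rc - rr
        # -log(sigmoid(x)) == log(1 + exp(-x)), in a numerically stable form.
        losses.append(max(-diff, 0.0) + math.log1p(math.exp(-abs(diff))))
    return sum(losses) / len(losses)


def ranking_accuracy(chosen, rejected):
    """Fraction of pairs where the chosen response scores higher."""
    return sum(rc > rr for rc, rr in zip(chosen, rejected)) / len(chosen)


# Hypothetical scores: every reward is negative, yet chosen > rejected.
chosen = [-1.0, -0.5, -2.0]
rejected = [-3.0, -2.5, -2.5]

# The same scores shifted by +5, so every reward is positive.
chosen_shifted = [r + 5.0 for r in chosen]
rejected_shifted = [r + 5.0 for r in rejected]

loss_negative = pairwise_loss(chosen, rejected)
loss_shifted = pairwise_loss(chosen_shifted, rejected_shifted)
acc = ranking_accuracy(chosen, rejected)
```

If this assumption holds, a more informative check than the sign of rewards_chosen is whether rewards_chosen is larger than the score of the matching rejected response for most pairs.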