Related Issues (16)
- Does it really work ob RTX2080Ti ? HOT 1
- supervised_finetune.py failed with a wordaround HOT 1
- SFT with large loss {'loss': 388082722196684.8, 'learning_rate': 0.0, 'epoch': 0.02}
- Can we another format than alpaca-instruct like alpaca-chat instruct format if yes how ?
- 请问运行supervised_finetune.py时报RuntimeError: unscale_() has already been called on this optimizer since the last update().是什么原因,怎么解决? HOT 1
- unable to merge reward adapter into model
- how to evaluate?
- any plans for adding repo using stable vicuna for conversation .. human: assistant
- 跑最后一步报这个警告,要怎么改超参数呢 HOT 3
- 大神和原版vicuna仓库对比过效果吗? HOT 1
- What is the data format to LoRA-fine-tune Vicuna?
- Unable to merge reward adapter into model HOT 2
- 不能理解为什么注释这行代码?
- 请问如何在training reward model中自定义数据集
- python train_reward_model.py failed
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vicuna-lora-rlhf-pytorch.