Comments (2)
@eugenelet Thanks for pointing that out! The current code was adapted from the inference code, so we missed some functions such as parameter freezing. We will fix it soon. For now, you can temporarily add a few lines to freeze the parameters in
Line 332 in db02333
For example, to freeze the LLM:
for param in list(self.layers.parameters()) + list(self.tok_embeddings.parameters()) + list(self.norm.parameters()) + list(self.output.parameters()):
    param.requires_grad = False
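For anyone who wants to try the workaround in isolation, here is a minimal sketch, assuming PyTorch. Only the four attribute names (`layers`, `tok_embeddings`, `norm`, `output`) come from the snippet above; `TinyModel`, `adapter`, `freeze_llm`, the layer sizes, and the learning rate are hypothetical stand-ins and not the repository's actual code.

```python
import itertools

import torch
from torch import nn


class TinyModel(nn.Module):
    """Hypothetical stand-in for the real model; sizes are arbitrary."""

    def __init__(self, vocab=16, dim=8):
        super().__init__()
        # The four LLM parts named in the workaround above.
        self.tok_embeddings = nn.Embedding(vocab, dim)
        self.layers = nn.ModuleList([nn.Linear(dim, dim) for _ in range(2)])
        self.norm = nn.LayerNorm(dim)
        self.output = nn.Linear(dim, vocab, bias=False)
        # Hypothetical non-LLM module that should stay trainable.
        self.adapter = nn.Linear(dim, dim)

    def freeze_llm(self):
        """Disable gradients for every parameter of the LLM parts."""
        llm_parts = (self.layers, self.tok_embeddings, self.norm, self.output)
        for param in itertools.chain.from_iterable(m.parameters() for m in llm_parts):
            param.requires_grad = False


model = TinyModel()
model.freeze_llm()

# Names of the parameters that will still receive gradients.
trainable = [name for name, p in model.named_parameters() if p.requires_grad]

# Hand only the trainable parameters to the optimizer.
optimizer = torch.optim.AdamW(
    [p for p in model.parameters() if p.requires_grad], lr=1e-4
)
```

Checking `trainable` after freezing is a quick way to confirm that only the intended modules remain trainable before starting a run.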
from onellm.
Thanks for pointing this out @csuhan! I'll use this workaround for now. Keep up the great work!