Comments (2)
13b模型发布的时候,还没有audio这一路。
from video-llama.
抱歉让你产生了一些困惑
我之前在另一个issue里回复过,VL/AL的checkpoint是跟language decoder绑定的,同时如果想要正常跑起来VL和AL分支是共享language decoder的,所以这就决定了无法同时加载video 13b的参数和audio 7b的参数
from video-llama.
Related Issues (20)
- What is the input sample of the forward function in videollama HOT 1
- 如何提升下游任务上finetune的效果
- How To: Use hugging face checkpoints downloaded on a CentOS machine HOT 4
- Unable to launch demo HOT 2
- Is video-LLaMA capable of comprehending videos that have faces surrounded by bounding boxes(face recognition)
- Evaluation on large-scale dataset HOT 1
- Compatibility b/w torch and torchvision?
- .
- Possible bugs in LR scheduler
- how to increase the numbers of input frame? HOT 2
- What if no frame_position_embeddings?
- llm在两个阶段都是keep frozen吗? HOT 1
- finetune-billa7b-zh inference error shape '[-1, 136]' is invalid for input of size 137
- Finetune with LoRA and QLoRA
- Error loading the audio
- Problem running demo: Loading checkpoint shards never finishes HOT 1
- modelling_llama.py
- 训练时长?
- Audio input
- 模型错误输出结果
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from video-llama.