GithubHelp home page GithubHelp logo

Comments (6)

PeterMao11 avatar PeterMao11 commented on August 16, 2024

返回demo结果
image

from video-llama.

lixin4ever avatar lixin4ever commented on August 16, 2024

llama_model: "vicuna-7b-delta-v0"

llama_model应该设置为vicuna-7b而不是vicuna-7b-delta的目录, 至于如何将vicuna delta weight转变为vicuna weight, 请参考: https://github.com/DAMO-NLP-SG/Video-LLaMA/tree/main#pre-trained-language-decoder

from video-llama.

PeterMao11 avatar PeterMao11 commented on August 16, 2024

https://huggingface.co/openlm-research/open_llama_7b 下载original LLaMA 模型,python apply_delta.py
--base llama_7b
--target vicuna-7b
--delta vicuna-7b-delta-v0进行转换

但返回结果如下

image

from video-llama.

lixin4ever avatar lixin4ever commented on August 16, 2024

你这个是open_llama, 不是llama, llama权重需要去官方仓库填写表格申请下载

from video-llama.

PeterMao11 avatar PeterMao11 commented on August 16, 2024

使用Video-LLaMA-BiLLA中文模型,除了llm和chpt修改配置,例如qformer,clip这些模型需要修改配置么

from video-llama.

lixin4ever avatar lixin4ever commented on August 16, 2024

不需要的,所有实验里visual encoder (ViT & Q-former)都是用的同一套,但是有一个问题就是中文模型目前我们没有添加对audio输入的支持 (也就是没有AL分支),如果你需要使用Video-LLaMA-BiLLA的话,只能尝试运行VL-only Video-LLaMA, 可以参考一下demo_video.pyvideo_llama_eval.yaml

from video-llama.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.