Comments (4)
这是colossalai没装好
from billa.
7B模型fp16加载到gpu里就要占14G显存的,你要在16G的V100上跑 batch_size估计是能是1或2。。。
from billa.
这是colossalai没装好
这样呀,我试试换个版本重装,谢谢~
from billa.
7B模型fp16加载到gpu里就要占14G显存的,你要在16G的V100上跑 batch_size估计是能是1或2。。。
好的。还想问下第三阶段readme里面提到用了两张A100,为什么这一步比其他的省卡了呢,一张A100 40G能否跑起来这一阶段?
from billa.
Related Issues (20)
- 预训练纯文本Loss问题请教 HOT 7
- 请问是否`-LLM`和`-SFT`两个权重的词向量部分都叠加了原始LLaMA呢? HOT 2
- 打算在开源的BiLLa模型上继续训练,模型转换有困难 HOT 7
- 突然发现预训练模型似乎存在一些问题(sft之前阶段的模型) HOT 2
- 请大佬帮忙解决tokenizer = AutoTokenizer.from_pretrained(model_path, use_fast=False)对错误,困惑多日了 HOT 7
- 模型问答中,自己会断掉是什么原因? HOT 3
- LLaMMa 的数据,为什么问他说是谷歌的?是训练集的问题吗
- 我自己的prompt测试效果不好,可否提供BiLLa-7B-SFT在LogiQA-v2、C3-d、C3-m数据集上的测试prompt呢? HOT 1
- 考虑训练过程中增加tensorboar监控吗 HOT 1
- 请问这段代码的作用是什么? HOT 2
- 关于MathQA部分数据集的问题
- 大问题,BigProblem HOT 1
- 扩充中文词汇表的细节 HOT 1
- sft模型预测时 会出现未输出完自己截断的问题 HOT 1
- pretrain的方法和之后的指令微调有什么不同
- 可以在单卡4090上面运行pretrain_main.py吗
- 基于SFT后的billa权重续训问题
- 增量预训练的loss请教
- The version Video-LLaMA-BiLLA use.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from billa.