Comments (9)
Hi, I used the 40GB gpu.
Data reading was not a major speed bottleneck, but you could try to resize the image on hard drive because some images could have a high resolution.
from albef.
from albef.
Using 8 A100 GPUs, it takes 2-3 days with 4M images, and around 7-8 days with 14M images. You could make training faster by reducing the image resolution to 224 and increasing the batch size, the performance would be roughly the same. You can also try some other memory reduction techniques such as zero-redundancy optimizer.
from albef.
from albef.
Yes I think you could.
The pre-training dataset annotation (image paths and text) is released.
from albef.
Hi, do you get similar performance using 8 V100 GPUs?
can i use 8 32gb-v100 gpus to reproduce your training result? by the way, the code of data preprocessing (filter some pairs) will be released?
…
------------------ Original ------------------ From: Junnan Li @.> Date: Wed,Dec 8,2021 6:05 PM To: salesforce/ALBEF @.> Cc: shoutOutYangJie @.>, Author @.> Subject: Re: [salesforce/ALBEF] what size of your A100 gpu's memory? (Issue#31) Using 8 A100 GPUs, it takes 2-3 days with 4M images, and around 7-8 days with 14M images. You could make training faster by reducing the image resolution to 224 and increasing the batch size, the performance would be roughly the same. You can also try some other memory reduction techniques such as zero-redundancy optimizer. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe. Triage notifications on the go with GitHub Mobile for iOS or Android.
from albef.
from albef.
@shoutOutYangJie
Hi, it seems that you had reproduced the results with 8 V100 GPUs.
- Did you use the same configurations as in Pretrain.yaml?
- How many hours per epoch it took for the training?
- Have you tried to reduce the image resolution from 384 to 224?
Looking forward to your reply.
from albef.
from albef.
Related Issues (20)
- Momentum parameter HOT 2
- NLVR2 Pretrain HOT 1
- Cannot install in python3.6 HOT 1
- change english text_encoder to other language? HOT 2
- Question about answer ranking HOT 2
- Zero-shot capabilities on ImageNet HOT 2
- state_dict = checkpoint['model'] KeyError: 'model' When I using flickr30k.pth HOT 2
- Grounding det.json file for other grounding datasets
- pretrain task
- utils.init_distributed_mode(args) Fail HOT 1
- About dropout and no_grad.
- refcoco on lower resolution
- ITM loss HOT 1
- RefCOCO+ Fine-tuning
- TypeError: '<=' not supported between instances of 'float' and 'str' ? HOT 1
- How can I get Visual Genome ? HOT 2
- ITC & ITM & MLM weight distribution
- RuntimeError: invalid multinomial distribution (sum of probabilities <= 0)
- '/export/share HOT 2
- The code for loss computation of itc is not corresponding to the original paper HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from albef.