Comments (5)
I just opened the implementation that you have referred above, and I noticed that it is an edited version of my repository. It is unprofessional of @speedinghzl to use my code without citing my repo even though I have MIT license. If you are talking about this file, then it seems that this version does not implement loss over multiple scales. This allows it to have a higher batch size. @speedinghzl might have done some other modification to my code, which I do not know of.
from pytorch-deeplab-resnet.
I guess, you cannot run my code with 11.1 GB of GPU memory. If you modify this line, you can make it fit to lower memory. Change random.uniform(0.5, 1.3) to random.uniform(0.5, 1.1) or some even lower number. Lower it in steps till it fits in your GPU. Also, in the original code which is in caffe, the caffe parameter in the solver file iter_size
is 10, which means that gradient is acummulated over 10 iterations and then applied. This is what I have done in my code. It is kind of equivalent to setting the batch_size as 10 (see iter_size
at this link). The library that you mentioned might be doing this.
from pytorch-deeplab-resnet.
But the time taken is not affected by using iter_size. I wanted to increase the batch size in order to decrease the time taken. I am running on a different dataset with 480x853 size images. Maybe that is why I am able to run within 7-8 GB. What image size did you test with? Also, the library mentioned is not using the iter_size trick. It is passing the batch_size parameter (value=10) to the Pytorch DataLoader (I tried using the mentioned library with images of size 321x321 and could run with a batch size of 10).
from pytorch-deeplab-resnet.
A lot of copy paste ai these days...
@isht7 thanks for this repo!
from pytorch-deeplab-resnet.
Thank you for the responses.
from pytorch-deeplab-resnet.
Related Issues (20)
- Why is loss for multiple scales calculated in a strange way? HOT 2
- Arbitrary batch size still unimplemented?
- Training produces model generating blank segmentations HOT 6
- How do get car mask only? HOT 1
- how does iter_size work? HOT 1
- spatial sizes mismatch HOT 1
- the result? HOT 4
- Why change the image channel order after `cv2.imread`
- No Relu in the ClassifierModule HOT 1
- Frozen the statistics of BN? HOT 5
- The size of the prediction HOT 2
- Bad mIOU tested with provided model HOT 8
- problem of train.py HOT 1
- ASPP or LargeFOV? Should be 76.35%.
- docopt HOT 2
- network stucture issue
- read the ground truth of the pascal voc dataset
- where is the crfs implementation? HOT 1
- About image preprocessing HOT 2
- performance HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pytorch-deeplab-resnet.