GithubHelp home page GithubHelp logo

Comments (8)

zhengyang-wang avatar zhengyang-wang commented on July 23, 2024

I haven't explored Tensorflow on multi-GPU currently.

from deeplab-v2--resnet-101--tensorflow.

myhooo avatar myhooo commented on July 23, 2024

I add one GPU at the line "os.environ['CUDA_VISIBLE_DEVICES'] = '1,3'" in the main.py and the code can run on these two GPUs. @zhoufengbuaa

from deeplab-v2--resnet-101--tensorflow.

John1231983 avatar John1231983 commented on July 23, 2024

I do not think so. For multiple GPU, you have to compute average gradient and batch normalization. It is very difficult. For easy, just compute average gradient and it will work. See the example of mnist dataset

from deeplab-v2--resnet-101--tensorflow.

FengLoveBella avatar FengLoveBella commented on July 23, 2024

@myhooo os.environ['CUDA_VISIBLE_DEVICES'] = '1,3' it is absolutely not ok, the gpu1 and gpu3 are allocated, but only the gpu1 is used for network.
@John1231983 I try a lot to use multi-gpu, I really compute average grads and average loss, but there is still some problem. reuse_variables and some else drive me crazy.

from deeplab-v2--resnet-101--tensorflow.

FengLoveBella avatar FengLoveBella commented on July 23, 2024

@zhengyang-wang It is very important to use large batch when semantic segmentation. Multi-gpu is absolutely a good chiose.

from deeplab-v2--resnet-101--tensorflow.

myhooo avatar myhooo commented on July 23, 2024

@zhoufengbuaa Thank you for telling me that I am wrong~ ^_^

from deeplab-v2--resnet-101--tensorflow.

zhengyang-wang avatar zhengyang-wang commented on July 23, 2024

@zhoufengbuaa I'm aware of that. However, there is an easy way as suggested by @John1231983, which is to use accumulated gradients. A similar way is used in the implementation of msc training. You can read my code to figure out how to do it. This approach allows you to use a large batch of larger patches, but it takes longer time to train.

from deeplab-v2--resnet-101--tensorflow.

John1231983 avatar John1231983 commented on July 23, 2024

I thinl gradient is one one problem of multiple gpu. The another is syn. batch norm statistic that is not support in tensorflow now

from deeplab-v2--resnet-101--tensorflow.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.