Training with GPUs (deepmath issue, CLOSED)

tensorflow commented on June 3, 2024
Training with GPUs

from deepmath.

Comments (3)

kbansal commented on June 3, 2024

I assume you are referring to TF_NEED_CUDA in the Dockerfile. Yes, you may enable it. We have only tested the evaluation externally on CPU, but it should work with GPU as well, although it is possible that some other Dockerfile changes are needed for the Docker image to see the GPU. To keep it broadly usable, we don't enable it by default. If you try it, we would be happy to hear about your attempt.

einsiedler0408 commented on June 3, 2024

@kbansal, I more or less figured out how to use multiple GPUs for training. The most important changes are (1) add a .bazelrc to the deepmath root directory containing "import %workspace%/tensorflow/.bazelrc", and (2) change the TensorFlow version to r1.12 (when using tf.contrib.distribute.MirroredStrategy, there is no compatible tensorflow-estimator version for your current TF version).
I am not using Docker at the moment. When I have time, I may send you a corresponding Dockerfile.
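
For readers following along, here is a minimal sketch of what wiring MirroredStrategy into an Estimator run configuration could look like on TensorFlow r1.12. It is not the code used in this thread. The helper names `per_replica_batch_size` and `make_run_config` are hypothetical, and how deepmath's experiments.py would consume the resulting RunConfig is not covered here. The batch-size helper assumes that the batch size set in the input function applies per GPU, which is worth checking for your TF version.

```python
def per_replica_batch_size(global_batch_size, num_gpus):
    """Split a global batch evenly across GPUs (hypothetical helper)."""
    if num_gpus < 1:
        raise ValueError("num_gpus must be >= 1")
    if global_batch_size % num_gpus != 0:
        raise ValueError("global batch size must divide evenly across GPUs")
    return global_batch_size // num_gpus


def make_run_config(num_gpus, model_dir):
    """Build an Estimator RunConfig that mirrors the model across GPUs.

    Assumes TensorFlow r1.12, where MirroredStrategy lives under tf.contrib.
    """
    # Imported lazily so the batch-size helper works without TensorFlow.
    import tensorflow as tf

    strategy = tf.contrib.distribute.MirroredStrategy(num_gpus=num_gpus)
    return tf.estimator.RunConfig(model_dir=model_dir,
                                  train_distribute=strategy)


# Example: a global batch of 32 spread over 4 GPUs.
print(per_replica_batch_size(32, 4))
```

The RunConfig returned by `make_run_config` would then be passed as `config=` when constructing a `tf.estimator.Estimator`.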

I have some new questions: (1) I am running deepmath/deephol/train/experiments.py. Is that right? (2) What batch size did you actually use in the HOList paper?

kbansal commented on June 3, 2024

1 - Yes, that is the training code. We had not tried the training code outside of Google ourselves and provided it as a reference only, so we are glad to hear you were able to use it.

2 - I asked my colleagues, and their recollection is 16 or 32. But try as large a batch_size as your available memory allows.
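
One way to act on the "as large as memory allows" advice is to start from a candidate batch size and halve it until a trial step succeeds. The sketch below is only an illustration, not something from the deepmath code base. The function name, the `try_step` callback, and the use of `MemoryError` as the default out-of-memory signal are all assumptions. With TensorFlow you would likely pass `tf.errors.ResourceExhaustedError` as `oom_error` instead.

```python
def largest_fitting_batch_size(try_step, start=32, minimum=1,
                               oom_error=MemoryError):
    """Halve the batch size until try_step(batch_size) stops running out of memory.

    try_step is a hypothetical callback that runs one training step at the
    given batch size and raises oom_error if it does not fit.
    """
    if minimum < 1:
        raise ValueError("minimum must be >= 1")
    batch_size = start
    while batch_size >= minimum:
        try:
            try_step(batch_size)
            return batch_size
        except oom_error:
            batch_size //= 2  # back off and retry with half the batch
    raise RuntimeError("no batch size >= minimum fit in memory")


def fake_step(batch_size):
    """Stand-in for a real training step: pretend anything above 10 is too big."""
    if batch_size > 10:
        raise MemoryError("simulated out-of-memory")


print(largest_fitting_batch_size(fake_step, start=32))
```

Setting `start` above 32 lets the search probe larger batches first if the hardware has room for them.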
