GithubHelp home page GithubHelp logo

Comments (5)

colesbury avatar colesbury commented on May 3, 2024
  • The 7x7 convolution is zero-padded by 3 on each side, so a total padding of 6 on the width and 6 on the height. The max pooling is also padded (by 1 on each side). I think max pooling is padded with -inf instead of zero. The Torch documentation makes it sound like a total padding of 3, but I don't think that's how it's implemented. (@soumith, is this correct?)
  • Yes, this is a bug -- thanks for catching it. Because of this, the ResNet-200 model has an extra batch norm & ReLU in the first layer, which it shouldn't.
  • The extra copy is to make the -shareGradInput option work correctly with this model. This option re-uses the CUDA storages for gradInput when computing the backward pass to save memory, but the implementation is a fragile hack.

from fb.resnet.torch.

soumith avatar soumith commented on May 3, 2024

yes it's symmetric padding, 3 on each side.

from fb.resnet.torch.

anibali avatar anibali commented on May 3, 2024

@colesbury Why is there no nn.Copy for cifar10?

from fb.resnet.torch.

colesbury avatar colesbury commented on May 3, 2024

@anibali, there should probably be a copy there too in case you run with -shareGradInput

from fb.resnet.torch.

anibali avatar anibali commented on May 3, 2024

Great, thanks. Just trying to figure out how everything works :)

from fb.resnet.torch.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.