GithubHelp home page GithubHelp logo

google-research / nested-transformer Goto Github PK

View Code? Open in Web Editor NEW
190.0 8.0 24.0 456 KB

Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf

License: Apache License 2.0

Python 35.14% Shell 0.18% Jupyter Notebook 64.68%
imagenet transformer vision

nested-transformer's Issues

Discrepancies vs Table A1 in paper

I noticed some possible discrepancies of the architecture parameters here vs those in table A1 of the paper

image

For ImageNet models, is it correct that:

  1. The table should say h=[3,3,4]?
  2. The order of the scale_hidden_dims in the table is inverted. That is, hierarchies 1, 2 and 3 should say [4d, 4h] × 2, 1 [2d, 2h] × 2, 4 [d, h] × k, 16?

Model Converge Problem

I am training on a medium-scale dataset that consists of 100,000 images. The learning rate and weight decay as the same as your config but still not working. Any opinion?

Regards,
Khawar Islam

Training hours & Imagenet accuracy

Hello, thanks for sharing your interesting work.

I was trying to reproduce the NesT-T ImageNet result in this link using TPUs.

Here are my TPU-v3 8 cores result (link) by using exactly the same hyperparameters in imagenet_nest_tiny.py

As you can see, it takes 63 hours for training while your result takes 21 hours.
How can I reduce training hours such as your result?
If this difference came from the data loading time, could you tell me the types of data storage that you used?
Right now, I'm using the google cloud storage bucket for data storage.

Furthermore, I can see the accuracy difference around 0.5% (81.0 v.s. 81.5).
Could you explain this difference?

Regarding GradCAT implementation

Hi
I'm interested to work with the Nest model, however I'm facing difficulty with the implementation of GradCAT. Could you please share the implementation?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.