GithubHelp home page GithubHelp logo

Comments (4)

dvornikita avatar dvornikita commented on July 24, 2024

The right way to do it is to change the parameters of the initial tiling in the file boxer.py. The class PriorBoxGrid generates the tiling for a square area. If you make the tiling of an area with aspect ratio 1/2 you will have the right number of boxes that match the outputs of the convolutions.

Also, you may not want to do that at all because during the data augmentation parts of an image are randomly cropped and rescaled to a square. Since there are no constraints on cropping aspect ratio, the final aspect ratio of the patch is different from the original image.
What I'm saying is that even if you change the aspect ratio of the full pipeline you will still train on warped patches. If you train without random crops, the performance if likely to drop for the reason of lacking data augmentation.

from blitznet.

revilokeb avatar revilokeb commented on July 24, 2024

@dvornikita yes, to adjust the tiling does make more sense.

Thank you also for pointing out that data augmentation does random cropping and then rescaling to square. That seems ok for my case however since I need to attach further network heads to the trunk which will require that aspect ratio.

I will close this when I have adjusted the tiling

from blitznet.

dvornikita avatar dvornikita commented on July 24, 2024

No problems.

from blitznet.

juliagviu avatar juliagviu commented on July 24, 2024

Could you please @revilokeb explain deeper how you adjusted the tiling? I am working with 512x1024 images and need to solve the same issue, but I am not sure how to make initial tilings of ratio 1/2. Thanks very much!

from blitznet.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.