GithubHelp home page GithubHelp logo

camlaedtke / segmentation_pytorch Goto Github PK

View Code? Open in Web Editor NEW
78.0 78.0 10.0 137.31 MB

Simple image segmentation pipeline in pytorch, using HRNet and SegFormer models

Jupyter Notebook 99.18% Python 0.82%

segmentation_pytorch's People

Contributors

camlaedtke avatar jo-jstrm avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

segmentation_pytorch's Issues

Binary segmentation insides

Hello, I would like to ask you if its possible to use your implementation for a binary segmentation task on a grayscale image data. And if yes, what the simplest way would look like, taking into consideration the loss function and the model parameters, such as num_classes or in_chans?

Thank you

Performance achieved by this implementation

Hi, Thanks for contributing to this repo, which makes me easier to understand SegFormer's architecture.
Did you validate the performance achieved by this implementation? Could it reach the results reported in the paper?
I would appreciate it a lot for your answer.
Best regards

when in used your segformer model load mit_b2, tips occured

missing keys in source state_dict: linear_c4.proj.weight, linear_c4.proj.bias, linear_c3.proj.weight, linear_c3.proj.bias, linear_c2.proj.weight, linear_c2.proj.bias, linear_c1.proj.weight, linear_c1.proj.bias, linear_fuse.weight, linear_fuse.bias, linear_pred.weight, linear_pred.bias

pre-trained models

Hello author, could you please tell me which pre-trained models are used in the code, such as mit_b0.pth?
Thank you very much for your help.

RFC: Bring implementation closer to original

First of all, thank you very much for the great implementation of SegFormer!

While working with your SegFormer version and the pre-trained weights of the original paper, I noticed some minor inconsistencies.

  1. The defaults in the constructor are mixed from different SegFormer versions (or from no version at all).
    • The combination of embed_dims=[64, 128, 256, 512], is not mentioned in the paper
    • depths=[3, 6, 40, 3] and mlp_ratios=[4, 4, 4, 4] are from version b5
    • decoder_dim = 256 is from version b0
  2. There is a sneaky batch norm layer in the original code that your implementation is missing.

Further I managed to convert the pre-trained weights from the original repo to your implementation. Would you like me to add a small conversion util to your repo?

I already opened a PR, but would of course like to discuss the changes here first ๐Ÿฅณ.

Applying Sigmoid/Softmax for the output of model

Hello
How are you?
Thanks for contributing to this project.
I found that the sigmoid/softmax activation is NOT applied for the model's output when it is evaluated.
Please check the function "validate" in your notebook file "segmentation_segformer.ipynb".
Could u explain the reason?
Thanks

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.