GithubHelp home page GithubHelp logo

afformer's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

afformer's Issues

分割模型的解码问题

你好,CLS的decoder返回结果,还需要经过什么操作怎么才能转成语义分割中每一类的像素概率呢,我单独测试的模型返回结果有负数值,你能解答我的疑惑吗?谢谢
image

Similarity kernel in the FSK module

In the manuscript, you used the key k and value v to calculate the self-attention weight matrix. But most transformers used the query and key to calculate the weight matrix. There is no difference in codes, but the expression ( k and v for computing self-attention weight) is confusing. Could you give me an explanation in-depth?

Training outcome problem

@dongbo811 Thank you for your work! I trained 150000epoch on the model based on cityscape data as required, but the final miou was only about 74.23, which is still a little short of the 78.7 mentioned in your paper. Here is my training log file, could you please help me check the problem? Thank you very much!
20230802_150709.log

Number of GPUs used in the training process

Thanks for your fantastic work!
I am curious that how many GPUs (maybe V100?) were used in the training (because I am trying to explore some downstream tasks based on your model.)
I have checked out the configuration folder and found that samples_per_gpu is set to 2 for Cityscapes and 4 for ADE20K. You said that the batch size is set to 16 and 8 for Cityscapes and ADE20K respectively, so you probably used 4 V100?

mmseg - WARNING - The model and loaded state dict do not match exactly

mmseg - WARNING - The model and loaded state dict do not match exactly
size mismatch for stem.0.conv.weight: copying a param with shape torch.Size([32, 3, 3, 3]) from checkpoint, the shape in current model is torch.Size([16, 3, 3, 3]).
size mismatch for stem.0.bn.weight: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([16]).
size mismatch for stem.0.bn.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([16]).
size mismatch for stem.0.bn.running_mean: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([16]).
好像是预训练权重文件 和模型不匹配,请问这个怎么修改呢?

The model and loaded state dict do not match exactly

您好,我使用了配置文件中的AFFormer_small_cityscapes.py并从下载了您提供的AFFormer_small_cityscapes.pth并将其加载到了配置文件中,为什么还是会有这个警告,并抛出StopIteration异常?
屏幕截图 2024-03-04 155045
屏幕截图 2024-03-04 155309
屏幕截图 2024-03-04 155111
屏幕截图 2024-03-04 155715
非常渴望能获得您的帮助

Why is the size of stages 2-4 designed like this?

Dear author, thank you for your contribution. I have a question: the size of the four stages of the deformer has been reduced by half in sequence. In contrast, the stages 2-4 of this model are all H/8 * W/8 in size. What are the considerations for this design?

代码

这个与AFFormer有关的代码不完整吧

On the embodiment of heterogeneity

Hi@dongbo811 ,
You mentioned parallel heterogeneous architecture in your article. In which aspect or module is the concept of heterogeneity reflected? Does it refer to the cluster center of each grid on prototype G? Please point out the error in my understanding. Thank you

mmsegmentation version

Hi, there. Thanks for your wonderful work.
I have a question about the mmsegmenation version when cuda11.3 pytorch 1.10.2 and mmcv==1.5.0
Looking forward to your reply as soon as possible.
Thank you in advance.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.