afformer's People
Forkers
onuralpszr dl-vit little-white001 luffy03 cv-seg d710055071 1787648106 wangxder hell-to-heaven sujith-kumaraafformer's Issues
分割模型的解码问题
KeyError: "EncoderDecoder: 'afformer_base is not in the models registry'"
Hello, thank you for the excellent paper, I also try to reproduce the model in the paper, but when I load the model it will appear KeyError: "EncoderDecoder: 'afformer_base is not in the models registry'". I hope you can answer my questions in your busy schedule
Any config suggestions for afformer large?
Thanks for your great work. Are there possible configs of large models that still have competitive performance?
请问训练迭代次数,在哪里调呢?
Similarity kernel in the FSK module
In the manuscript, you used the key k and value v to calculate the self-attention weight matrix. But most transformers used the query and key to calculate the weight matrix. There is no difference in codes, but the expression ( k and v for computing self-attention weight) is confusing. Could you give me an explanation in-depth?
pretrained models
excuse me, could you share the pretrained model in aliyunpan ?
Training outcome problem
@dongbo811 Thank you for your work! I trained 150000epoch on the model based on cityscape data as required, but the final miou was only about 74.23, which is still a little short of the 78.7 mentioned in your paper. Here is my training log file, could you please help me check the problem? Thank you very much!
20230802_150709.log
can't open file demo\image_demo.py'
can't open file demo\image_demo.py': No such file or directory
Number of GPUs used in the training process
Thanks for your fantastic work!
I am curious that how many GPUs (maybe V100?) were used in the training (because I am trying to explore some downstream tasks based on your model.)
I have checked out the configuration folder and found that samples_per_gpu is set to 2 for Cityscapes and 4 for ADE20K. You said that the batch size is set to 16 and 8 for Cityscapes and ADE20K respectively, so you probably used 4 V100?
mmseg - WARNING - The model and loaded state dict do not match exactly
mmseg - WARNING - The model and loaded state dict do not match exactly
size mismatch for stem.0.conv.weight: copying a param with shape torch.Size([32, 3, 3, 3]) from checkpoint, the shape in current model is torch.Size([16, 3, 3, 3]).
size mismatch for stem.0.bn.weight: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([16]).
size mismatch for stem.0.bn.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([16]).
size mismatch for stem.0.bn.running_mean: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([16]).
好像是预训练权重文件 和模型不匹配,请问这个怎么修改呢?
The model and loaded state dict do not match exactly
你好,我在使用您的模型在ADE20K上进行训练时,发现达不到您在论文中的mIoU指标,发现有10个点的下降
这是我的日志,mIoU只有31.31,但在您的论文的Table1中mIoU却是41.8
20231028_132107.log
Why is the size of stages 2-4 designed like this?
Dear author, thank you for your contribution. I have a question: the size of the four stages of the deformer has been reduced by half in sequence. In contrast, the stages 2-4 of this model are all H/8 * W/8 in size. What are the considerations for this design?
能说一下怎么转成onnx模型吗
后续还会继续更新、优化吗
mmseg/backbones中没有afformer_base.py? 核心代码没有放出来
AssertionError: MMCV==1.5.0 is used but incompatible. Please install mmcv>=2.0.0rc4.
代码
这个与AFFormer有关的代码不完整吧
On the embodiment of heterogeneity
Hi@dongbo811 ,
You mentioned parallel heterogeneous architecture in your article. In which aspect or module is the concept of heterogeneity reflected? Does it refer to the cluster center of each grid on prototype G? Please point out the error in my understanding. Thank you
mmsegmentation version
Hi, there. Thanks for your wonderful work.
I have a question about the mmsegmenation version when cuda11.3 pytorch 1.10.2 and mmcv==1.5.0
Looking forward to your reply as soon as possible.
Thank you in advance.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.