yejin0111 / add-gcn
ADD-GCN: Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition (ECCV 2020)
Home Page: https://arxiv.org/abs/2012.02994
File: util.py
Function: average_precision()
Question 1: A "ZeroDivisionError: float division by zero" often occurs in the function in util.py that calculates the average precision of each class. This may be because pos_count stays at zero, i.e. the +1 increment is never executed.
How can I solve this?
Question 2: The check label == 1 appears in two places in the code of average_precision(). Is this an error?
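One possible way to avoid the division by zero is sketched below. This is not the repository's util.py. It is a standalone pure-Python version that assumes a label of 1 marks a positive sample and any other value a negative one, and it returns 0.0 (an assumed convention) when a class has no positive samples. In this sketch a single label == 1 check handles both the counting and the precision accumulation.

```python
def average_precision(scores, labels):
    """Average precision for one class.

    Assumption: labels[i] == 1 means positive; any other value is negative.
    Returns 0.0 when the class has no positives instead of dividing by zero.
    """
    # Rank sample indices by descending score.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

    pos_count = 0
    precision_sum = 0.0
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            pos_count += 1
            # Precision at this rank: positives seen so far / samples seen so far.
            precision_sum += pos_count / rank

    if pos_count == 0:
        # Guard: no positive sample for this class.
        return 0.0
    return precision_sum / pos_count
```

If pos_count is zero for every class, it may be worth checking whether the label encoding in the dataset matches what the function expects (for example 0/1 versus -1/1).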
@Yejin0111 Hi, thanks for sharing the code base. I had a few queries.
Hello, when will the source code be released?
Hello, thank you for your work!
Question: assert os.path.exists(model_dir) == True
AssertionError
This problem appears often. How can I solve it?
Do you have a more detailed README.MD? What versions of CUDA and TorchVision do you use?
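The assertion fails when the path in model_dir does not exist. A minimal sketch of one workaround follows, assuming model_dir is only an output location for checkpoints; if it is expected to already contain pretrained weights, creating an empty directory will not help and the path itself needs correcting. The helper name ensure_model_dir is hypothetical and not part of the repository.

```python
import os


def ensure_model_dir(model_dir):
    """Create model_dir (and any missing parents) if needed, then return it.

    Hypothetical helper: assumes model_dir is a writable output directory.
    """
    os.makedirs(model_dir, exist_ok=True)
    return model_dir
```

Calling ensure_model_dir(model_dir) before the assert would make the check pass for a fresh output directory.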
Could you please share your training command? Thanks!
Hello. Thank you for sharing your excellent work!
Since I am new to computer vision and deep learning, I have some questions about the code.
Line 137 in b7653fe
I wonder why the final z is added to v?
Another question is:
Is it feasible to apply this method to regression problems?
Or can you provide some ideas about it?
Looking forward to your response.
How can I reach the top mAP (96%)?
My best mAP is 92%. How can I overcome the problem?
args.seed = 1
args.lr = 0.05
args.image_size = 448
args.batch_size = 18 * gpu_num
args.epoch_step = [30, 40]
the test size is 576
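How can I get the visualization of category-specific activation maps? Can you provide the code?
Hello, the paper says "we simply replace SAM with a Conv-LReLU block." The shape of X is H * W * C, while D-GCN requires an input of shape C * D. A convolution alone does not seem able to reduce the number of dimensions of the tensor, so how is the shape converted here?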
Hi, thanks for your excellent work.
I noticed that you wrote in your paper and code, "we simply average s_m and s_r to produce the final scores s".
Then I experimented with only s_m as the final score, and the resulting mAP was similar or even higher.
Does this mean that the GCN played little role in this work?
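For reference, the two scoring variants being compared can be sketched as follows. This is only an illustration of the averaging quoted from the paper, written in plain Python with per-class score lists. The function name fuse_scores is hypothetical and not taken from the repository.

```python
def fuse_scores(s_m, s_r):
    """Element-wise average of two per-class score lists (s = (s_m + s_r) / 2)."""
    if len(s_m) != len(s_r):
        raise ValueError("s_m and s_r must have the same number of classes")
    return [(m + r) / 2.0 for m, r in zip(s_m, s_r)]


# Variant from the paper quote: average both branches.
# final_scores = fuse_scores(s_m, s_r)
# Variant tried in this question: use s_m alone.
# final_scores = s_m
```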
args.seed = 1
args.lr = 0.05
args.image_size = 448
args.batch_size = 16 * 2
args.epoch_step = [30, 40]
the test size is 576
I followed the configuration above and used the model trained on COCO as the pre-trained model for Pascal VOC, but the best test mAP on VOC2007 is 94.04%. How can I overcome the problem?
On VOC2007, when the image size is 576x576 and the batch size per GPU is 18 (16), an out-of-memory error is printed. I am using 4 x 2080Ti, 11G.
How much GPU memory do you have?
@Yejin0111 Can you please share your pretrained models on Google Drive or OneDrive? It would be helpful.
Thanks in advance.
As I don't see a license on GitHub, may I quote and modify parts of your code in my own project?
My project is related to the face task. I would like to introduce the attention part and the GCN part into my method.
If allowed, I will state the reference in the code.
Looking forward to your reply.
Hello, thank you for your work! How can I get the visualization of the dynamic matrix? Can you provide the code?
Hi, thanks for this great work. I have found that setting cudnn.deterministic to False in line 49 of main.py makes training much faster and sometimes leads to better accuracy of the final model. I have observed this phenomenon with COCO, VOC, and even my private dataset. Do you think this is a bug?