jasongief / psp_cvpr_2021 Goto Github PK
View Code? Open in Web Editor NEW[2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line
[2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line
Hi, I think "v_dim=hidden_dim" should be changed to "v_dim=v_dim" on line 197.
Hello, I would like to ask why only the first category of video classification (Church bell) labels per second are given in right_labels.h5, only the first 188 rows show whether there is an event per second, and the values of all subsequent rows It is all 0. Logically, it should contain all the labels of all videos for 10 seconds. If an event occurs, it will be 1, otherwise is background, is 0.
I am very sorry to bother you all the time, and I am very grateful for your contribution.
Hi,
Thanks for your code and congrats on the paper accept. I was going through your code and your model definition. I have 2 enquiries:
Thanks!
Dear authors,
Could you please kindly let me know how many GPUs you used for training the model? and how long will it take? Do you set the early stop or specify the overall training epochs?
Thanks a lot for your help!
Hello, I would like to ask what information is prob_label generated based on? At first I thought it corresponds to the information in annotation.txt, because they all have 4143 lines, but I found that actually prob_label does not correspond to annotation, because the maximum probability of displaying the second line in prob_label is the background.
.
Good work! When I train from scratch for the fully supervised setting, the AVE localization accuracy can achieve 75.8% or 75.2% with different seeds. Any suggestions?
Dear authors,
Thanks for opening this work!
I have downloaded the data as:
data/AVE_Dataset
data/audio_feature.h5 prob_label.h5 right_labels.h5 visual_feature.h5
Looks like it miss the other files e.g. audio_feature_noisy.h5 visual_feature_noisy.h5
Besides, could you please let me know the environment you use?
Can you provide the code to generate the attention map? I tried to run "attention_visualization" of AVE-ECCV18, and use “PSP_fully.pt” to replace the model of the original code, but an error "Nonetype object has no attribute register_forward_hook" appeared, I know it was caused by the replacement of the model, but I don’t know how to solve.
Hi, nice work!
I was very sorry to bother you. When I used your code, there were two results(val and test), I had read your paper, but I did not see the description of result. So, which result did you use in your paper? val or test dataset?
Thank you very much!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.