batra-mlp-lab / avsd
[CVPR 2019] PyTorch code for Audio Visual Scene-Aware Dialog
Home Page: https://video-dialog.com
Hi,
The extracted features can be found here https://drive.google.com/drive/folders/14zlHmNFkCgptiGttwWKrsaaz5vVUFs00?usp=sharing
Originally posted by @hudaAlamri in #2 (comment)
Hi there,
Thanks for sharing!
I am running this code as a baseline. However, I cannot find svqa_utils,
which is used at
Line 7 in 08b4697
Traceback (most recent call last):
  File "/content/drive/My Drive/Colab Notebooks/train.py", line 167, in
    enc_out = encoder(batch)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 722, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/content/drive/My Drive/Colab Notebooks/encoders/lf.py", line 92, in forward
    hist_embed = self.hist_rnn(hist_embed, batch['hist_len'])
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 722, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/content/drive/My Drive/Colab Notebooks/utils/dynamic_rnn.py", line 34, in forward
    sorted_seq_input, lengths=sorted_len, batch_first=True)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/utils/rnn.py", line 234, in pack_padded_sequence
    lengths = torch.as_tensor(lengths, dtype=torch.int64)
RuntimeError: CUDA error: device-side assert triggered
ERROR WHILE RUNNING train.py
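A possible way to narrow this down (a sketch only, not a confirmed diagnosis of this issue): device-side asserts are reported asynchronously, so the failing line in the traceback is not necessarily the real culprit. Re-running on CPU, or with the environment variable CUDA_LAUNCH_BLOCKING=1, usually gives a clearer message. Two common causes worth ruling out are sequence lengths that are zero or longer than the padded dimension (pack_padded_sequence requires every length to be greater than 0) and token ids outside the embedding vocabulary. The helpers below are hypothetical, are not part of this repository, and work on plain Python lists (for example, values obtained with .tolist() from the batch):

```python
def find_bad_lengths(lengths, max_len):
    """Return (index, length) pairs that are <= 0 or exceed the padded length."""
    return [(i, n) for i, n in enumerate(lengths) if n <= 0 or n > max_len]


def find_bad_token_ids(rows, vocab_size):
    """Return (row, col, token_id) triples that fall outside [0, vocab_size)."""
    return [
        (r, c, tok)
        for r, row in enumerate(rows)
        for c, tok in enumerate(row)
        if not 0 <= tok < vocab_size
    ]


# Hypothetical values standing in for batch['hist_len'] and a padded length of 10.
hist_len = [5, 0, 12, 3]
print(find_bad_lengths(hist_len, max_len=10))  # [(1, 0), (2, 12)]
```

If either check reports entries, the data preprocessing (rather than the model code) is the more likely place to look.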
Hi, thanks for your great work.
May I ask if it is possible to share the data files (all the h5 and json files)?
The instructions on data preparation are unclear.
Thank you very much.
When I run 'python train.py' with the default settings and the default feature dataset provided in this GitHub repository, I get the following error.
Traceback (most recent call last):
  File "****/avsd/train.py", line 95, in <module>
    dataset = VisDialDataset(args, ['train'])
  File "******/avsd/dataloader.py", line 157, in __init__
    self._process_history(dtype)
  File "********/avsd/dataloader.py", line 296, in _process_history
    = captions[th_id][:max_ques_len + max_ans_len]
RuntimeError: The expanded size of the tensor (44) must match the existing size (40) at non-singleton dimension 0. Target sizes: [44]. Tensor sizes: [40]
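The message says that a 40-element source is being assigned into a 44-element destination slot, so the two lengths disagree. One generic workaround, sketched here under the assumption that the stored captions are simply shorter than max_ques_len + max_ans_len, is to copy only the overlapping prefix and pad the rest. The function copy_prefix is hypothetical and is not part of dataloader.py; it uses plain Python lists for illustration, and whether padding is the right fix for this dataset (as opposed to regenerating the features with matching lengths) is not confirmed by the thread.

```python
def copy_prefix(dest, src, pad_value=0):
    """Copy as much of src as fits into dest; pad any remaining slots in place."""
    n = min(len(dest), len(src))
    dest[:n] = src[:n]                           # overlapping prefix
    dest[n:] = [pad_value] * (len(dest) - n)     # fill the tail, if any
    return dest


# Sizes taken from the error message: destination 44, source 40.
row = copy_prefix([0] * 44, list(range(1, 41)))
print(len(row), row[38:])  # 44 [39, 40, 0, 0, 0, 0]
```

The key point is that both sides of the assignment are sliced to the same length n before copying.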