GithubHelp home page GithubHelp logo

denisdsh / vizwiz-vqa-pytorch Goto Github PK

View Code? Open in Web Editor NEW
60.0 4.0 19.0 355 KB

PyTorch VQA implementation that achieved top performances in the (ECCV18) VizWiz Grand Challenge: Answering Visual Questions from Blind People

Python 43.87% Jupyter Notebook 56.13%
vizwiz vqa pytorch visual-question-answering

vizwiz-vqa-pytorch's Issues

Unable to reproduce results

Hi,
Thanks a lot for your implementation. I tried running it on the VizWiz data with the default hyperparameters in the repo. Unfortunately, the model only achieves around 36% accuracy on the test set, with around 49% on the train set, and hence suffers from a lot of overfitting. Can you please tell me if there are some other things to finetune or why that might be happening?
I'm using the data from here: https://vizwiz.org/tasks-and-datasets/vqa/

getting keyerror during training...

Model logs will be saved in logs/vizwiz/2022-03-03_15:53:09
train.py
train E000: 0% 0/156 [00:00<?, ?it/s]i = 0
item['question']] = tensor([18, 4, 41, 5, 2, 37, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0])
__getitem() 19905
self.img_names[i] = VizWiz_train_00000010.jpg
i = 128
item['question']] = tensor([ 3, 4, 6, 5, 880, 118, 9, 29, 63, 185, 610, 24,
373, 24, 2126, 127, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0])
__getitem() 19905
self.img_names[i] = VizWiz_train_00000010.jpg
train E000: 0% 0/156 [00:00<?, ?it/s]
Traceback (most recent call last):
File "train.py", line 204, in
main()
File "train.py", line 167, in main
train(model, train_loader, optimizer, tracker, epoch=i, split=config['training']['train_split'])
File "train.py", line 26, in train
for item in tq:
File "/usr/local/lib/python3.7/dist-packages/tqdm/std.py", line 1180, in iter
for obj in iterable:
File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py", line 521, in next
data = self._next_data()
File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py", line 1203, in _next_data
return self._process_data(data)
File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py", line 1229, in _process_data
data.reraise()
File "/usr/local/lib/python3.7/dist-packages/torch/_utils.py", line 434, in reraise
raise exception
KeyError: Caught KeyError in DataLoader worker process 0.
Original Traceback (most recent call last):
File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/_utils/worker.py", line 287, in _worker_loop
data = fetcher.fetch(index)
File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/_utils/fetch.py", line 49, in fetch
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/_utils/fetch.py", line 49, in
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/content/drive/MyDrive/VizWiz-VQA-PyTorch-master/datasets/vqa_dataset.py", line 118, in getitem
feature_id = self.name_to_id[img_name[10]]
KeyError: 'VizWiz_train_00000010.jpg'

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.