fpsluozi / tofindwaldo

Official Repo for "To Find Waldo You Need Contextual Cues: Debiasing Who’s Waldo", ACL 2022 (Short)

natural-language-processing computer-vision multi-modal-learning dataset

tofindwaldo's Introduction

To Find Waldo You Need Contextual Cues: Debiasing Who’s Waldo

This is the official repository for "To Find Waldo You Need Contextual Cues: Debiasing Who’s Waldo".

Prerequisites

  1. Follow the instructions from the original Who's Waldo work to obtain the original dataset and the source code.

  2. To generate the required bottom-up image features, use either the original repo or the PyTorch re-implementation, at your discretion.

  3. To train/test with our splits, replace ./dataset_meta/ in the original source code repo with ours and rerun the data preprocessing steps. We also provide a customizable training config file, config/train-whos-waldo-new-finetune.json, for convenience.
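
The directory swap in step 3 could be scripted roughly as follows. This is a minimal sketch, not part of the official repo: the function name and both checkout paths are hypothetical, and keeping the original splits as dataset_meta.orig/ is an added precaution rather than something the instructions require.

```python
import shutil
from pathlib import Path


def replace_dataset_meta(original_repo: Path, debiased_repo: Path) -> Path:
    """Swap the original repo's dataset_meta/ for the one shipped here.

    Both arguments are hypothetical local checkout paths. The previous
    directory is kept as dataset_meta.orig/ so the swap can be undone.
    """
    source = debiased_repo / "dataset_meta"
    target = original_repo / "dataset_meta"
    backup = original_repo / "dataset_meta.orig"

    if not source.is_dir():
        raise FileNotFoundError(f"missing split directory: {source}")
    if target.exists() and not backup.exists():
        target.rename(backup)      # first run: keep the original splits
    elif target.exists():
        shutil.rmtree(target)      # a backup already exists, so overwrite
    shutil.copytree(source, target)
    return target
```

After the swap, the data preprocessing steps from the original repo still need to be rerun so that the new splits take effect.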

License

MIT

Citation

@inproceedings{luo-etal-2022-find,
    title = "To Find Waldo You Need Contextual Cues: Debiasing Who{'}s Waldo",
    author = "Luo, Yiran and Banerjee, Pratyay and Gokhale, Tejas and Yang, Yezhou and Baral, Chitta",
    booktitle = "Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)",
    month = may,
    year = "2022",
    url = "https://aclanthology.org/2022.acl-short.39",
    pages = "355--361",
}

tofindwaldo's Issues

Duplicate IDs

There appear to be some duplicate IDs in the splits.

I used the following Python script to identify them:

seen = set()
# set.add() returns None, so x is kept only if it was already in seen.
dupes = [x for x in ids if x in seen or seen.add(x)]

Here, ids is a list of all the IDs in the train, test, and val splits combined.

The following file contains the duplicates I found: dupes.txt

Please correct me if I'm wrong on this.
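
To see which splits each duplicate comes from, the same check can be run per split. The sketch below assumes the IDs have already been loaded into one list per split; the function name and the input layout are illustrative and not taken from the repo.

```python
from collections import defaultdict


def find_duplicates(splits):
    """Return {id: [split names]} for every ID that occurs more than once.

    `splits` maps a split name to its list of IDs, for example
    {"train": [...], "val": [...], "test": [...]}.
    """
    locations = defaultdict(list)
    for name, ids in splits.items():
        for example_id in ids:
            locations[example_id].append(name)
    # Keep only IDs seen at least twice, within one split or across splits.
    return {i: names for i, names in locations.items() if len(names) > 1}
```

An ID that maps to two different split names indicates leakage between splits, while a repeated name indicates a duplicate inside a single split.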

About max_txt_len.

May I ask whether your implementation filters out examples with text length > 100 during training and testing?
Are the results reported in your paper based on this implementation?
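
For reference, the kind of filtering being asked about would look roughly like this. It is only an illustration of the question, not a description of what the code base actually does: the "caption" key is hypothetical, and whitespace splitting stands in for whatever tokenizer the model uses.

```python
def split_by_length(examples, max_txt_len=100, tokenize=str.split):
    """Partition examples into (kept, dropped) by caption token count.

    `examples` is a list of dicts with a hypothetical "caption" key;
    `tokenize` defaults to whitespace splitting as a stand-in for the
    model's real tokenizer.
    """
    kept, dropped = [], []
    for ex in examples:
        n_tokens = len(tokenize(ex["caption"]))
        # Examples longer than max_txt_len are the ones that would be dropped.
        (kept if n_tokens <= max_txt_len else dropped).append(ex)
    return kept, dropped
```

Comparing len(dropped) on the test split with and without the limit would show how much such a filter could affect the reported numbers.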

About the visual feature extraction.

Hi, could you provide a link for your extracted visual features?

I extracted the image features from the bounding boxes with this repo, but training on these features resulted in NaN.

Thanks!
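
One way to narrow this down is to check whether the extracted features already contain NaN or infinite values before they reach the model. The sketch below is a generic diagnostic under that assumption; the function name is illustrative, and it works on plain lists of floats as well as on the rows of a 2-D array.

```python
import math


def count_non_finite(features):
    """Return {row_index: count} for rows containing NaN or +/-inf values.

    `features` is any iterable of per-box feature vectors (lists of floats
    or rows of a 2-D array).
    """
    bad = {}
    for row_idx, row in enumerate(features):
        n_bad = sum(1 for value in row if not math.isfinite(value))
        if n_bad:
            bad[row_idx] = n_bad
    return bad
```

If this returns an empty dict for every image, the features themselves are finite and the NaN more likely arises during training.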
