GithubHelp home page GithubHelp logo

dalle-pytorch-datasets's Introduction

(Archived)

@robvanvolt has created a much more fleshed out version here: https://github.com/robvanvolt/DALLE-datasets

None of this code works yet. If you'd like to contribute, create a pull request! We need all the datasets we can get. Otherwise come back in a few weeks to check on progress.

This repository includes metadata and instructions for downloading many captioned datasets + generated captions from labels.

Thanks to @yashbonde, we eventually intend to include generated captions for a variety of datasets that don't include captions.

Data Format

Since this is a highly versatile dataset we have a common format for each sample:

{
  "image_id": {
    "labels": ["car", "chair", "something else"],
    "score": [0, 1, 1],
    "caption": "caption goes here",
    "dataset": "open_images_v4"
    "source_split": "train",
    "original_language": "eng",
  }
}
  • image_id: this will be expanded to the complete filepath when training
  • labels: in case the given images has labels add those here, default is None
  • score: in case there is a score against that labels eg. OpenImages, default is None
  • caption: generated caption goes here
  • source_split: what split was this a part of in the datasset it is of
  • dataset: key of the dataset name
  • original_language: in case this has multilingual dataset use ISO-639-2 code

Datasets

name size image count link used for VAE captions given captions generated
Downscale OpenImagesv4 16GB 1.9M torrent
Stanford STL-10 2.64GB 113K torrent
CVPR Indoor Scene Recognition 2.59GB 15620 torrent
The Visual Genome Dataset v1.0 + v1.2 Images 15.20GB 108K torrent
Food-101 5.69GB 101K torrent
The Street View House Numbers (SVHN) Dataset 2.64GB 600K torrent
Downsampled ImageNet 64x64 12.59GB 1.28M torrent
COCO 2017 52.44GB 287K torrent website
Flickr 30k Captions (bad data, downloads duplicates) 8GB 31K kaggle

Other Projects

This a big community led effort, find more projects:

Connect with us

You can join the discord for direct communication.

dalle-pytorch-datasets's People

Contributors

afiaka87 avatar yashbonde avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

dalle-pytorch-datasets's Issues

Contact

Afiaka87, can I please have your email? Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.