
hnyu / scalor

This project forked from jindongjiang/scalor


Official PyTorch implementation of "SCALOR: Generative World Models with Scalable Object Representations"

License: MIT License

Python 98.88% Dockerfile 1.12%


SCALOR

This repository is the official implementation of "SCALOR: Generative World Models with Scalable Object Representations" by Jindong Jiang*, Sepehr Janghorbani*, Gerard de Melo, and Sungjin Ahn, accepted at the International Conference on Learning Representations (ICLR) 2020. Project Website

Architecture

Requirements

To install requirements:

conda env create -f environment.yml

To activate:

conda activate scalor_env

Dataset

The "Grand Central Station" dataset can be downloaded here. After downloading the file, extract the dataset using this command:

tar -xf grandcentralframes.tar.gz -C /path/to/dataset/

Training

To train SCALOR with default settings, run this command:

python train.py --data-dir /path/to/dataset/

Results

toy

natural-scene

Using SCALOR in your project

Foreground not working

If you find the background module explains everything in the image and the foreground module is turned off, first check the following two settings:

  1. The num_cell_h and num_cell_w in common.py. If the objects in the scene are densely positioned in a local area, the number of cells (num_cell_h and num_cell_w) should be larger to provide enough cells in that local area.

  2. The max_num_obj in common.py. In the early training stage, a higher value is better (as long as it stays smaller than the total number of cells), since it allows more cells to be activated and so accelerates foreground training. Feel free to reduce it later.
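The constraint between these settings can be checked with a small helper. This is only a sketch: the function check_grid_settings is hypothetical and not part of the repository, and the values passed to it are placeholders rather than recommended settings.

```python
def check_grid_settings(num_cell_h, num_cell_w, max_num_obj):
    """Return the total cell count, raising if max_num_obj is not below it."""
    total_cells = num_cell_h * num_cell_w
    if not 0 < max_num_obj < total_cells:
        raise ValueError(
            f"max_num_obj={max_num_obj} should be positive and smaller than "
            f"the {total_cells} cells of a {num_cell_h}x{num_cell_w} grid"
        )
    return total_cells


# Hypothetical values for illustration only; tune them for your own scenes.
total = check_grid_settings(num_cell_h=8, num_cell_w=8, max_num_obj=45)
print(f"{total} cells, of which at most 45 may hold objects")
```

Running a check like this before a long training job is a cheap way to catch a grid that is too coarse for the chosen object budget.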

I have also added the following two settings to the code. Feel free to try either of them:

  1. Using a weaker background decoder. One option is to set the using_bg_sbd flag to True in common.py.

  2. Using a training curriculum in the early training stage. This can be done by setting phase_bg_alpha_curriculum to True in common.py.
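As a rough sketch, and assuming these two flags are plain module-level variables in common.py (the layout is an assumption; only the flag names come from the text above), enabling both options might look like this:

```python
# Assumed layout: module-level flags in common.py.
# Flag names are taken from the README; where they sit in the file is a guess.
using_bg_sbd = True               # switch to the weaker background decoder option
phase_bg_alpha_curriculum = True  # enable the early-training curriculum
```

It may be easier to enable one flag at a time, so that any change in foreground behaviour can be attributed to a single setting.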

Feel free to let me know if you face any other problems when adopting SCALOR in your project.

Citation

@inproceedings{JiangJanghorbaniDeMeloAhn2020SCALOR,
  title={SCALOR: Generative World Models with Scalable Object Representations},
  author={Jindong Jiang and Sepehr Janghorbani and Gerard {de Melo} and Sungjin Ahn},
  booktitle={Proceedings of ICLR 2020},
  year={2020},
  publisher = {OpenReview.net},
  location = {Addis Ababa, Ethiopia},
  url = {https://openreview.net/pdf?id=SJxrKgStDH},
}
