GithubHelp home page GithubHelp logo

abagaria / lwm Goto Github PK

View Code? Open in Web Editor NEW

This project forked from htdt/lwm

0.0 1.0 0.0 155 KB

Latent World Models For Intrinsically Motivated Exploration | Official repository

Home Page: https://arxiv.org/abs/2010.02302

License: MIT License

Python 99.51% Dockerfile 0.49%

lwm's Introduction

Latent World Models For Intrinsically Motivated Exploration

Official repository | arXiv:2010.02302 | NeurIPS 2020 Spotlight

10m video presentation from NeurIPS

montezuma's revenge t-sne

Installation

The implementation is based on PyTorch. Logging works on wandb.ai. See docker/Dockerfile.

Usage

After training, the resulting models will be saved as models/dqn.pt, models/predictor.pt etc. For evaluation, models will be loaded from the same filenames.

Atari

To reproduce LWM results from Table 2:

cd atari
python -m train --env MontezumaRevenge --seed 0
python -m eval --env MontezumaRevenge --seed 0

See default.yaml for detailed configuration.

To get trajectory plots as on Figure 3:

cd atari
# first train encoders for random agent
python -m train_emb
# next play the game with keyboard
python -m emb_vis
# see plot_*.png

Partially Observable Labyrinth

To reproduce scores from Table 1:

cd pol
# DQN agent
python -m train --size 3
python -m eval --size 3

# DQN + WM agent
python -m train --size 3 --add_ri
python -m eval --size 3 --add_ri

# random agent
python -m eval --size 3 --random

Code of the environment is in pol/pol_env.py, it extends gym.Env and can be used as usual:

from pol_env import PolEnv
env = PolEnv(size=3)
obs = env.reset()
action = env.observation_space.sample()
obs, reward, done, infos = env.step(action)
env.render()
#######
# #   #
# ### #
# #@  #
# # # #
#   # #
#######

lwm's People

Contributors

htdt avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.