GithubHelp home page GithubHelp logo

sandy4321 / a2c-te-nog Goto Github PK

View Code? Open in Web Editor NEW

This project forked from galatolofederico/a2c-te-nog

0.0 1.0 0.0 31 KB

Repository for "Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms"

License: GNU General Public License v3.0

Shell 0.11% Python 99.89%

a2c-te-nog's Introduction

Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms

Repository for the paper Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms

Installation

Clone the repository

git clone https://github.com/galatolofederico/a2c-te-nog.git
cd a2c-te-nog

Create a virtualenv and install the requirements

virtualenv --python=python3.6 env
. ./env/bin/activate
pip install -r requirements.txt

Additional packages are needed to use wandb or tensorboard

Usage

To run an experiment run

python run.py --env <environment_name> ....

Check arguments.py to see which hyperparameters can be set as arguments

To create an Optuna study

python create_study.py --study-name <study_name>

The environment variable OPTUNA_STORAGE must be set to a valid Optuna storage

To run a trial from an Optuna study

python run_trial.py --study-name <study_name>

Examples

To create an hyperparameters optimization for the agent A2CTENOG as in the paper

python create_study.py --env lunarlander --log stdout --agent A2CTENOG --study-name A2CTENOG-1 --prune-reward -500 --total-steps 3e4 --num-envs 8

To run a run from this study

python run_trial.py --study-name A2CTENOG-1

To run a specify experiment (for example the best one found for A2CTENOG in the paper)

python run.py --env lunarlander --log stdout --prune-reward -500 --total-steps 3e4 --num-envs 8 --agent A2CTENOG --target-entropy 0.0917 --lr 0.0002292 --max-clip-norm 0.3462 --train-steps 64 --gamma 0.999

Citing

If you want to cite use you can use this BibTeX

@article{galatolo_a2c
,	author	= {Galatolo, Federico A and Cimino, Mario GCA and Vaglini, Gigliola}
,	title	= {Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms}
,	year	= {2020}
}

Contributions and license

The code is released as Free Software under the GNU/GPLv3 license. Coping, adapting e republishing it is not only consent but also encouraged.

For any further question feel free to reach me at [email protected] or on Telegram @galatolo

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.