GithubHelp home page GithubHelp logo

vermouth1992 / mbrl-hvac Goto Github PK

View Code? Open in Web Editor NEW
24.0 1.0 5.0 8.58 MB

Model-based Reinforcement Learning for Building HVAC Control

License: MIT License

Python 14.27% Shell 0.31% Jupyter Notebook 85.41%

mbrl-hvac's Introduction

Model-based Reinforcement Learning for Building HVAC Control

This repository is the official implementation of Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation.

Requirements

To install requirements:

Python package

pip install -r requirements.txt

EnergyPlus

Please follow https://github.com/IBM/rl-testbed-for-energyplus and install EnergyPlus version 9.1.0

Training

To run the PID agent, run

python train_pid.py --city SF

To train the PPO agent, run

python train_ppo.py --city SF

To train the model-based RL with random shooting (RS), run

python train_model_based.py --city SF --mpc_horizon 5 --num_days_on_policy 10 --training_epochs 100

To train the model-based RL with dagger, run

python train_model_based.py --city SF --mpc_horizon 5 --num_days_on_policy 10 --training_epochs 100 --dagger

It will create a folder called runs that includes all the state, action and rewards during the training. The EnergyPlus generated files will be in the log folder.

Available cities

  • SF
  • Golden
  • Chicago
  • Sterling

We also provide shell script file in case you want to run everything. Checkout

  • run_pid.sh
  • run_ppo.sh
  • run_model_based_plan.sh
  • run_model_based_dagger.sh

Citation

@article{Zhang2019BuildingHS,
  title={Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation},
  author={Chi Zhang and S. Kuppannagari and R. Kannan and V. Prasanna},
  journal={Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation},
  year={2019}
}

Please also cite the paper that introduces the environment

@InProceedings{10.1007/978-981-13-2853-4_4,
author="Moriyama, Takao and De Magistris, Giovanni and Tatsubori, Michiaki and Pham, Tu-Hoa and Munawar, Asim and Tachibana, Ryuki",
title="Reinforcement Learning Testbed for Power-Consumption Optimization",
booktitle="Methods and Applications for Modeling and Simulation of Complex Systems",
year="2018",
publisher="Springer Singapore",
address="Singapore",
pages="45--59",
isbn="978-981-13-2853-4"
}

mbrl-hvac's People

Contributors

vermouth1992 avatar dependabot[bot] avatar

Stargazers

Goh Jet Wei avatar  avatar Sudarshan Khandelwal avatar  avatar  avatar Samer Kharboush avatar  avatar  avatar  avatar  avatar Jack Yang avatar  avatar  avatar  avatar  avatar Yoon, Seungje avatar  avatar  avatar  avatar Minjae Son avatar Jamal Aldahmashi  avatar CC avatar Thiago P. Bueno avatar Parth Maheshwari avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.