Implementation of the paper "Faster and More Accurate Trace-based Policy Evaluation via Overall Target Error Meta-Optimization" [1].
The "ringworld" tests use our implemented version of the environment.
This repository also contains our reproduced
- Python 3.6+
- Dependent python modules
Please kindly cite our work if necessary:
@article{zhao2019faster,
title={Faster and More Accurate Trace-based Policy Evaluation via Overall Target Error Meta-Optimization},
author={Zhao, Mingde and Porada, Ian and Luan, Sitao and Chang, Xiao-Wen and Precup, Doina},
journal={arXiv},
volume={1904.11439},
year={2019},
url={https://arxiv.org/abs/1904.11439},
}