zlj123-max / mpg Goto Github PK
View Code? Open in Web Editor NEWThis project forked from idthanm/mpg
MPG is originated from the paper "Mixed policy gradient", which also contains a cluster of high-quality implementations of deep reinforcement learning algorithms.
Home Page: https://arxiv.org/abs/2102.11513