freekang / a2oc_a2c
This project is forked from ronsailer/a2oc_a2c.
PyTorch implementation of Advantage Actor-Critic (A2C), Asynchronous Advantage Option-Critic (A2OC), Proximal Policy Optimization (PPO), and Actor Critic using Kronecker-Factored Trust Region (ACKTR), a scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation.
License: MIT License