Reinforcement learning with PyTorch. Inspired by MorvanZhou; the framework is changed from TensorFlow to PyTorch.

Python 100.00%

reinforcement-learning-with-pytorch's Issues

Using tensorboard to show graph of DQN

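Morvan's online course uses TensorBoard to display the network structure. How can I use TensorBoard with PyTorch to display the network structure? Many tutorials I have read pass a module to the add_graph method, but the DQN class is not a module itself; it is composed of two modules. Is there a solution?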
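One possible approach, sketched under assumptions: since add_graph traces an nn.Module, the two networks can be wrapped in a thin nn.Module whose forward calls both, and that wrapper can be passed to add_graph. The names Net, DQNGraph, log_graph, the layer sizes, and the log directory below are hypothetical and are not taken from the repository; SummaryWriter.add_graph is the real torch.utils.tensorboard call and needs the tensorboard package installed.

```python
import torch
import torch.nn as nn


class Net(nn.Module):
    """Hypothetical stand-in for the Q-network held by the DQN class."""

    def __init__(self, n_states: int = 4, n_actions: int = 2):
        super().__init__()
        self.fc1 = nn.Linear(n_states, 50)
        self.out = nn.Linear(50, n_actions)

    def forward(self, x):
        return self.out(torch.relu(self.fc1(x)))


class DQNGraph(nn.Module):
    """Thin wrapper so that both networks are traced in a single pass."""

    def __init__(self, eval_net: nn.Module, target_net: nn.Module):
        super().__init__()
        self.eval_net = eval_net
        self.target_net = target_net

    def forward(self, state, next_state):
        return self.eval_net(state), self.target_net(next_state)


def log_graph(model: nn.Module, example_inputs, log_dir: str = "runs/dqn_graph"):
    # Imported lazily so the rest of the sketch runs without tensorboard.
    from torch.utils.tensorboard import SummaryWriter

    writer = SummaryWriter(log_dir)
    writer.add_graph(model, example_inputs)  # traces model(*example_inputs)
    writer.close()


wrapper = DQNGraph(Net(), Net())
dummy = (torch.zeros(1, 4), torch.zeros(1, 4))  # one fake state per network
q_eval, q_next = wrapper(*dummy)

# Uncomment to write the graph, then run: tensorboard --logdir runs
# log_graph(wrapper, dummy)
```

If only one network is of interest, passing that network alone (for example the evaluation network) with a single dummy state to add_graph should also work, since each network is already an nn.Module.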

RuntimeError about AC_CartPole.py

I didn't change anything in 8_Actor_Critic_Advantage/AC_CartPole.py. I just ran it and got this error:

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [20, 1]], which is output 0 of TBackward, is at version 2; expected version 1 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).

So I added torch.autograd.set_detect_anomaly(True) to the code, but then I got this:

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [20, 1]], which is output 0 of TBackward, is at version 2; expected version 1 instead. Hint: the backtrace further above shows the operation that failed to compute its gradient. The variable in question was changed in there or anywhere later. Good luck!

My PyTorch version is 1.7.0.
My NumPy version is 1.18.5.
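A common cause of this message in recent PyTorch versions is that an optimizer step updates a network's weights in place while another loss still needs to backpropagate through that network's old graph. Whether that is what happens in AC_CartPole.py is an assumption here, not something the report confirms. The sketch below shows one usual way to avoid the pattern: update the critic first, then pass the TD error to the actor through detach(), so the actor loss never backpropagates into the critic. The network sizes, learning rates, GAMMA, and the update function are hypothetical.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical actor and critic for a 4-dimensional state and 2 actions.
actor = nn.Sequential(nn.Linear(4, 20), nn.ReLU(), nn.Linear(20, 2))
critic = nn.Sequential(nn.Linear(4, 20), nn.ReLU(), nn.Linear(20, 1))
actor_opt = torch.optim.Adam(actor.parameters(), lr=1e-3)
critic_opt = torch.optim.Adam(critic.parameters(), lr=1e-2)
GAMMA = 0.9  # assumed discount factor


def update(state, action, reward, next_state):
    # Critic step: the bootstrap target is computed without a graph.
    value = critic(state)
    with torch.no_grad():
        target = reward + GAMMA * critic(next_state)
    td_error = target - value
    critic_loss = td_error.pow(2).mean()
    critic_opt.zero_grad()
    critic_loss.backward()
    critic_opt.step()  # modifies the critic weights in place

    # Actor step: detach() cuts the link to the critic graph, so this
    # backward pass does not touch the weights that were just updated.
    log_prob = torch.log_softmax(actor(state), dim=-1)[0, action]
    actor_loss = -(log_prob * td_error.detach()).mean()
    actor_opt.zero_grad()
    actor_loss.backward()
    actor_opt.step()
    return critic_loss.item(), actor_loss.item()


losses = update(torch.randn(1, 4), 1, 1.0, torch.randn(1, 4))
```

An alternative with the same effect is to call backward() on both losses before calling step() on either optimizer.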

RuntimeError in 5_Deep_Q_Network: Can't call numpy() on Tensor that requires grad. Use tensor.detach().numpy() instead.

RuntimeError: Can't call numpy() on Tensor that requires grad. Use tensor.detach().numpy() instead. Adding detach() still produces the same error, and the plotted loss curve looks abnormal.
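For reference, a minimal sketch of the conversions that normally avoid this error. It does not reproduce the repository code; net, state, and loss_history are hypothetical names. The error is raised at every call site that converts a graph-attached tensor, so detach() has to be applied at each such place, including wherever the loss is stored for plotting.

```python
import torch
import torch.nn as nn

net = nn.Linear(4, 2)      # hypothetical stand-in for the Q-network
state = torch.randn(1, 4)

q_values = net(state)      # attached to the autograd graph
# q_values.numpy() here would raise the RuntimeError quoted above.

# Option 1: detach (and move to CPU) before converting to NumPy.
q_np = q_values.detach().cpu().numpy()
action = int(q_np.argmax())

# Option 2: skip graph construction entirely when only selecting an action.
with torch.no_grad():
    q_np_no_grad = net(state).numpy()

# For plotting, store the loss as a plain Python float rather than a tensor.
loss = nn.functional.mse_loss(q_values, torch.zeros(1, 2))
loss_history = [loss.item()]
```

Storing loss.item() instead of the loss tensor also keeps the plotted values independent of later graph or parameter changes.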

clownw / reinforcement-learning-with-pytorch

