the implement of TD error in Actor_Critic

Machine-Learning-Basic-Codes🏆

朱子云：

所谓致知在格物者，言欲致吾之知，在即物而穷其理也。盖人心之灵，莫不有知，而天下之物，莫不有理。惟于理有未穷，故其知有不尽也。是以大学始教，必使学者即凡天下之物，莫不因其已知之理而益穷之，以求至乎其极。至于用力之久，而一时豁然贯通焉，则众物之表里精粗无不到，而吾心之全体大用无不明矣。

📐📏

格物 (Ko Wu) which means 'investigate the essence of things' in English is a key method for study and better understanding of the knowledge. It is proposed by ancient Chinese philosophers about 2000 years ago and has a profound impact on later generations. The spirit of Ko Wu asks us to not only learn how to use knowledge, but also clearly understand the intrinsic theory. Therefore, it is necessary to re-implement ML algorithms by ourselves to figure out what exactly they did and why they succeed.

This repository aims to implement popular Machine Learning and Deep Learning algorithms by both pure python and use open-source frameworks.

Common Machine Learning Part: switch by use_sklearn flag in the main function；
Deep Learning Part: four implement methods for each algorithm (use_sklearn, use_keras, use_torch and self_implement)；
Applications Part: RL + NLP + CV
New trend: GNNs

Welcome everyone to help me finish this Ko Wu project by pulling requests or giving me some suggestions and issues!!!

关联知乎专栏 Associated Zhihu Blog

RL in Robotics

Machine Learning 格物志

代码目录 Code Catalog

Regression

Classification

Regression & Classification

Neural Network

Unsupervised Learning

PCA
K-Means

Ensemble Model

Boosting

Reinforcement Learning

Computer Vision

Natural Language Processing

Graph Neural Networks

Graph Neural Network (GNN)
Graph Convolutional Neural Network (GCN)
Graph Attention Networks (GAT)
GraphSAGE
GraphRNN
Variational Graph Auto-Encoders (GAE)

If you're interested in reinforcement learning, we encourage you to check out our latest library of reinforcement learning and imitation learning in (humanoid) robotics.

Repository address: https://github.com/Skylark0924/Rofunc

	R = 0
	saved_actions = self.model.saved_actions
	policy_losses = [] # list to save actor (policy) loss
	value_losses = [] # list to save critic (value) loss
	returns = [] # list to save the true values

	# calculate the true value using rewards returned from the environment
	for r in self.model.rewards[::-1]:
	# calculate the discounted value
	R = r + self.gamma * R
	returns.insert(0, R)

	returns = torch.tensor(returns)

skylark0924 / machine-learning-is-all-you-need Goto Github PK

machine-learning-is-all-you-need's Introduction

Machine-Learning-Basic-Codes🏆

Welcome everyone to help me finish this Ko Wu project by pulling requests or giving me some suggestions and issues!!!

关联知乎专栏 Associated Zhihu Blog

代码目录 Code Catalog

Regression

Classification

Regression & Classification

Neural Network

Unsupervised Learning

Ensemble Model

Reinforcement Learning

Computer Vision

Natural Language Processing

Graph Neural Networks

machine-learning-is-all-you-need's People

Contributors

Stargazers

Watchers

Forkers

machine-learning-is-all-you-need's Issues

Recommend Projects

Recommend Topics

Recommend Org

Jobs