Comments (2)
Oops. I haven't seen a homework suggesting fixing this when I wrote the issue. Probably we should delete this...
from practical_rl.
I'm not sure I understand which homework you are referring to where you say that it suggests to fix this issue. The next homework is deep CEM, which differs only by the environment (MountainCar instead of Taxi) and the estimator (a neural network instead of a table). We don't expect any changes for the learning algorithm there. In week 2 and onwards, we move on to more complex algorithms, none of which are a generalization of CEM.
However, I don't think that investing any effort into tuning the CEM assignment is worthwhile. It only serves the function of demonstrating that an extremely primitive algorithm can solve RL problems, but its performance is weak compared to Q-learning or any other "true RL" algorithms.
from practical_rl.
Related Issues (20)
- Remove mentions of python2 from week7/practice
- PyTorch notebooks should detect GPU and run on it if it's available
- Backport changes from PyTorch version of DQN Atari into the TF notebook
- Complete code HOT 1
- Colab no longer bundles Atari ROMs in their Gym installation HOT 1
- The links can't be opened now HOT 2
- class DQN uses global variable in assert
- CrossEntropy Exercise not working despite every step being marked as correct HOT 1
- Week2 Seminar-VI frozen lake problem HOT 1
- Equation of state-action value function in seminar_vi week 02 HOT 2
- Can I get answer for my confusion in week07_seq2seq/practice_torch
- In Coursera branch, why do the last commit deleted the grading module for week1? HOT 2
- Issue with Slides in Materials of #4 HOT 5
- Videos of Week 05 have issues of not being shown
- Is it still possible to submit assignments to coursera? HOT 5
- `week09_policy_II` – `trpo` slides has an invalid link HOT 4
- Questioning Regards Conjugate Gradient Algorithm HOT 1
- week06_policy_based - invalid link
- The link for OpenAI recommended reading material is not working anymore
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from practical_rl.