GithubHelp home page GithubHelp logo

paper_notes's Introduction

Inspired by Adrian Colyer and Denny Britz.

This contains my notes for research papers that I've read. Papers are arranged according to three broad categories and then further numbered on a (1) to (5) scale where a (1) means I have only barely skimmed it, while a (5) means I feel confident that I understand almost everything about the paper. Within a single year, these papers should be organized according to publication date. The links here go to my paper summaries if I have them, otherwise those papers are on my TODO list.

Contents:

Reinforcement Learning and Imitation Learning

2019 RL/IL Papers

  • Extending Deep MPC with Safety Augmented Value Estimation from Demonstrations, arXiv 2019 (3)
  • Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction, arXiv 2019 (1)
  • SQIL: Imitation Learning via Regularized Behavioral Cloning, arXiv 2019 (1)
  • Towards Characterizing Divergence in Deep Q-Learning, arXiv 2019 (1)
  • Skew-Fit: State-Covering Self-Supervised Reinforcement Learning, arXiv 2019 (1)
  • Visual Hindsight Experience Replay, arXiv 2019 (1)
  • Diagnosing Bottlenecks in Deep Q-Learning Algorithms, ICML 2019 (1)
  • Efficient Off-Policy Meta-Reinforcement learning via Probabilistic Context Variables, ICML 2019 (1)
  • Off-Policy Deep Reinforcement Learning Without Exploration ICML 2019 (5)

Early-year

2018 RL/IL Papers

Late-year

Mid-year

Early-year

2017 RL/IL Papers

Late-year

Mid-year

Early-year

2016 RL/IL Papers

2015 RL/IL Papers

2014 and Earlier RL/IL Papers

Deep Learning

2019 DL Papers

  • On The Power of Curriculum Learning in Training Deep Neural Networks, ICML 2019 (1)

2018 DL Papers

2017 DL Papers

2016 DL Papers

2015 DL Papers

2014 and Earlier DL Papers

Miscellaneous

(Mostly about MCMC, Machine Learning, and/or Robotics.)

2019 Misc Papers

2018 Misc Papers

2017 Misc Papers

2016 Misc Papers

2015 Misc Papers

2014 Misc Papers

2013 and Earlier Misc Papers

paper_notes's People

Contributors

danieltakeshi avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

paper_notes's Issues

The Implementation of Guided Cost Learning

Dear Dr. Seita,

Thanks for your paper_notes. I am now working on an autonomous driving project based on GCL. However, I have no idea how to implement the GCL algorithm. I really want to implement the GCL algorithm, I was wondering if you could provide some implementation code with me. Thank you.

Sincerely

About the code!

Hello, I am reading papers about the "learning to learn",I have some doubts about the implements of RL2, just like one trail includes n same MDPs or n different MDPs, or the meaning of fast and slow,even the difference of it and MAML.You mentioned you would read the code,could you email the code to me for my research work!I haven't found any code about RL2! I will appreciate your help if you can.

on what “representation” means

Hello, and thanks for your very insightful review to many papers I'm deeply interested in.

In the "Loss is its own reward" paper's review, you said:

They're interested in representation, which in English, means how we encode and communicate "good-ness" and "bad-ness" to an agent

In this sentence, two parts of the model is implied. An unamed part encodes and communicates the "good-ness", and agent. However, classifying which action was taken between two frames, can be done without the knowledge of what the agent values, or really, any reward at all. And classifying which action was taken is a self-supervisation task that is used to improve the representation. (I'm sure you already understand and agrees with all this, just bringing my reasons to your attention)

Would it not be more accurate to say representation is about learning the underlying mechnisms of the environment, akin to visual development and intrinsic physics in infants, which would (hopefully) help the agent to decide "good-ness" and "bad-ness" of a state?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.