Comments (8)

ColdCodeCool avatar ColdCodeCool commented on April 28, 2024

@guotong1988 I think you should start with the basic concepts of reinforcement learning. It is essentially dynamic programming: the state changes from time to time, that is, from one time step to the next. You'd better learn about Markov decision processes and the Bellman equation first.

from deeplearningflappybird.

guotong1988 avatar guotong1988 commented on April 28, 2024

"the state changes from time to time"
Thank you.
Could you please take a look at my other question? Thanks!
It is also in the issues.

guotong1988 avatar guotong1988 commented on April 28, 2024

Thinking about it the other way around: why not use just one state instead of two?

ColdCodeCool avatar ColdCodeCool commented on April 28, 2024

@guotong1988 No, you cannot use only one state. Intuitively, you have to interact with the environment by acting in order to learn anything. Once your action is done, you are in another state and you receive a reward or punishment from the environment; that is how you learn something.
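
To make the "two states" point concrete, here is a minimal sketch of a tabular Q-learning update. The four-state line world, the `step` and `q_update` helpers, and the values of `alpha` and `gamma` are all assumptions made up for illustration, not code from the project. The point is only that each update consumes a transition (state, action, reward, next state), so a single state on its own carries no learning signal.

```python
import random

# Hypothetical toy environment: states 0..3 on a line, state 3 is the goal.
def step(state, action):
    """Apply action (-1 or +1) and return (next_state, reward, done)."""
    next_state = min(max(state + action, 0), 3)
    done = next_state == 3
    reward = 1.0 if done else -0.1
    return next_state, reward, done

def q_update(q, state, action, reward, next_state, done, alpha=0.5, gamma=0.9):
    """One tabular Q-learning update; it needs both state and next_state."""
    best_next = 0.0 if done else max(q[(next_state, a)] for a in (-1, 1))
    target = reward + gamma * best_next
    q[(state, action)] += alpha * (target - q[(state, action)])

q = {(s, a): 0.0 for s in range(4) for a in (-1, 1)}
rng = random.Random(0)
for _ in range(200):
    state, done = 0, False
    while not done:
        action = rng.choice((-1, 1))  # purely random exploration
        next_state, reward, done = step(state, action)
        q_update(q, state, action, reward, next_state, done)
        state = next_state
```

After training, `q[(2, 1)]` (step right onto the goal) ends up larger than `q[(2, -1)]` (step away from it), and that preference can only be learned by comparing where the agent was with where it ended up.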

ColdCodeCool avatar ColdCodeCool commented on April 28, 2024

@guotong1988 For a thorough understanding, you should learn MDP theory first.

guotong1988 avatar guotong1988 commented on April 28, 2024

The key point is that these two states are adjacent,
that is, the situation in the second state is determined by the previous several steps.

ColdCodeCool avatar ColdCodeCool commented on April 28, 2024

@guotong1988 Like I said, you really need to learn MDPs first. The Markov property says that the current state captures all relevant information from the history, so the future state depends only on the current state. Mathematically, P[s_{t+1} | s_t] = P[s_{t+1} | s_1, ..., s_t].
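
The Markov property can be checked empirically with a small simulation. This is only a sketch: the two-state transition matrix `P` and the helper names are hypothetical, chosen to keep the example short. It estimates the probability of the next state being 1 given the current state 0, once when the previous state was 0 and once when it was 1. If the chain is Markov, both estimates should land near `P[0][1]`.

```python
import random

# Hypothetical two-state chain: P[i][j] = probability of moving from i to j.
P = [[0.7, 0.3],
     [0.4, 0.6]]

def simulate(n_steps, seed=0):
    """Sample a path of n_steps transitions starting from state 0."""
    rng = random.Random(seed)
    state, path = 0, [0]
    for _ in range(n_steps):
        state = 0 if rng.random() < P[state][0] else 1
        path.append(state)
    return path

def prob_next_is_1(path, prev, cur):
    """Estimate P[s_{t+1}=1 | s_{t-1}=prev, s_t=cur] from the sampled path."""
    hits = total = 0
    for a, b, c in zip(path, path[1:], path[2:]):
        if a == prev and b == cur:
            total += 1
            hits += c
    return hits / total

path = simulate(200_000)
p_after_0 = prob_next_is_1(path, prev=0, cur=0)  # history: 0 then 0
p_after_1 = prob_next_is_1(path, prev=1, cur=0)  # history: 1 then 0
```

Both `p_after_0` and `p_after_1` should come out close to 0.3, up to sampling noise: once the current state is known, the extra step of history does not change the prediction.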

guotong1988 avatar guotong1988 commented on April 28, 2024

The answer: one state contains 4 frames.
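
A rough sketch of what "one state contains 4 frames" can look like in code. The `FrameStack` class and the string frame labels are hypothetical stand-ins (real frames would be image arrays), and repeating the first frame to fill the stack is an assumption, not something confirmed in this thread.

```python
from collections import deque

class FrameStack:
    """Keeps the k most recent frames and exposes them as one state."""

    def __init__(self, first_frame, k=4):
        # Assumption: the first frame is repeated k times to fill the stack.
        self.frames = deque([first_frame] * k, maxlen=k)

    def push(self, frame):
        """Add the newest frame; the oldest one is dropped automatically."""
        self.frames.append(frame)
        return self.state()

    def state(self):
        # Oldest frame first, newest frame last.
        return tuple(self.frames)

# Frames are stand-in labels here; in practice they would be image arrays.
stack = FrameStack("f0")
s_t = stack.state()     # state before acting
s_t1 = stack.push("f1")  # state after one step
s_t2 = stack.push("f2")  # state after two steps
```

Consecutive states overlap in three of their four frames (`s_t1[1:] == s_t2[:-1]`). A single frame cannot show which way the bird is moving, but a stack of recent frames can, which is why a stacked state can plausibly be treated as Markov even though the next state "depends on the previous steps".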
