GithubHelp home page GithubHelp logo

Comments (5)

edwhu avatar edwhu commented on May 29, 2024

Hi, that is a good question. One is to use some known distribution over state dimensions, like if you have a robot joint limit, then you can sample from a uniform distribution defined over the joint limits. Another way is to sample goals from the replay buffer.

The first way works well if you have prior knowledge about the state space. The second way works without any knowledge. However, you need to sample enough goal states to have a diverse candidate pool for MPPI optimization. An interesting future extension here is to figuring out how to efficiently sample diverse goals.

from peg.

IDayday avatar IDayday commented on May 29, 2024

Hi, I agree with you, but I have more detailed questions about sampling the goals from the replay buffer.

As far as I know, before we sample goals in the replay buffer, we still have to do some exploration, and where do we get the goals for these explorations?

Could you please explain how this is implemented in your PEG? (Maybe there is still a prior distribution, or make several key states as goals?)

from peg.

IDayday avatar IDayday commented on May 29, 2024

As a general case, if we don't have any knowledge about the envrionment, the only way I can think of is to use a randomized policy. ( maybe $\pi(s)$ not $\pi(s,g)$ )

from peg.

edwhu avatar edwhu commented on May 29, 2024

At the very start of training, there are no goals in the replay buffer. So how can we run any goal-directed exploration strategy?

One way is to just run a non-goal-conditioned policy, like the P2E exploration policy $\pi(s)$ to gather some initial trajectories to fill the replay buffer. After this, we can pick goals from the replay buffer for Go-Explore.

Another way, is to just use an arbitrary goal, like all 0s, and run Go-Explore with this arbitrary goal.

For PEG, since we know the bounds of the state space, we do not sample the initial goals from the replay buffer, we just sample candidates from the known bounds of the state space.

from peg.

IDayday avatar IDayday commented on May 29, 2024

Thanks for your patience. good luck!

from peg.

Related Issues (4)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.