Comments (3)
Hi @troiwill, what do you expect to happen at goal
and failure
states? In general, you should be able to define such states through how the transition and reward functions behave at them.
from pomdp-py.
Hey @zkytony, thanks for the quick follow up. I will focus on failure
but the concept sort of applies to goal
as well. Essentially, I am seeking a way to force rollout
to stop executing because a failure state was entered; this would save me from further unnecessary computations because I (or the system) would know that no transitions are possible.
I added a failure
state to my transition model, where the state simply remains the same. But I do not think it stops rollout
from continuing to propagate the state.
from pomdp-py.
You are correct that rollout
in POMCP/POUCT doesn't terminate until reaching a given depth. pomdp-py treats final/goal states the same as any other states at the library level (for simplicity). It achieves the same effect for planning if you make rewards after reaching goal or failure states be 0, and have those states transition to themselves. You can use max_depth to limit rollout depth.
Are you implementing your own rollout function? (if not, I think the default rollout is not computationally expensive). If you are worried that rollout eats up planning time, you can always set num_sims
instead.
from pomdp-py.
Related Issues (20)
- ModuleNotFoundError: No module named 'pomdp_py.framework.basics' HOT 23
- Remove the dependency on pygraphviz
- Can this solve Dec MDPs? HOT 1
- Potential issue in value iteration algorithm HOT 5
- Using POMCP to solve a time-dependent problem HOT 4
- Benchmark w.r.t the original Silver's code? HOT 2
- SARSOP choose same action everytime. HOT 6
- Can't find module named 'pomdp_py.algorithms.po_uct' HOT 8
- Call for Contributions HOT 5
- Cannot convert type to pomdp_py.framework.basics.Action HOT 13
- Moving a problem from SARSOP to POMCP HOT 16
- Sarsop won't compute policy after changing observation model HOT 9
- Changing max_depth and planning_time for POMCP HOT 10
- Particle deprivation in POMCP at the end of the trial HOT 2
- Making a greedy rollout policy for POMCP HOT 8
- How to correctly modify a planner object HOT 9
- Does this library support continuous pomdp problems? HOT 1
- How to correctly implement a goal/terminal state HOT 8
- Enable random seeding in POUCT / POMCP for deterministic behavior
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pomdp-py.