When using SampledEfficientZero implementation there is leaking Illegal Actions in nod

Looks like Commit <a href="https://github.com/opendilab/LightZero/commit/e32e495ee5c8e

Leaking illegal actions in SampledEfficientZero about lightzero HOT 3 CLOSED

opendilab commented on May 22, 2024

Leaking illegal actions in SampledEfficientZero

from lightzero.

Comments (3)

puyuan1996 commented on May 22, 2024 1

Hello, thank you very much for your interest.

Please note that the SampledEfficientZero algorithm implemented in LightZero currently supports environments with fixed discrete action spaces, such as Atari, as well as environments with continuous action spaces like MuJoCo.

We haven't yet adapted the algorithm for board game environments with variable action spaces. We expect to make these adjustments around the end of October.

However, if you currently have a need for this functionality, we warmly welcome your participation and contribution. We can discuss and address any potential adaptation issues together in this issue thread. We look forward to your contributions, and deeply appreciate your support.

Wishing you all the best.

from lightzero.

Kostyansa commented on May 22, 2024 1

To give some context and tried solutions to the problem:

Limiting the amount of sampled actions inside the aforementioned expand function breaks the homogeneity of child_sampled_actions_segment inside the GameSegment and child_sampled_actions_batch inside the _learn_forward in SampledEfficientzero Policy
Accurately limiting reassigning of legal_actions inside the aforementioned expand function to block the expanding of illegal action nodes seems to work, but my experiments right now are inconclusive, it can be a good enough temporary solution if you need the functionality right now.

from lightzero.

Kostyansa commented on May 22, 2024

Looks like Commit e32e495e solved this issue.

from lightzero.

Recommend Projects

Leaking illegal actions in SampledEfficientZero about lightzero HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs