GithubHelp home page GithubHelp logo

Comments (6)

petermattson avatar petermattson commented on June 18, 2024

Under the present closed division rules, the RL environment cannot be altered, including most of those hyper parameters. We should make this distinction clearer. It can, however, be reimplemented to run faster as long as the underlying semantics don't change. So, more threads, depending on your use, might be allowed -- If it does the same work faster, you can do it. If it changes the result, you can't.

from training.

jgong5 avatar jgong5 commented on June 18, 2024

@petermattson Will you publish a doc on the detailed rules, e.g. changing what parameters is allowed and what not allowed? My understanding is that it is allowed to re-implement this on another framework like Caffe, right? I feel it is difficult for a re-implementation by simply looking into the reference implementation since there are many variables...

from training.

bitfort avatar bitfort commented on June 18, 2024

These are good questions, I can provide some clarify.

Basically, for the closed the division we do not want to adjust work done by adjusting parameters. For example, reducing number of games or fan out in the tree. Though, if the particular values we provide in the reference implementation are clearly subpar, we can propose changes (which everyone will follow).

If you have results to suggest that there are clearly better parameters, please do share and I'm happy to propose changes to the reference to the larger committee.

As far as "Algorithm used to determine end-of-game winner" -- you can change such algorithms as long as they produce the same output. For example, if you have a better way to enumerate legal moves, feel free to enumerate more efficiently. The guiding wisdom is that better implementations are fine, as long as they produce the same output.

I'm happy to dig into the details if you have additional questions.

from training.

jgong5 avatar jgong5 commented on June 18, 2024

@bitfort Thanks for the clarification. I understand your explanation on the parameter settings. But I feel it would be more clear to list all the relevant parameters with their values in a document instead of embed them in all the places of the reference implementation. That can make participants much easier to follow for a re-implementation.

The guiding wisdom is that better implementations are fine, as long as they produce the same output.

I feel the wording "better implementations" here is ambiguous. What kind of "algorithm" can be tuned in the CLOSED division? Can I say CNN-based policy/value network part of the "algorithm"?

from training.

petermattson avatar petermattson commented on June 18, 2024

from training.

jgong5 avatar jgong5 commented on June 18, 2024

@petermattson @bitfort Thank you both for the kind answers. I will close the issue now but with the hope that you can publish doc on more detailed list of hyper-parameters which are currently embedded in the reference code.

from training.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.