Comments (6)
Under the present closed division rules, the RL environment cannot be altered, including most of those hyper parameters. We should make this distinction clearer. It can, however, be reimplemented to run faster as long as the underlying semantics don't change. So, more threads, depending on your use, might be allowed -- If it does the same work faster, you can do it. If it changes the result, you can't.
from training.
@petermattson Will you publish a doc on the detailed rules, e.g. changing what parameters is allowed and what not allowed? My understanding is that it is allowed to re-implement this on another framework like Caffe, right? I feel it is difficult for a re-implementation by simply looking into the reference implementation since there are many variables...
from training.
These are good questions, I can provide some clarify.
Basically, for the closed the division we do not want to adjust work done by adjusting parameters. For example, reducing number of games or fan out in the tree. Though, if the particular values we provide in the reference implementation are clearly subpar, we can propose changes (which everyone will follow).
If you have results to suggest that there are clearly better parameters, please do share and I'm happy to propose changes to the reference to the larger committee.
As far as "Algorithm used to determine end-of-game winner" -- you can change such algorithms as long as they produce the same output. For example, if you have a better way to enumerate legal moves, feel free to enumerate more efficiently. The guiding wisdom is that better implementations are fine, as long as they produce the same output.
I'm happy to dig into the details if you have additional questions.
from training.
@bitfort Thanks for the clarification. I understand your explanation on the parameter settings. But I feel it would be more clear to list all the relevant parameters with their values in a document instead of embed them in all the places of the reference implementation. That can make participants much easier to follow for a re-implementation.
The guiding wisdom is that better implementations are fine, as long as they produce the same output.
I feel the wording "better implementations" here is ambiguous. What kind of "algorithm" can be tuned in the CLOSED division? Can I say CNN-based policy/value network part of the "algorithm"?
from training.
from training.
@petermattson @bitfort Thank you both for the kind answers. I will close the issue now but with the hope that you can publish doc on more detailed list of hyper-parameters which are currently embedded in the reference code.
from training.
Related Issues (20)
- Unable to download tar file in the mlcommons-training-wg-s3 S3 Bucket
- run stable diffusion see no space left on device error HOT 2
- Potential private information leak in retired benchmark
- unable to find image 'mlperf/object_detection'
- OCI runtime create failed
- where is the definition of mlperf_logging HOT 1
- [Stable Diffusion] VAE Moments to image outputs whited out image. HOT 1
- Alternative method for downloading Llama2 70b
- Gradient clipping not working for llama2_70b_lora benchmark HOT 1
- MLPerf library version for 4.0 Submission
- Llama2 - LoRA Reference Implementation HOT 2
- Data download for Stable Diffusion fails
- DLRM criteo day23 MD5 varify faild
- Hardware Configuration
- llama3 support?
- Invalid `local_replica_id` with `MultiWorkerMirroredStrategy`
- `IndexError` in `cross_device_ops` with `MultiWorkerMirroredStrategy`
- Scope of ML based benchmarks in MLPerf.
- Bus error (core dumped) in graph_neural_network HOT 1
- Problem downloading S3 bucket
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from training.