Comments (5)
I can answer some of those questions:
3 - AFAIK max_iters
is the hard limit of iterations, but the early stopping algorithm may choose to end it earlier.
4 - try setting verbosity to 2 and see what's being printed out. For me, the number of trials was equal to what I set.
from tune-sklearn.
@Yard1 ,
3 - If max_iters
is the hard limit, what is n_iter
used for?
4 - I did set verbosity=2
and noticed that only 10 trials are run even when I set both to 50
from tune-sklearn.
3 - they are two different things. max_iters
is the number of iteration per trial, and n_iter
is the number of trials. So the maximum amount of total iterations is max_iters
* n_iter
.
from tune-sklearn.
- I'm actually not entirely sure how this is supposed to behave. Maybe @richardliaw can answer this one better?
n_iter=10
means you sample 10 hyperparameters from the hyperparameter space, andcv=3
means each of the 10 models will be cross validated using 3 fold cross validation. The way we figure out how to early stop at the moment is just to get the average performance across all folds, as this is how cross_validation is generally done. I'm not sure it's safe to conclude the model is bad just because it does worse on one fold.- @Yard1 gave a good answer to this so I don't need to add much here.
- Did you check the
cv_results_
dictionary? n_jobs
is used to figure out how many trainables can be run in parallel. So if you specify it to be -1, it will do the maximum number of parallel jobs, using 1 core per job.sk_n_jobs
is just used to set then_jobs
parameter of the underlying sklearn estimator. This defaults to -1 to tell sklearn to use all the cores available to it, and can usually be ignored unless you run into errors. I'm not sure why it's using 1/2 a core, but it'd be helpful to have more information/output. Are there 2 cores total on your machine?
from tune-sklearn.
@rohan-gt these are great questions, I've pushed a PR to address them.
- great catch, #81 should fix it up and you'll see the expected behavior.
- I've renamed n_iter -> n_trials in #81.
- max_iters + early_stopping => ASHA + absolute limit of max_iters. @Yard1 gave a great explanation.
- Maybe an artifact of BOHB. I've pushed some changes and will ping you to try it out on master.
- n_jobs=-1 should imply that there are 2 parallel trials running. Did you set n_iter=1? If not, could you post a new issue about how to repro?
Thanks a bunch for trying things out and asking questions - we really want to make you successful!
from tune-sklearn.
Related Issues (20)
- TuneSearchCV not correctly handling error_score parameter HOT 5
- Save TuneSearchCV object with tensorflow and keras models HOT 7
- Can't suppress warning messages through standard python methods HOT 4
- n_jobs doesn't seem to be taken into account by TuneSearchCV HOT 3
- Resuming from checkpoint?
- Fail to run the conda installed tune_sklearn package HOT 2
- sk_n_jobs bug
- "training_iteration" from TuneSearchCV is always 1, and accuracy does not improve over time
- For TuneGridSearchCV: Where should I put reuse_actors=True?
- AttributeError: 'str' object has no attribute 'setup'
- TuneSearchCV doesn't seem to search for modules in alternative locations included in the PATH environment variable HOT 2
- during pickling there is an error HOT 1
- False Error log complains failed to read the result of trails
- How to tune Skorch model using GPU
- Since Ray-2.7.0, fetch_trial_dataframes is deprecated and raise an DeprecationWarning exception HOT 2
- No experiment checkpoint file of form 'experiment_state-*.json' was found HOT 1
- context is not passed with `set_config`
- Label management problem for Multilable classification
- Is it possible to save all models when doing TuneSearchCV or equivalent?
- Installation fails on Python 3.11/Windows
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tune-sklearn.