Comments (1)
As far as I understand it, this happens sometimes with tensorflow when the learning rate is borderline too high (for the selected model).
It can then be a result of the model constantly overshooting local optima because of the high step size and some tensorflow internal safeguards that basically output predictions as a dummy classifier.
Usually this behavior is also quite stochastic, meaning it might only appear in one of the 3 runs.
For this reason, some work simply disregards architectures that achieve <80% final accuracy as noise.
However, it appears as if the models you listed could not be trained in all 3 runs, so across 3 different random inits.
Also the models were trained with cosine lr decay which should prevent this kind of training problem in the first place.
It is probably an issue with the TPU v2 architecture.
from nasbench.
Related Issues (20)
- y-axis in Fig 7(left) HOT 4
- Reproduce Figure 8 in the NASBench paper
- Are there any api of turning the 01 spec_matrix into model_architecture? HOT 1
- Generating a dataset (tfrecord files) HOT 1
- Not compatible with tensorflow 2.0 HOT 3
- how to create model instance using matrix?
- RMSProp epsilon=1.0, why?
- How to get the FLOPs of a model HOT 2
- Data Query is non-deterministic HOT 1
- hdf5 file format
- tensorflow 2.3.1 incompatibility HOT 1
- How to extract all the architectures and metrics from the records? HOT 1
- source code for calculating the graph edit distance HOT 1
- what is "total_time" in the tfrecord file?
- is_valid is true, query function report errors HOT 1
- how to get the model spec by accuracy?
- How much mean test accuracy can the best architecure achieve? HOT 1
- Have you compare the execution time of various algorithms? HOT 4
- Query neighbors of an architecture HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nasbench.