GithubHelp home page GithubHelp logo

CIFAR10 test error about pc-darts HOT 12 CLOSED

yuhuixu1993 avatar yuhuixu1993 commented on September 28, 2024
CIFAR10 test error

from pc-darts.

Comments (12)

yuhuixu1993 avatar yuhuixu1993 commented on September 28, 2024

@sxs11 , hi, which architecture do you evaluate?

from pc-darts.

hixiaye avatar hixiaye commented on September 28, 2024

@yuhuixu1993 , I just followed " the evaluation on CIFAR10/100" in README.md.
Actually, I only changed the batch size, the default of data path for "--data" was changed to my path.

By the way , I think the "train_cifar.py" you mentioned in README.md is actually train.py?
thx

from pc-darts.

yuhuixu1993 avatar yuhuixu1993 commented on September 28, 2024

Yes, it is train.py. I am confused about your result too and I will evaluate the model again. You can run this model again, or change the default model to PCDARTS-image. I will reease the model as soon as possible.

from pc-darts.

yuhuixu1993 avatar yuhuixu1993 commented on September 28, 2024

@sxs11 , hi, I add some codes to train.py to display the best validation accuracy. Thanks.

from pc-darts.

EvanJamesMG avatar EvanJamesMG commented on September 28, 2024

@yuhuixu1993 I try to reproduce the search and train process on CIFAR10.
My valid_acc on CIFAR10 is about 97.2~97.3, I also can not reach the accuracy 97.57 listed on the paper.
All the Hyperparameters are same as the code provided ,and I tried several times.
So could you check your code listed.

from pc-darts.

yuhuixu1993 avatar yuhuixu1993 commented on September 28, 2024

@EvanJamesMG, which architecture did you evaluate, searched new architectures or the architecture I have searched?

from pc-darts.

EvanJamesMG avatar EvanJamesMG commented on September 28, 2024

@yuhuixu1993 I searched new architectures ,I did not use the architecture you listed . I tryied to reproduce the search process.

from pc-darts.

yuhuixu1993 avatar yuhuixu1993 commented on September 28, 2024

@EvanJamesMG , hi, as we all know, the training results on CIFAR-10 may have high variance, so we need to train more times to use the mean of them. Besides,the high variance will also influence the search process including both DARTS and our method which means that we can not search the best result each time, while our method is more stable than DARTS. According to our experiments, most of the results fall into (97.30,97.45).

from pc-darts.

EvanJamesMG avatar EvanJamesMG commented on September 28, 2024

@yuhuixu1993 Igot it. thanks for your explain. I tried to reproduce the search process on cifar10 dataset first, then try to search directly on imagenet. and I think this is the highlight of your paper.
Memory-Efficient ,good job.

from pc-darts.

allenxcp avatar allenxcp commented on September 28, 2024

@yuhuixu1993 I run the "python train.py --auxiliary --cutout ",some software version is: python == 2.7 torch==0.4.0 cuda==9.0。gpu==v100,16G
the result is valid_acc 93.739998, best_acc 95.139999

my script and log is

log.txt
train.py.txt

from pc-darts.

yuhuixu1993 avatar yuhuixu1993 commented on September 28, 2024

Hi, @allenxcp , according to your log, the training process has not finished yet. The total training epochs are 600, while the log ends at 282.

from pc-darts.

allenxcp avatar allenxcp commented on September 28, 2024

hi @yuhuixu1993 thanks for your reply ,I will continue,welcome join the wechat group

WechatIMG1

from pc-darts.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.