GithubHelp home page GithubHelp logo

cnl-icip2020's Introduction

Associating Multi-Scale Receptive Fields for Fine-grained Recognition

By Zihan Ye ([email protected]), Fuyuan Hu, Yin Liu ([email protected]), Zhenping Xia, Fan Lyu ([email protected]) and Pengqing Liu

Introduction

This is a PyTorch implementation for the paper "Associating Multi-Scale Receptive Fields for Fine-grained Recognition" in ICIP2020. It brings the CNL models trained on the CUB-200, Stanford-Dogs and Stanford-Cars.

introfig

Citation

If you think this code is useful in your research or wish to refer to the baseline results published in our paper, please use the following BibTeX entry.

@article{Associating2020Zihan,
    author={Zihan Ye and Fuyuan Hu and Yin Liu and Zhenping Xia and Fan Lyu and Pengqing Liu},
    title={Associating Multi-Scale Receptive Fields for Fine-grained Recognition},
    journal={ICIP},
    year={2020}
}

Requirements

  • PyTorch >= 0.4.1 or 1.0 from a nightly release
  • Python >= 3.5
  • torchvision >= 0.2.1
  • termcolor >= 1.1.0

Environment

The code is developed and tested under 1 RTX 8000 GPU cards on CentOS with installed CUDA-9.2/8.0 and cuDNN-7.1.

Baselines and Main Results on CUB-200 Dataset

Model Best Top-1 (%) Top-5 (%)
R-50 84.05 96.00
R-50 w/ 5NL 85.10 96.18
R-50 w/ 5CNL 85.64 96.84
R-101 85.05 96.70
R-101 w/ 5NL 85.53 96.65
R-101 w/ 5CNL 86.73 96.75

Notes:

  • The input size is 448.
  • Prolonging the WARMUP_ITERS appropriately would produce the better results for CNLNet models.

Getting Start

Prepare Dataset

  • Download pytorch imagenet pretrained models from pytorch model zoo. The optional download links can be found in torchvision. Put them in the pretrained folder.

  • Download the training and validation lists for CUB-200 dataset from Baidu Pan(Password: b8r6).

  • Download the training and validation lists for Stanford_Car dataset from Baidu Pan(Password: lrtz).

  • Download the training and validation lists for Stanford_Dog dataset from Baidu Pan(Password: ih96). Put them in the data folder and make them look like:

    ${THIS REPO ROOT}
     `-- pretrained
         |-- resnet50-19c8e357.pth
         |-- resnet101-5d3b4d8f.pth
     `-- data
         `-- cub
             `-- images
             |   |-- 001.Black_footed_Albatross
             |   |-- 002.Laysan_Albatross
             |   |-- ...
             |   |-- 200.Common_Yellowthroat
             |-- cub_train.list
             |-- cub_val.list
             |-- images.txt
             |-- image_class_labels.txt
             |-- README
         `-- Stanford_Car
             `-- images
             |-- cars_train.list
             |-- cars_test.list
             |-- ...
         `-- Stanford_Dog
             `-- Images
             |-- dogs_train.list
             |-- dogs_test.list
             |-- ...
    

Perform Validating

$ python train_val.py --arch '50' --dataset 'cub' --checkpoints ${FOLDER_DIR} --valid

Perform Training ResNet with Cross-layer Non-Local module

$ python train_val.py --arch '50' --dataset 'cub' --warmup

License

This code is released under the MIT License. See LICENSE for additional details.

cnl-icip2020's People

Contributors

fouriye avatar

Stargazers

Jingwei avatar Tankaishen avatar Zheng HT avatar GuanHua WU avatar Dyne Yi avatar Shuyue Jia (Bruce) avatar Chenhao Wang avatar Pang avatar  avatar  avatar miao avatar  avatar  avatar Akon Lau  avatar  avatar Byronnar avatar  avatar  avatar  avatar  avatar  avatar zimenglan avatar Artur Daveyan avatar  avatar  avatar 爱可可-爱生活 avatar Yicheng avatar Kurumi233 avatar Howard H. Tang avatar  avatar

Watchers

 avatar  avatar paper2code - bot avatar

Forkers

xrosliang yncao

cnl-icip2020's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.