GithubHelp home page GithubHelp logo

val-iisc / deit-lt Goto Github PK

View Code? Open in Web Editor NEW
13.0 15.0 0.0 17.48 MB

[CVPR 2024] Code for our Paper "DeiT-LT: Distillation Strikes Back for Vision Transformer training on Long-Tailed Datasets"

Home Page: https://rangwani-harsh.github.io/DeiT-LT/

License: MIT License

Python 97.67% Shell 2.33%
cvpr2024 deit long-tail long-tailed-recognition vision-transformer

deit-lt's Introduction

DeiT-LT: Distillation strikes back for Vision Transformer training on Long-Tailed datasets

This repository contains the training, evaluation codes and checkpoints for the paper DeiT-LT: Distillation strikes back for Vision Transformer training on Long-Tailed datasets accepted at CVPR 2024.

PWC PWC

DeiT-LT Teaser

Usage

  1. Clone the respository.
git clone https://github.com/val-iisc/DeiT-LT.git
  1. CIFAR-10 and CIFAR-100 datasets are downloaded on its own through the code. However, in case of Imagenet-LT and iNaturalist-2018, download these datasets from the given links.

  2. Create the conda environment and activate it.

conda env create -f environment.yml
conda activate deitlt
  1. Download the teacher models as given in Results table.

  2. Run the corresponding training script for any dataset after adding teacher path in the script file for the argument --teacher-path. For example:

bash sh/train_c10_if100.sh
bash sh/train_imagenetlt.sh
  1. For evaluation a DeiT-LT checkpoint, run the eval bash script for the corresponding dataset. Ensure that the checkpoint path is provided to the script for the argument --resume. For example:
bash sh/eval_c10.sh
bash sh/eval_imagenetlt.sh

Results

Dataset Imbalance Factor Overall Head Mid Tail Teacher path Student path
CIFAR 10-LT 100 87.5 94.5 84.1 85.0 Link Link
50 89.8 94.9 87.0 88.6 Link Link
CIFAR 100-LT 100 55.6 72.8 55.4 31.4 Link Link
50 60.5 74.8 60.3 43.1 Link Link
ImageNet-LT - 59.1 66.6 58.3 40.0 Link Link
iNaturalist-2018 - 75.1 70.3 75.2 76.2 Link Link

Acknowledgement

This codebase is heavily inspired from DeiT (ICML 2021). The concepts and methodologies adopted from DeiT have been instrumental in enabling us to push the boundaries of our research and development. We extend our sincerest thanks to the developers and contributors of DeiT.

License

DeiT-LT is an open-source project released under the MIT license (MIT). The codebase is derived from that of DeiT (ICML 2021), which is released under Apache 2.0 license.

BibTex

If you find this code or idea useful, please consider citing our work:

@article{rangwani2024deit-lt,
  author    = {Rangwani, Harsh and Mondal, Pradipto and Mishra, Mayank and Ramayee Asokan, Ashish and Babu, R Venkatesh},
  title     = {DeiT-LT: Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets},
  journal   = {CVPR},
  year      = {2024},
}

deit-lt's People

Contributors

pradipto111 avatar

Stargazers

Thomas Friedel avatar David-Hown avatar Mingyang Wu avatar  avatar  avatar Luke Meyers avatar Ashish Asokan avatar Pratinav Seth avatar Alexandridis, Konstantinos Panagiotis avatar Shreyas Dixit avatar Akshay Kulkarni avatar shyam_patadia avatar Harsh Rangwani avatar

Watchers

Aditya Ganeshan avatar Maharshi Gor avatar crazymuse avatar Shiv Surya avatar Ravi Kiran avatar  avatar Jogendra Nath Kundu avatar Utsav Garg avatar Harsh Rangwani avatar Harsha Vardhan avatar Navaneet K L avatar Kostas Georgiou avatar  avatar IronMan avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.