GithubHelp home page GithubHelp logo

rastna12 / cascade-detr Goto Github PK

View Code? Open in Web Editor NEW

This project forked from syscv/cascade-detr

0.0 0.0 0.0 114 KB

[ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection

Home Page: https://arxiv.org/abs/2307.11035

License: Apache License 2.0

Shell 0.40% C++ 1.36% Python 84.56% Cuda 13.68%

cascade-detr's Introduction

Cascade-DETR [ICCV'23]

Official code of "Cascade-DETR: Delving into High-Quality Universal Object Detection".

More results comparison are in the paper.

teaser_fig_cascade-detr

Introduction

We introduce Cascade-DETR for high-quality universal object detection. We jointly tackle the generalization to diverse domains and localization accuracy by proposing the Cascade Attention layer, which explicitly integrates objectcentric information into the detection decoder by limiting the attention to the previous box prediction. To further enhance accuracy, we also revisit the scoring of queries. Instead of relying on classification scores, we predict the expected IoU of the query, leading to substantially more well-calibrated confidences. Lastly, we introduce a universal object detection benchmark, UDB10, that contains 10 datasets from diverse domains. While also advancing the state-of-the-art on COCO, Cascade-DETR substantially improves DETR-based detectors on all datasets in UDB10, even by over 10 mAP in some cases. The improvements under stringent quality requirements are even more pronounced.

Installation

We use python=3.8,pytorch=1.7.1,cuda=11.0. Other versions may also be available.

Please follow the instructions to install both PyTorch and TorchVision dependencies. Installing both PyTorch and TorchVision with CUDA support is needed.

Clone the repository locally and install with

git clone https://github.com/SysCV/cascade-detr.git
cd cascade-detr

Example conda environment setup

# 1. Create environment
conda create --name cascade_detr python=3.8 -y
conda activate cascade_detr

# 2. Install pytorch https://pytorch.org/get-started/previous-versions/
pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html

# 3. Other needed packages
pip install -r requirements.txt

# 4. deformable attention
cd cascade_dn_detr/models/cascade_dn_detr/ops
sh make.sh
# unit test (should see all checking is True)
python test.py
cd ../../..

Data Download

UDB10 Benchmark includes 10 datasets in a varity of domains. All datasets can be downloaded from public links. In our paper, we provide separate training result on each dataset.

We provide link for each dataset here. Expected dataset structure for training and evaluation can be found in cascade_dn_detr.

1. COCO

Standard COCO 2017 train/val split.

2. UVO

UVO v0.5 dataset and its default train/val split.

3. Cityscapes

Cityscapes can be downloaded from its official website. The detection set annotation in COCO format can be downloaded from DE-DETR

4. Brain tumor

Brain tumor Dataset can be downloaded from Roboflow link. We recommend using the COCO JSON download format.

5. Document Parts

Document Parts Dataset can be downloaded from Roboflow link. We recommend using the COCO JSON download format.

6. Smoke

Smoke Dataset can be downloaded from Roboflow link. We recommend using the COCO JSON download format.

7. Egohands

Egohands Dataset can be downloaded from Roboflow link. We recommend using the COCO JSON download format.

8. PlantDoc

PlantDoc Dataset can be downloaded from Roboflow link. We recommend using the COCO JSON download format.

9. BDD100K

BDD can be downloaded from its official website. You can follow the instructions to convert annotation files to COCO format. We also provide the pre-converted coco format jsons for downloading if you want to skip the transformation process to COCO-style.

10. People in Painting

People in Painting Dataset can be downloaded from Roboflow link. We recommend using the COCO JSON download format.

Getting Started

You can change the current folder path to cascade_dn_detr and we provide detailed training, evaluation scripts and pretrained checkpoints.

cd cascade_dn_detr

Citation

If you find Cascade-DETR useful in your research or refer to the provided baseline results, please star โญ this repository and consider citing ๐Ÿ“:

@inproceedings{ye2023cascade,
  title={Cascade-DETR: Delving into High-Quality Universal Object Detection},
  author={Ye, Mingqiao and Ke, Lei and Li, Siyuan and Tai, Yu-Wing and Tang, Chi-Keung and Danelljan, Martin and Yu, Fisher},
  booktitle = {ICCV},
  year={2023}
}

cascade-detr's People

Contributors

lkeab avatar ymq2017 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.