This project forked from hustvl/featurized-queryrcnn

Featurized Query R-CNN

Home Page: https://arxiv.org/abs/2206.06258

License: MIT License

Featurized Query R-CNN

Wenqiang Zhang*, Tianheng Cheng*, Xinggang Wang†, Shaoyu Chen, Qian Zhang, Wenyu Liu
(*: equal contribution, †: corresponding author)

News

  • 14 June, 2022: Code and models of Featurized Query R-CNN have been released!

Introduction

The query mechanism introduced by DETR is changing the paradigm of object detection, and many recent query-based methods have achieved strong detection performance. However, current query-based detection pipelines suffer from two issues. First, multi-stage decoders are required to optimize the randomly initialized object queries, which incurs a large computational burden. Second, the queries are fixed after training, which leads to unsatisfactory generalization. To remedy these issues, we present featurized object queries, predicted by a query generation network within the well-established Faster R-CNN framework, and develop Featurized Query R-CNN. Extensive experiments on the COCO dataset show that Featurized Query R-CNN obtains the best speed-accuracy trade-off among all R-CNN detectors, including the recent state-of-the-art Sparse R-CNN detector.
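The difference between queries that are fixed after training and queries derived from the input image can be illustrated with a toy sketch. This is not the paper's query generation network: the top-k selection rule, the function name `featurized_queries`, and the sample data are all assumptions made for illustration only.

```python
from typing import List, Tuple

Vector = List[float]


def featurized_queries(
    features: List[Vector], objectness: List[float], k: int
) -> List[Tuple[int, Vector]]:
    """Toy illustration (hypothetical): reuse image features as object queries.

    Picks the k locations with the highest objectness score and returns
    (location index, feature vector) pairs. Because the queries come from
    the image's own features, they change with every input image, unlike
    a fixed set of learned query embeddings.
    """
    if len(features) != len(objectness):
        raise ValueError("features and objectness must have the same length")
    # Rank location indices by descending objectness score.
    ranked = sorted(range(len(objectness)), key=lambda i: objectness[i], reverse=True)
    return [(i, features[i]) for i in ranked[:k]]


# Made-up feature vectors and scores for four feature-map locations.
features = [[0.1, 0.2], [0.9, 0.8], [0.4, 0.5], [0.7, 0.1]]
objectness = [0.05, 0.92, 0.30, 0.75]

queries = featurized_queries(features, objectness, k=2)
print(queries)
```

In this sketch the two selected queries are the feature vectors at the two highest-scoring locations, so a different image would yield different queries.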

Installation and Training

Our method is based on detectron2; please refer to the detectron2 repository for more details.

Install detectron2:

git clone https://github.com/facebookresearch/detectron2.git

cd detectron2

python setup.py build develop
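After the build step, it can be useful to confirm that the package is visible to the current Python environment. The check below is a minimal sketch using only the standard library; the helper name `is_installed` is hypothetical and not part of this repository.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found in this environment."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # "detectron2" is the package name installed by the build step above.
    if is_installed("detectron2"):
        print("detectron2 is importable")
    else:
        print("detectron2 was not found; re-run the install step")
```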

For training, run:

python train_net.py --config-file <config-file> --num-gpus <num-gpus>
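For scripted runs, the same command can be assembled in Python. This is a sketch only: the helper `build_train_command` and the config path `configs/example.yaml` are hypothetical placeholders, and only the `--config-file` and `--num-gpus` flags shown above are used.

```python
import shlex
from typing import List


def build_train_command(config_file: str, num_gpus: int) -> List[str]:
    """Build the argument list for the training command shown above."""
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return [
        "python", "train_net.py",
        "--config-file", config_file,
        "--num-gpus", str(num_gpus),
    ]


# The config path is a placeholder; substitute a config from this repository.
cmd = build_train_command("configs/example.yaml", 8)
print(shlex.join(cmd))
```

To launch training from the repository root, the list can be passed to `subprocess.run(cmd, check=True)`.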

Main Results

| Model | Backbone | Epochs | AP | FPS | Weights |
| Featurized QR-CNN (100 Queries) | ResNet-50 | 36 | 41.3 | 26 | Google Drive |
| Cascade Featurized QR-CNN (100 Queries) | ResNet-50 | 36 | 43.0 | 24 | Google Drive |
| Cascade Featurized QR-CNN (300 Queries) | ResNet-50 | 36 | 44.6 | 24 | Google Drive |
| Cascade Featurized QR-CNN (100 Queries) | ResNet-101 | 36 | 43.9 | 18 | Google Drive |
| Cascade Featurized QR-CNN (300 Queries) | ResNet-101 | 36 | 45.8 | 17 | Google Drive |
  • Speed is measured on a single RTX 2080Ti GPU on the COCO val set.
  • If you have trouble accessing the models on Google Drive, they are also available on BaiduPan (Password: n91e).
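To make the speed-accuracy trade-off easier to compare, the reported FPS can be converted to per-image latency. The sketch below uses only the AP and FPS values from the table above; the helper `latency_ms` and the tuple layout are illustrative assumptions.

```python
# (model, queries, backbone, AP, FPS), copied from the results table.
results = [
    ("Featurized QR-CNN", 100, "ResNet-50", 41.3, 26),
    ("Cascade Featurized QR-CNN", 100, "ResNet-50", 43.0, 24),
    ("Cascade Featurized QR-CNN", 300, "ResNet-50", 44.6, 24),
    ("Cascade Featurized QR-CNN", 100, "ResNet-101", 43.9, 18),
    ("Cascade Featurized QR-CNN", 300, "ResNet-101", 45.8, 17),
]


def latency_ms(fps: float) -> float:
    """Convert frames per second to milliseconds per image."""
    return 1000.0 / fps


for model, queries, backbone, ap, fps in results:
    print(f"{model} ({queries} queries, {backbone}): "
          f"AP {ap:.1f}, {latency_ms(fps):.1f} ms/image")
```

For example, on ResNet-50 the cascade variant with 300 queries reports 1.6 AP more than the 100-query variant at the same reported FPS.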

Acknowledgements

Our implementation is based on detectron2 and Sparse R-CNN; we thank the authors for their open-source code.

License

Featurized Query R-CNN is released under the MIT License.

Citation

If you find Featurized Query R-CNN useful in your research or applications, please consider giving us a star 🌟 and citing it with the following BibTeX entry.

@article{FeaturizedQR-CNN,
  title={Featurized Query R-CNN},
  author={Zhang, Wenqiang and Cheng, Tianheng and Wang, Xinggang and Chen, Shaoyu and Zhang, Qian and Liu, Wenyu},
  journal={arXiv preprint arXiv:2206.06258},
  year={2022}
}
