GithubHelp home page GithubHelp logo

btmop / lgsearch_ddgan_pytorch Goto Github PK

View Code? Open in Web Editor NEW

This project forked from wangxiao5791509/lgsearch_ddgan_pytorch

0.0 0.0 0.0 47.19 MB

Tracking by Joint Local and Global Search: A Target-aware Attention based Approach (IEEE TNNLS 2021)

Home Page: https://sites.google.com/view/globalattentiontracking/home/extend

Python 100.00%

lgsearch_ddgan_pytorch's Introduction

LGSearch_DDGAN_PyTorch

Official Implementation of Joint Local and Global Search for Visual Tracking with Adversarial Learning.

BMVC-2019: Learning Target-aware Attention for Robust Tracking with Conditional Adversarial Network, Xiao Wang, Tao Sun, Rui Yang, Bin Luo [Project] [Paper] [Supplement] [Poster]

Journal Extension: Tracking by Joint Local and Global Search: A Target-aware Attention based Approach, Xiao Wang, Jin Tang, Bin Luo, Yaowei Wang, Yonghong Tian, and Feng Wu, IEEE TNNLS 2021 [Paper] [Video] [Slides]

Abstract

Tracking-by-detection is a very popular framework for single object tracking which attempts to search the target object within a local search window for each frame. Although such local search mechanism works well on simple videos, however, it makes the trackers sensitive to extremely challenging scenarios, such as heavy occlusion and fast motion. In this paper, we propose a novel and general target-aware attention mechanism (termed TANet) and integrate it with tracking-by-detection framework to conduct joint local and global search for robust tracking. Specifically, we extract the features of target object patch and continuous video frames, then we concatenate and feed them into a decoder network to generate target-aware global attention maps. More importantly, we resort to adversarial training for better attention prediction. The appearance and motion discriminator networks are designed to ensure its consistency in spatial and temporal views. In the tracking procedure, we integrate the target-aware attention with multiple trackers by exploring candidate search regions for robust tracking. Extensive experiments on both short-term and long-term tracking benchmark datasets all validated the effectiveness of our algorithm.

Tracking Framework

rgbt_car10

Inference

  1. You can directly generate all the attention images for your testing dataset, for example, [GOT-10K] or [TNL2K].
python test_got10k.py
  1. You can also integrate the code into your own tracker, and conduct local-global search only when needed. Our pre-trained model on the GOT-10K training subset is available here. You can use it to predict gloabl attention for your tracker.

Cite

If you find this paper useful for your research, please consider citing our paper:

@inproceedings{wang2019GANTrack,
  title={Learning Target-aware Attention for Robust Tracking with Conditional Adversarial Network},
  author={Wang, Xiao and Sun, Tao and Yang, Rui and Luo, Bin},
  booktitle={30TH British Machine Vision Conference},
  year={2019}
} 

@inproceedings{wang2021ganTANetTrack,
  title={Tracking by Joint Local and Global Search: A Target-aware Attention based Approach},
  author={Wang, Xiao and Tang, Jin and Luo, Bin and Wang, Yaowei and Tian, Yonghong and Wu, Feng },
  journal={IEEE Transactions on Neural Networks and Learning Systems},
  year={2021},
  publisher={IEEE}
} 

If you have any questions, please contact me via email: [email protected].

lgsearch_ddgan_pytorch's People

Contributors

wangxiao5791509 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.