GithubHelp home page GithubHelp logo

jiachen0212 / ada-track Goto Github PK

View Code? Open in Web Editor NEW

This project forked from dsx0511/ada-track

0.0 0.0 0.0 2.44 MB

Offical implementation of CVPR2024 paper ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association.

License: Other

Shell 0.17% Python 99.37% Dockerfile 0.46%

ada-track's Introduction

[CVPR2024] ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association

This is the official implementation of the CVPR'24 paper

ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association
Shuxiao Ding, Lukas Schneider, Marius Cordts, Juergen Gall

Arxiv | CVF Open Access

ADA-Track-PETR.mp4

Abstract

Query-based 3D Multi-Object Tracking (MOT) facilitates seamless integration into end-to-end frameworks. Many existing methods adopt the tracking-by-attention paradigm, utilizing track queries for identity-consistent detection and object queries for identity-agnostic track spawning. Tracking-by-attention, however, entangles detection and tracking queries in one embedding for both the detection and tracking task, which is sub-optimal. Other approaches resemble the tracking-by-detection paradigm, detecting objects using decoupled track and detection queries followed by a subsequent association. These methods, however, do not leverage synergies between the detection and association task. Combining the strengths of both paradigms, we introduce ADA-Track, a novel end-to-end framework for 3D MOT from multi-view cameras. We introduce a learnable data association module based on edge-augmented cross-attention, leveraging appearance and geometric features. Furthermore, we integrate this association module into the decoder layer of a DETR-based 3D detector, enabling simultaneous DETR-like query-to-image cross-attention for detection and query-to-query cross-attention for data association. By stacking these decoder layers, queries are refined for the detection and association task alternately, effectively harnessing the task dependencies. We evaluate our method on the nuScenes dataset and demonstrate the advantage of our approach compared to the two previous paradigms.

News

  • [2024/02/26] Our paper is accepted by CVPR 2024! ๐ŸŽ‰๐ŸŽ‰
  • [2024/05/14] We release our source code.

Getting started

  1. Installation
  2. Data preparation
  3. Training & inference

Experimental results

Detector Backbone N_GPU AMOTA AMOTP Logfile Checkpoint
DETR3D ResNet-101 4 38.4 1.378 log Download
PETR V2-99 8 48.3 1.242 log Download

NOTE: After fixing a small issue in the code, we achieved slightly better performance than reported in the paper ;-)

License

See LICENSE for more details.

Citation

If you find ADA-Track is helpful for your research, please give us a star and cite it by:

@inproceedings{ding2024adatrack,
  title={ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association},
  author={Ding, Shuxiao and Schneider, Lukas and Cordts, Marius and Gall, Juergen},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2024}
}

Acknowledgement

Our implementation is a plugin to mmdetection3d, and it is highly dependent on MUTR3D. We build our tracker based on the implementation of the detectors DETR3D and PETR. The success of this work is impossible without these aforementioned projects as well as many other high-quality open-source projects. Many thanks to them for enabling our research!

ada-track's People

Contributors

dsx0511 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.