GithubHelp home page GithubHelp logo

alexxiao95 / multi-camera-vehicle-tracking-and-reidentification Goto Github PK

View Code? Open in Web Editor NEW
138.0 7.0 30.0 33.16 MB

This repository contains our source code of Track 3 in the NVIDIA AI City Challenge Workshop at CVPR 2018. Please find the full source code of both Track 1 and Track 3 here ->

Home Page: https://github.com/AlexXiao95/2018AICity_TeamUW

CMake 0.62% Makefile 0.38% Shell 0.61% HTML 0.03% CSS 0.17% Python 51.20% C++ 42.38% MATLAB 2.45% M 0.01% Cuda 2.15%

multi-camera-vehicle-tracking-and-reidentification's Introduction

Multi-Camera Vehicle Tracking and Re-identification

This repository contains our source code of Track 3 in the NVIDIA AI City Challenge Workshop at CVPR 2018.

[Full source code], [Slides], [Paper], [Poster], [Project Page], [2018 NVIDIA AI City Challenge]

How It Works

We achieved Multi-Camera Vehicle Tracking and Re-identification based on a fusion of histogram-based adaptive appearance models, DCNN features, detected license plates, detected car types and traveling time information.

Getting Started

Prerequisites

The code has been tested on Ubuntu 16.04.

Dataset

The track 3 dataset contains 15 videos from 4 different locations, each around 0.5-1.5 hours long, recorded at 30 fps and 1080p resolution (1920×1080). The task is to identify all vehicles that pass through each recorded location at least once in the given set of videos. The camera locations and linked areas are shown below:

The UA-DETRAC dataset includes 10 hours of videos captured with a Canon EOS 550D camera at 24 different locations at Beijing and Tianjin in China. The videos are recorded at 25 frames per seconds (fps), with resolution of 960×540 pixels. There are more than 140 thousand frames in the UA-DETRAC dataset and 8250 vehicles that are manually annotated, leading to a total of 1.21 million labeled bounding boxes of objects. The primary aim of this dataset is to train vehicle detection algorithms.

The Comprehensive Cars (CompCars) dataset contains data from two scenarios, including images from web-nature and surveillance-nature. The web-nature data contains 163 car makes with 1,716 car models. The dataset is well prepared for the following computer vision tasks:
  • Fine-grained classification
  • Attribute prediction
  • Car model verification

You can find a pre-trained model here. Please refer the dataset website and follow the author's instruction to download the dataset.

The BoxCars dataset contains 116k of images of vehicles with fine-grained labels taken from surveillance cameras under various viewpoints.

Input/Output Format

Simply run bash src/run_all.sh in the command line. The input is single camera tracking results for all 15 videos by using our method in Track 1. The format of each line is as follows:

<video_id> <frame_id> <obj_id> <xmin> <ymin> <xmax> <ymax> <speed> <confidence>

  • <video_id> is the video numeric identifier, starting with 1. It represents the position of the video in the list of all track videos, sorted in alphanumeric order.
  • <frame_id> represents the frame count for the current frame in the current video, starting with 1.
  • <obj_id> is a numeric identifier. It is integer. It can be ignored for Track 1 (set to -1).
  • The axis-aligned rectangular bounding box of the detected video will be denoted by its pixel-valued coordinates within the image canvas, <xmin> <ymin> <xmax> <ymax>, computed from the top-left corner of the image (similar to the VOC2012 challenge format). All coordinates are integers.
  • <speed> denotes the instantaneous speed of the vehicle in the given frame, measured in miles per hour (mi/h), which is a non-negative real value.
  • <confidence> denotes the confidence of the prediction. Should be between 0 and 1.

The output is all possible candidates which will be used for license plate comparison. The format of each line is as follows:

<img_path> <similarity>

  • <img_path> is the path of probe vehicle and gallery vehicle. The first line is the probe image and the follwing is gallery images which are in a descending order in terms of similarity.
  • <similarity> is the similarity between probe vehicle and gallery vehicle based on a fusion of histogram-based adaptive appearance models, DCNN features, detected car types and traveling time information.

Demo [video]

Reference

Please cite these papers in your publications if it helps your research:

@inproceedings{tang2018vehicle,
  author = {Zheng Tang and Gaoang Wang and Hao Xiao and Aotian Zheng and Jenq-Neng Hwang},
  booktitle = {CVPR Workshop (CVPRW) on the AI City Challenge},
  title = {Single-camera and Inter-camera Vehicle Tracking and 3D Speed Estimation Based on Fusion of Visual and Semantic Features},
  year = {2018},
  pages = {108--115}
}

multi-camera-vehicle-tracking-and-reidentification's People

Contributors

alexxiao95 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

multi-camera-vehicle-tracking-and-reidentification's Issues

dataset for track3

hill, thank you for your excellent works. But, I can not get the dataset of track3. I have sent a email to [email protected], but there is no any reply. Can you share the dataset with me. Thanks in advance.

数据集的提供

你好。我想问一下关于这个官方数据集的下载。因为现在比赛早已结束,可是获取数据集必须要验证码。所以可否提供一下呢?绝对不是用于商业用途。或者用其他公开数据集也可以测试吗?
谢谢。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.