GithubHelp home page GithubHelp logo

381263772 / yolo_slowfast Goto Github PK

View Code? Open in Web Editor NEW

This project forked from wufan-tb/yolo_slowfast

0.0 0.0 0.0 7.21 MB

Yolov5+SlowFast: Realtime Action Detection Based on PytorchVideo

Python 100.00%

yolo_slowfast's Introduction

Yolov5+SlowFast: Realtime Action Detection

A realtime action detection frame work based on PytorchVideo.

Here are some details about our modification:

  • we choose yolov5 as an object detector instead of Faster R-CNN, it is faster and more convenient
  • we use a tracker(deepsort) to allocate action labels to all objects(with same ids) in different frames
  • our processing speed reached 24.2 FPS at 30 inference batch size (on a single RTX 2080Ti GPU)

Relevant infomation: FAIR/PytorchVideo; Ultralytics/Yolov5

Demo comparison between original(<-left) and ours(->right).

Update Log:

  • 2023.03.31 fix some bugs(maybe caused by yolov5 version upgrade), support real time testing(test on camera or video stearm).

  • 2022.01.24 optimize pre-process method(no need to extract video to image before processing), faster and cleaner.

Installation

  1. clone this repo:

    git clone https://github.com/wufan-tb/yolo_slowfast
    cd yolo_slowfast
    
  2. create a new python environment (optional):

    conda create -n {your_env_name} python=3.7.11
    conda activate {your_env_name}
    
  3. install requiments:

    pip install -r requirements.txt
    
  4. download weights file(ckpt.t7) from [deepsort] to this folder:

    ./deep_sort/deep_sort/deep/checkpoint/
    
  5. test on your video/camera/stream:

    python yolo_slowfast.py --input {path to your video/camera/stream}
    

    The first time execute this command may take some times to download the yolov5 code and it's weights file from torch.hub, keep your network connection.

    set --input 0 to test on your local camera, set --input {stream path, such as "rtsp://xxx" or "rtmp://xxxx"} to test on viewo stream.

References

Thanks for these great works:

[1] Ultralytics/Yolov5

[2] ZQPei/deepsort

[3] FAIR/PytorchVideo

[4] AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions. paper

[5] SlowFast Networks for Video Recognition. paper

Citation

If you find our work useful, please cite as follow:

{   yolo_slowfast,
    author = {Wu Fan},
    title = { A realtime action detection frame work based on PytorchVideo},
    year = {2021},
    url = {\url{https://github.com/wufan-tb/yolo_slowfast}}
}

Stargazers over time

Stargazers over time

yolo_slowfast's People

Contributors

wufan-tb avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.