GithubHelp home page GithubHelp logo

flower-butterfly / st-resnet Goto Github PK

View Code? Open in Web Editor NEW

This project forked from feichtenhofer/st-resnet

1.0 1.0 0.0 313 KB

Code release for "Spatiotemporal Residual Networks for Video Action Recognition", NIPS16 & "Spatiotemporal Multiplier Networks for Video Action Recognition", CVPR17.

MATLAB 100.00%

st-resnet's Introduction

================================================================================

Spatiotemporal Residual Networks for Video Action Recognition

This repository contains the code for our NIPS'16 and CVPR'17 papers:

Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes
"Spatiotemporal Residual Networks for Video Action Recognition"
in Proc. NIPS 2016

Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes
"Spatiotemporal Multiplier Networks for Video Action Recognition"
in Proc. CVPR 2017

If you use our code/models/data for your research, please cite our papers:

    @inproceedings{feichtenhofer2016spatiotemporal,
      title={Spatiotemporal residual networks for video action recognition},
      author={Feichtenhofer, Christoph and Pinz, Axel and Wildes, Richard},
      booktitle={Advances in Neural Information Processing Systems (NIPS)},
      pages={3468--3476},
      year={2016}
    }

    @inproceedings{feichtenhofer2017multiplier,
      title={Spatiotemporal multiplier networks for video action recognition},
      author={Feichtenhofer, Christoph and Pinz, Axel and Wildes, Richard P}
      booktitle={Conference on Computer Vision and Pattern Recognition (CVPR)},
      year={2017}
    }

Requirements

The code was tested on Ubuntu 14.04, 16.04 and Windows 10 using MATLAB R2016b and NVIDIA Titan X GPUs.

If you have questions regarding the implementation please contact:

Christoph Feichtenhofer <feichtenhofer AT tugraz.at>

================================================================================

Setup

  1. Download the code git clone --recursive https://github.com/feichtenhofer/st-resnet

  2. Compile the code by running compile.m.

  3. Edit the file cnn_setup_environment.m to adjust the models and data paths.

  4. (Optional) Download pretrained model files and the datasets, linked below and unpack them into your models/data directory. Otherwise the scripts will attempt to download the models at runtime.

  • Optionally you can also pretrain your own twostream base models by running
    1. base_streams/cnn_ucf101_spatial(); to train the appearance network streams.
    2. base_streams/cnn_ucf101_temporal(); to train the optical flow network streams.
  1. Training STResNet_stage1();, STResNet_stage2(); to train the architecture in our NIPS 2016 paper. STMulNet(); to train the architecture in our CVPR 2017 paper.
    • In case you did not download or trained the base models, the script will attempt to download these accordingly.
    • In case you would like to train on the CPU, clear the variable opts.train.gpus
    • In case you encounter memory issues on your GPU, consider decreasing the cudnnWorkspaceLimit (512MB is default)

Models: ST-ResNet

  • Download final models here: (in case you do not, STResNet_stage1();, STResNet_stage2();, and STResNet_test(); will attempt to download the respective models at runtime.)

Models: ST-MulNet

  • Download final models here: (in case you do not, STMulNet(); and STMulNet_test(); will attempt to download the models at runtime.)

Data

Pre-computed optical flow images and resized rgb frames for the UCF101 and HMDB51 datasets

Use it on your own dataset

st-resnet's People

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.