GithubHelp home page GithubHelp logo

jizhihang / kaori-ins Goto Github PK

View Code? Open in Web Editor NEW

This project forked from ledduy/kaori-ins

0.0 2.0 0.0 62.42 MB

KAORI-INS - A Framework for Instance Search

Python 2.18% MATLAB 25.96% HTML 2.51% PHP 13.78% Shell 21.59% C++ 0.46% M 0.05% Makefile 1.62% CSS 0.80% Clean 0.20% TeX 0.09% C 29.82% Objective-C 0.62% Roff 0.32%

kaori-ins's Introduction

kaori-ins

KAORI-INS - A Framework for Instance Search

  1. IDE Setup
  1. New repository
  1. Clone
  • to local D:\ for coding
  1. Datasets
  • TV2011:
  • 3 keyframes/sec
  • Keyframe size; 352x288
  • 20,982 clips --> 1,650,827 keyframes
  • 25 topics (9023-9047)
  • All videos were chopped into 20 to 10s clips using ffmpeg
  • ~ 100 hours, BBC rushes
  • Submission format:
  • TV2012
  • 3 keyframes/sec
  • 33 keyframes/clip
  • 74,958 clips --> 2,256,930
  • 21 topics (9048-9068)
  • Keyframe size: 640x480
  • Flickr video
  • Submission format:
  • TV2013:
  • 5 keyframes/sec
  • Keyframe size: 768x576
  • 471,526 shots --> 2,245,924 keyframes
  • 30 topics (9069-9098)
  • BBC EastEnders, approximately 244 video files (totally 300 GB, 464 h)
  • Submission format:
  1. Steps 5.1. Generate metadata (DONE Aug 12)
  • Code: php -f ksc-Tool-GenerateMetaData.php 2011|2012|2013 test|query|queryext50
  • Generate metadata for new corresponding pats, e.g. test2013-new
  • For tv2011, tv2012 --> copy data to keyframe-5/tv2012/test2012-new/TRECVID2012_1/*.tar
  • For tv2013 --> only 5KF/shot are copied and packed in .tar file.
  • Running time on SGE (24 cores) for tv2013 is 16 hours, tv2012 & tv2011 is 4 hours.

5.2. Generate metadata for subtest --> only subtest2012-new

  • Code: ksc-Tool-Tool-GenerateMetaDataForSubTest.php 2012
  • Note: Copy *.tar files of selected shots to new dir (it is better to use softlink, but here cp is used).
  • Shot sampling rate: $arSamplingRateList = array(2011 => 10, 2012 => 20, 2013 => 20);
  • subtest2012-new:
  • #keyframes: 146,966 (full: 2,256,930)
  • #shots: ~98x50
  • note: shots of groundtruth are also added.

5.2. Extract raw local features using colorsift

  • Code: ksc-Feature-ExtractRawAffCovSIFTFeature-COLORSIFT*.*
  • Max image size: 500x500 (specified in ksc-AppConfig.php)
  • Keypoint detector: HarLap & Dense sampling (step size 6 pixel, 2 scales).
  • Descriptor: SIFT and rgbSIFT.

***colordescriptor ver 3.0 - Processing time:

  • harlap x rgbsift: 12 sec/KF
  • harlap x sift: 7 sec/KF
  • dense6 x rgbsift: 10 sec/KF
  • dense6 x sift: 6 sec/KF
  • note: colordescriptor auto resize large KF into max 500x500, and CPU is usually 2x

5.3. Running DPM model

  • tv2012 ~ 4 sec/keyframe
  • tv2013: 16 sec/keyframe for 2x scale factor on test keyframes, 6 sec/keyframe for normal KF.

*************************** RE-TEST THE FRAMEWORK **********************

6.1. Raw feature extraction

  • Only dense6mul.sift is used for test2012-new

  • Pat are defined in ksc-AppConfigForProject.php

  • Total time: 6*2.25M/3600 = 3,750 hours (Aug23-23:30 --> [Aug25-->10:00 - Last jobs!])

  • 56 cores until Aug25-03:00AM, 550 cores after that (INS deadline)

  • test2011-new

  • Aug26-->23:55 ==> Aug27-->9:55 - 1,000jobs*90mins/job = 1,500 hours (max 280 cores --> 150 cores because colordescriptor requires 1.5-2.0 CPUs *** Clustering

  • Aug27-->10:10 ==> Aug27-->12:00

6.2. Quantization

  • 1K codebook of subtest2012-new

  • Total time: 6.5*2.25/3600 = 4,100 hours --> 20 hours (max 200 cores) --> [Aug25-->23:15 - Aug25-->16:27 - Last jobs! 280 cores]

  • test2011-new Aug27-->16:45 ==> Aug28-->05:00

6.3. Matching

  • Total time: 2,100 jobs (21 queries) x 30 mins/job --> 1,050 hours --> [Aug26--> 16:45 --> Aug26-->20:45 280 cores]

6.4. Ranking and Selecting for DPM

6.5. Running DPM

6.6. Fusion and Evaluation

kaori-ins's People

Contributors

ledduy avatar nvtiep avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.