GithubHelp home page GithubHelp logo

deep-action-proposals's People

Contributors

cabaf avatar escorciav avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

deep-action-proposals's Issues

DAPs paper understanding problem

I don't understand how much proposals are generated in all in one video.During training, only one stream is processed in one video,so the overall number of proposals is K.Is that true?

Simplify compute priors

compute_priors function on data_generation module is a little bit convoluted. The matching of priors to segments should be on other function because we will end-up duplicating code if all the segments of the dataset are not used to compute priors. That is the case of ActivityNet where there is an explicit validation set.

Similar refactoring should be done on tool create_dataset.py

activity-net helper module

We need a module to interact with activity-net dataset. I would recommend to make something similar to thumos14_helper.Thumos14 that generate the CSV-files required to interact with the annotations.

Drop invalid annotations in segment_info

Some videos of Thumos14 are incorrectly annotated e.g. video_validation_0000364 and video_validation_0000856. In order to avoid unpleasant surprises, it's useful to drop rows of the data-frame where t-init or t-end < video-duration

idx_drop = ((df['video-duration'] < df['t-init']) | (df['video-duration'] < df['t-end'])).nonzero()[0]
df.drop(idx_drop, inplace=True)

Bug generate_segments

this line should be f_init + t_size- 1 because the end-frame should be contained on the segment according to segment utilities.

this line imposes a constraint on the initial-frame.

Note: indexing of features with this convention will require an increment on end-frame.
@cabaf

batch_frame_extraction

hi,
i am geting "no such a file or directory" message after runing tools programm batch_frame_extraction.py

improve generate_segments

I wrote this function replicating the idea of the cvpr-paper. It would be great to refine it in the following aspects:

  • Apply intersection-threshold less than 1 just on annotations with length similar or greater that t_size.
  • Segments with high coverage ratio should be more likely to be selected i.e. sampling based on cov_ratio_per_segment.
  • define appropriate values for RATIO_INTERVALS
  • main bug is here. index i is iterating over videos while score.shape[0] corresponds to the number of segments for all the videos. My fault ๐Ÿ˜ž

Note: if t_size is closer to l_size is possible that not enough segments are generated to satisfy a uniform distribution on RATIO_INTERVALS. We should decide how to overcome it.

Utility for merge c3d features

We are interested on retrieve c3d features associated with a segment of interest. Given a video segment, a feature stride, a folder with features of the video.

  • compute list with frames of interest
  • read features on previous list
  • stack features
  • apply a pooling operator, optional
  • save stack, optional

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.