GithubHelp home page GithubHelp logo

pooling strategy about videval HOT 6 CLOSED

vztu avatar vztu commented on July 25, 2024
pooling strategy

from videval.

Comments (6)

vztu avatar vztu commented on July 25, 2024

This is a great question. My procedure in this pooling paper is (take BRSIQUE as example):

  1. First extract 36 features per frame for a video, yielding a total of 300 x 36 frame features, supposing each video has 300 frames.
  2. Average pool frame features for each video, resulting in 1 x 36 features for each video.
  3. Do this on the training set (eg, 500 videos), you get a 500 x 36 feature matrix, and then train the SVR on MOSs. Now you have a quality predictor for any 36-dim BRISQUE feature (Even though trained on videos)
  4. Inference phase on the test set (eg, 100 videos): for a given video with 300 frames, extract 300 x 36 BRISQUE features, then apply the trained SVR model, you get 300 x 1 quality predictions. Now you can apply your favorite (training-free) temporal pooling methods to reduce these time-series scores!

Another method is to assign the video score to each frame and then train on 300x more samples. I quickly tried this one and found that it didn't give better results, but training is much slower since you get 300x more training samples. These two methods both have flaws however. I think a better way is to train the implicit temporal pooling in an end-to-end manner, such as VSFA, or CoINVQ

from videval.

vztu avatar vztu commented on July 25, 2024

For your reference, we have open-sourced several pooling methods here: https://github.com/vztu/Temporal_Pooling

from videval.

kobe233333 avatar kobe233333 commented on July 25, 2024

For your reference, we have open-sourced several pooling methods here: https://github.com/vztu/Temporal_Pooling

Appreciate a lot! I will have a try.

from videval.

kobe233333 avatar kobe233333 commented on July 25, 2024

Another question is whether SVM can be replaced by other regression methods, such as MLP, when performing regression after obtaining features?

from videval.

vztu avatar vztu commented on July 25, 2024

from videval.

kobe233333 avatar kobe233333 commented on July 25, 2024

Yes you can use your favorite regressor. My experience is on relatively small dataset (<5k), SVM usually performs well

On Fri, Oct 8, 2021 at 2:01 AM kobe233333 @.***> wrote: Another question is whether SVM can be replaced by other regression methods, such as MLP, when performing regression after obtaining features? — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#11 (comment)>, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKKGPFQ5CHX3FKMVCEVBJOLUF2JN5ANCNFSM5D7S6KMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub .

Thanks!

from videval.

Related Issues (15)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.