GithubHelp home page GithubHelp logo

neu-vi / sportsslomo Goto Github PK

View Code? Open in Web Editor NEW
59.0 59.0 2.0 61.34 MB

SportsSloMo: A New Benchmark and Baseline Models for Human-centric Video Frame Interpolation, CVPR 2024 (https://arxiv.org/abs/2308.16876)

Home Page: https://neu-vi.github.io/SportsSlomo/

Python 99.37% Shell 0.63%

sportsslomo's People

Contributors

jiabenchen avatar playerkk avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

Forkers

jiabenchen cv-ip

sportsslomo's Issues

splits link unavailable

link here gives "Sorry, the file you have requested does not exist."

Then download our split files for training/testing from this [link](https://drive.google.com/file/d/1qRaJ0i6z0z8pP8fAKjGYljCg4KZMxp6m/view?usp=sharing) to splits/ folder.

Is there any segmentation mask alignment/registration when calculating the aux seg loss?

Hi!
Thanks for sharing the code and data!

I have a question regarding the auxiliary segmentation loss. When you use the segmentation mask of I_t and I_t_hat to calculate the CE loss and dice loss, is there any segment alignment/ registration step to make the two segmentation masks aligned (i.e. same index for same object)? If so, how did you do that?

Thanks again!

Target FPS 7.5 to 30

Hi! Thanks for sharing code and data.

What I want to do is to convert video of 7.5 fps to 30 fps.

  • Interpolate middle 3 frames given first and last frame.
    Then, what if I want to fine-tune with SportsSloMo dataset, what is the correct sampling for this?
  • In SportsSloMo they are converting from 30 fps to 240 fps (interpolate 7 frames)
    image
  • Is it okay if sample frames [0, 4, 8, 12, 16] and [0,16] frames as input and predict [4,8,12] frames ?

Train Validation split

Hi! Thanks for sharing work!

I've downloded data from web-demo project hompage and gonna ask how train_vfi and test_vfi work.
In train/test_vfi.txt files. it contains file names like this ***/****, but raw clip data have names like clips_****.mp4 .
Then how would I split train and test mp4 files ??

Thank you.

Is There a Mistake in Line 59 of dataset.py base_idx = 9 * (index // 9)? Shouldn't it be base_idx = 9 * (index // 7)?

I am working with the dataset in SportsSloMo_EBME/core/dataset.py and I believe there is an error in the definition of base_idx. I suggest the following change:

base_idx = 9 * (index // 9)
โ†“
base_idx = 9 * (index // 7)

With the original code, for example, when the index is 9-17, the base_idx becomes 1, leading to target_idx values of 2, 3, 4, 5, 6, 0, 1, 2, 3. Notice that target_idx=2, 3 are duplicated. Due to this duplication, the dataset size is effectively less than its total size.

This suggests that only 7/9 of the entire dataset is being used for training and testing. Regarding the test data, out of a total of 135072 images, it appears that 30016 images are not being used. ๐Ÿง๐Ÿ’ป๐Ÿ”

Thank you

Number of clips?

Hi, thank you for sharing your work to the public.
I have tried downloading and processing the SportsSlomo dataset.
Seems like it consists of 8498 clips, where as in the arxiv version of the paper, it claims to have 130K clips.
According to the processing code in the repo, I believe 8498 clips is the correct number, but I couldn't find such number on the paper, so I was wondering if I am missing something here.

Is there a difference in counting the number of clips, or is this only a small portion of the full dataset?

The size of dataset

I have downloaded the videos from the given link, but there are only 8000~ clips, is there any preprocessing for cutting short clips? Thx.

missing eval in DefaultPredictor of detectron2_modify

DefaultPredictor of detectron2_modify, only disable parameters grad, but remain model in training, this cause wrong logic when using Mask2Former as criterion.

solution: restore the model.eval() as in original detectron2:
https://github.com/facebookresearch/detectron2/blob/0ae803b1449cd2d3f8fa1b7c0f59356db10b3083/detectron2/engine/defaults.py#L280-L285

another bug: it's loading the first char of a path string

# Load Segmentation Masks
seg = np.load(seg_path_target[0], allow_pickle=True)

solution: remove [0]

other problems:
the readme says pip install -e to install detectron2, this will make our own tools unable to import. for now, need to use this pr facebookresearch/detectron2#5283

there is also missing symbol link of Mask2Former in SportsSloMo_EBME, it's no said in readme, neither tracked in git repo.

the shell command in readme is not perfect, it's common that command needs to be run in parent or sub dir.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.