GithubHelp home page GithubHelp logo

visinf / dense-ulearn-vos Goto Github PK

View Code? Open in Web Editor NEW
182.0 9.0 21.0 7.66 MB

Dense Unsupervised Learning for Video Segmentation (NeurIPS*2021)

License: Apache License 2.0

Python 96.98% Shell 3.02%
unsupervised-learning video-object-segmentation neurips-2021

dense-ulearn-vos's People

Contributors

arnike avatar sroth-visinf avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

dense-ulearn-vos's Issues

incomplete training samples when using YoutubeVOS dataset

After downloading the filelist.txt for YoutubeVOS, I found that the images are sampled per 5 frames, which is the fully supervised setting, since only one out of every five frames will be annotated.
But under self-supervised setting, previous methods (like MAST), are using the full version with all training frames.
Have you tried this later one ?

TypeError: forward() missing 1 required positional argument: 'frames'

In infer_vos.py, Line 211 and 237, I've got the TypeError: forward() missing 1 required positional argument: 'frames' , after I change them as keyward arguments (frames=frames[:1]), the error solved. Is it a bug or something related to my environment? I haven't found the reason yet since the frames is not a keyward arguments in framework.py

By the way, does your evaluation script support Multi-GPU inference? It seems that inference on YouTube-VOS will take a very long time ?

Best,

inference error

the inference result is black

Available threads: 12
Loaded 2 sequences
Dataloader: filelists/val_ytvos2018_test # 271
filelists/val_ytvos2018_test: no augmentation
Sequence 00 | 0062f687f1

..........................................................................................<
Sequence 01 | 00f88c4f0a
...................................................................................................................................................................................<
984.928 elapsed: Inference completed

image

GPU out of memory when running infer_vos.sh/py

I am trying to run bash ./launch/infer_vos.sh ytvos, but am getting errors of "GPU out of memory". Trying to reduce the batch_size down to 8, 4, 2, 1, but still getting the error. I have nVidia K2000, with only 4G GPU memory. Any suggestions/advice how to get around the issue? Thanks.

train on youtube-vos

Grear work! Thanks for sharing your code!

I use the default training configure of ytvos to train the network.
But I only got best performance J&F=65.5 at epoch 490.
Is the default configure supposed to have this performance?
How can I get the best performance like your provided checkpoint?
I'd appreciate it if you could point out what I did wrong.
image

Are you training your own model or a pretrained model (Resnet18) ?

In the readme file you're training the data on resnet, what about your own model ? , Are you contributing in the data preprocessing level or you created your own framework.? If so, why it appears that you are training the data not with you own model(framework), but with the pretrained model (resnet)?

Thanks in advance.

colab

please add a google colab for inference

Add indexing='ij' in torch.meshgrid()

https://github.com/visinf/dense-ulearn-vos/blob/88e5e3518cb0c4a636ad652a3c0f544a370f4c6c/labelprop/crw.py#L234C7-L234C7

Hi,

Could you add indexing='ij' in torch.meshgrid() to suppress the warning?

/home/user/anaconda3/envs/dense-ulearn-vos/lib/python3.10/site-packages/torch/functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at /opt/conda/conda-bld/pytorch_1670525552843/work/aten/src/ATen/native/TensorShape.cpp:3190.)
return _VF.meshgrid(tensors, **kwargs) # type: ignore[attr-defined]

See : https://pytorch.org/docs/stable/generated/torch.meshgrid.html for reason behind warning.

Regards
Akshit Maurya

all data not found when training

I uploaded the training data and put it in the path that exists on the data/filelists, everytime I am facing this error : (this is one example: AssertionError: cfg.DATASET.ROOT/ytvos/train/JPEGImages/003234408d/00000.jpg not found. I also brought a new data that I wanted to train the model with and it keeps giving this error. it seems that it is considering that the data is not there where it is there and I am pretty sure of the path. Any suggestions please ?

image

Train on custom video data

What kind of preprocessing and preparation are required if I want to train from scratch on my own video data? Thanks

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.