GithubHelp home page GithubHelp logo

minghanli / boxvis Goto Github PK

View Code? Open in Web Editor NEW
11.0 2.0 0.0 1.63 MB

Code release for "BoxVIS: Video Instance Segmentation with Box Annotation"

Python 92.45% Shell 0.14% C++ 0.74% Cuda 6.68%
video-instance-segmentation box-supervised-segmentation pairwise-affinity-segmentation-loss weak-supervision-segmentation

boxvis's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

boxvis's Issues

How to train BoxVIS on OVIS only?

I want to train a BoxVIS model only with OVIS dataset, and I think the following params in yaml file should be adjusted as below:

    DATASETS:
        DATASET_RATIO: []
        TRAIN: ("ovis_train_org",),   # registered in the **buildin** file of original OVIS train subset
        TEST: ("ovis_val", )
    SEM_SEG_HEAD:
        NUM_CLASSES: 25
    MODEL:
        BoxVIS:
            BVISD_ENABLED: False

Are there any other config modifications I should adopt? Thanks for answering.

precision of the baseline model

Thank you for your excellent work!
In your paper, the mentioned fully-supervised Mask2Former model for performance comparing achieves 43.2 mAP on YouTube-VIS-2021 validation subset. I tried to reproduce this baseline according to the cues from both the paper and the repository of yours, but I still get around 40.6 mAP as reported in the original Mask2Former paper.

I set the following training configs:

    cfg.INPUT.SAMPLING_FRAME_NUM = 3
    cfg.INPUT.SAMPLING_FRAME_RANGE = 10
    cfg.INPUT.MIN_SIZE_TRAIN = (320, 352, 384, 416, 448, 480, 512)
    cfg.SOLVER.IMS_PER_BATCH = 16
    cfg.SOLVER.BASE_LR = 0.0001
    cfg.SOLVER.STEPS = (12000,)
    cfg.SOLVER.MAX_ITER = 15000
    cfg.DATASETS.TRAIN: ("ytvis_2021_train", )
    cfg.DATASETS.TEST: ("ytvis_2021_val", )

Can you tell me how I can reproduce the fully-supervised Mask2Former precision of 43.2 mAP?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.