Wangbo Zhao(明先生)'s Projects
The code for CVPR2021 Weakly Supervised Video Salient Object Detection
The code for ICCV2021 Light Field Saliency Detection with Dual Local Graph Learning and Reciprocative Guidance
The code for SCG: Saliency and Contour Guided Salient Instance Segmentation
This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Learning Unsupervised Video Object Segmentation through Visual Attention (CVPR19)
Associating Objects with Transformers for Video Object Segmentation
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
Implementation of Binsformer code
Making big AI models cheaper, easier, and scalable
See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks (CVPR19)
Crossover Learning for Fast Online Video Instance Segmentation (ICCV 2021)
[CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis
Evaluation Framework for DAVIS 2017 Semi-supervised and Unsupervised used in the DAVIS Challenges
Shifting More Attention to Video Salient Objection Detection, CVPR2019
[ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging techniques, while incorporating a differentiable compression rate.
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
Deep Reinforcement Learning
[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models"
Exploring the Limits of Masked Visual Representation Learning at Scale (https://arxiv.org/abs/2211.07636)
A Generalized Framework for Video Instance Segmentation
⚡ Building applications with LLMs through composability ⚡
The official implementation of Latte: Latent Diffusion Transformer for Video Generation.
LeetCode 刷题攻略:配思维导图,将近200道经典算法题目刷题顺序、经典算法模板、共60w字的详细图解,以及难点视频题解。按照刷题攻略上的顺序来刷题,让你在算法学习上不再迷茫!🔥🔥给个star支持一下吧!🚀
[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models