Comments (2)
Hi, thanks for your interest in our paper.
- We will include it in the updated version. In fact, mPLUG-2 Base surpasses Singularity (17M) by roughly 7 points on R1 for MSRVTT (41.5 vs. 48.3), and it also outperforms Singularity on DiDeMo in terms of R5 and R10.
- We aim to demonstrate the generalization ability of our method, with its proposed universal layer module, on uni-modal datasets. X-CLIP and UniFormerV2 are designed for general video action recognition and therefore need to follow the standard protocol. However, for pre-training approaches that use much more extra video data (e.g., CoCa, InternVideo, Merlot-Reserve), we cannot ensure a fair comparison. We will cite UniFormerV2 and add a clarification about Kinetics-710 in the updated version.
from alicemind.
Thanks for your answers! Hoping the code will be publicly released soon!