Comments (10)
You could also check https://laion.ai/blog/video2dataset/
from open-sora-plan.
Our preliminary goal is to achieve impressive results in a specific data domain to verify the effectiveness of our pipeline, and then to extend the plan toward generalization. Please keep an eye on our project and feel free to contact us for discussion and potential cooperation.
A random thought: as a community, it would be great to have a Discord channel for discussions and updates.
I should be able to help on the high-quality video data side (along with transcripts), although sourcing captions is a more difficult problem.
Is the domain of the data fixed? If so, do you have more public information available on that?
Regardless, I think having some sort of a data curation pipeline (like the one used in the Stable Video Diffusion paper) would be really nice.
Agreed. After the preliminary validation, we will build a data curation pipeline modeled on successful projects and pay more attention to data.
Cool, sounds like a plan. FWIW, I put together a simple repository that walks through the primary steps of the Stable Video Diffusion curation pipeline: https://github.com/sayakpaul/single-video-curation-svd.
We will check it out. Thanks for your effort!
You could also check data-juicer; it seems useful for video data curation.
https://arxiv.org/abs/2403.06098