Comments (14)
Hi everyone,
I think this project is not useful. There are other functional, better and scalable projects. You can search for better lip sync alternatives in github and you will find these.
The best projects that I found are:
DINet (audio input). This project was released a month ago. Unfortunately the autor said that has not time to keep it updated, but there is a functional version that you can use and train.
I am about to train a model. I downloaded the AVSpeech dataset, I used OpenFace to detect the landmarks and this week I will start a training with a little part of the dataset to test the training.
Anyway, there is a pretrained model that you can use if you want a demo.
https://github.com/MRzzm/DINet
Geneface (NeRF). the author is very attentive to new incidences and will answer you. It is a project that for the moment is kept up to date:
https://github.com/yerfor/GeneFace
Wav2lip with GFPGAN:
It is the original project with face restoration. The inference is quite slow because it has to restore all the frames. But the quality is good enough at all.
https://github.com/ajay-sainy/Wav2Lip-GFPGAN
Also there is a list of new lip sync projects at:
https://github.com/YunjinPark/awesome_talking_face_generation
I hope this is helpful for you 👍
David Martin Rius
from wav2lip_288x288.
Great. I didn't know about this kind of service. I can't wait to try it :D
I will have to do a lot of checkpoints during the trainings because I have seen that the service can be interrupted depending on demand. It totally pays off though because of the cost savings.
I am very grateful for your help. Thank you very much! 😊
Unfortunately, you won't be able to get success in training syncnet without modifications to this repo
from wav2lip_288x288.
but how? are we disscuss at github? or a private group chat ?
Author had released stripped down version of his developments for some reason. Maybe NDA
yes, this project already cost me some weeks,the loss do not drop around 0.51 i think the dataset process is the biggest reason,read issue ask for help . some bugs have find and can be bugfixed but not. so this project is just make a contribute to power stations 。
from wav2lip_288x288.
Hi everyone,
I think this project is not useful. There are other functional, better and scalable projects. You can search for better lip sync alternatives in github and you will find these.
The best projects that I found are:
DINet (audio input). This project was released a month ago. Unfortunately the autor said that has not time to keep it updated, but there is a functional version that you can use and train. I am about to train a model. I downloaded the AVSpeech dataset, I used OpenFace to detect the landmarks and this week I will start a training with a little part of the dataset to test the training. Anyway, there is a pretrained model that you can use if you want a demo. https://github.com/MRzzm/DINet
Geneface (NeRF). the author is very attentive to new incidences and will answer you. It is a project that for the moment is kept up to date: https://github.com/yerfor/GeneFace
Also there is a list of new lip sync projects at: https://github.com/YunjinPark/awesome_talking_face_generation
I hope this is helpful for you 👍
David Martin Rius
thank you! David,you are a real githuber
from wav2lip_288x288.
got it!
from wav2lip_288x288.
1 - This also depends on your dataset, how much your dataset is? for LRW that could be more than a month to train sycnnet & wav2lip, the hardest part is syncnet which could make more than 3 weeks for tuning parameters
2 - you can calculate price for two month in at least.
from wav2lip_288x288.
Sorry, I overlooked it. I took the AVSpeech dataset for granted.
Although if you say that with LRW it can be more than a month and that I can calculate a price for a couple of months I have an idea of the cost.
Could you suggest me some good and ""economical"" service that offers uninterrupted and reliable service?
The GPU's from google, amazon, and bigger companies are quite expensive.
Thanks!
from wav2lip_288x288.
you should request spot instance of azure or preemptive of google, which can reduce 90% cost of your machine
from wav2lip_288x288.
Great. I didn't know about this kind of service. I can't wait to try it :D
I will have to do a lot of checkpoints during the trainings because I have seen that the service can be interrupted depending on demand. It totally pays off though because of the cost savings.
I am very grateful for your help.
Thank you very much! 😊
from wav2lip_288x288.
Great. I didn't know about this kind of service. I can't wait to try it :D
I will have to do a lot of checkpoints during the trainings because I have seen that the service can be interrupted depending on demand. It totally pays off though because of the cost savings.
I am very grateful for your help. Thank you very much! 😊Unfortunately, you won't be able to get success in training syncnet without modifications to this repo
Absolutely
from wav2lip_288x288.
Great. I didn't know about this kind of service. I can't wait to try it :D
I will have to do a lot of checkpoints during the trainings because I have seen that the service can be interrupted depending on demand. It totally pays off though because of the cost savings.
I am very grateful for your help. Thank you very much! blushUnfortunately, you won't be able to get success in training syncnet without modifications to this repo
@NikitaKononov Thanks for letting me know this. Some months ago I asked it in
#16
@hannarud told she used BCEwithLogits to train color_syncnet_train.py
Unfortunately, I got the same result and I have not enough experience to solve it.
@NikitaKononov Can you help me to make the appropriate modifications to train it?
Thank you!
from wav2lip_288x288.
Great. I didn't know about this kind of service. I can't wait to try it :D
I will have to do a lot of checkpoints during the trainings because I have seen that the service can be interrupted depending on demand. It totally pays off though because of the cost savings.
I am very grateful for your help. Thank you very much! 😊Unfortunately, you won't be able to get success in training syncnet without modifications to this repo
but how? are we disscuss at github? or a private group chat ?
from wav2lip_288x288.
but how? are we disscuss at github? or a private group chat ?
Author had released stripped down version of his developments for some reason. Maybe NDA
from wav2lip_288x288.
By the way, I forgot to add another interesting project that you can use. Check the above message, I edited it.
from wav2lip_288x288.
Related Issues (20)
- the input of lpips loss HOT 1
- High resolution dataset HOT 1
- Hi sir, I am a beginner and I would like to inquire whether I should prepare a video of no less than 288 or a video of 384
- Find friends who are training models and share ideas with them.Welcome HOT 3
- Train syncnet use SyncNet_color_384 but train wav2lip use SyncNet_color? HOT 1
- When I use hq_wav2lip_sam_train.py。 HOT 3
- DINet implementation HOT 1
- video clips length
- train_syncnet_sam.py is not using GPU (RTX 4090) HOT 1
- What indicator represents the end of training hq_wav2lip_sam_train? HOT 4
- Why my train loss after introducing sync loss? HOT 4
- How to train HOT 6
- Why can’t training start? HOT 2
- do inference
- Generated bottom half face always blur. HOT 2
- Training failed. The lip shape of a character cannot change according to changes in speech HOT 6
- Syncnet loss does not converge HOT 20
- dataset
- DINet HOT 1
- 这个和普通的easyw字幕交换网站lip有什么区别
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from wav2lip_288x288.