Comments (10)
@ZohaibAhmed Unfortunately I don't see the same error on my end - can you do me a small favor? If you have an IDE with breakpoints can you check which function is causing that in gen_tacotron.py (should be somewhere in the loop starting on line 91)?
If you don't have breakpoints you can just print('a', True), print('b', True) after each function in that loop to see what's throwing the error.
Thanks.
from wavernn.
looks like the issue is on the vocoder generate function in fatchord_wavernn, specifically when it calls:
h1 = rnn1(x, h1)
Note, that just using the pretrained model out of the box seems to work. It's just when I train the model further, the error occurs.
More details about my setup:
ubuntu16.04
pytorch=1.0.0
cuda10.0
cudnn7.4.1_1
GPU: RTX 2080 Ti
from wavernn.
@ZohaibAhmed can I get the exact steps you went through to get that error? Have you tried training a fresh model for a couple of epochs and then tried generating?
Also is there no other error message besides "Floating point exception (core dumped)"?
from wavernn.
@fatchord - training a model from scratch seems to work.
The exact steps I did were as follows:
- take your pretrained models
- get a different dataset, run preprocessor on that (the dataset is structured exactly like LJ)
Input File : '100.wav'
Channels : 1
Sample Rate : 22050
Precision : 16-bit
Duration : 00:00:03.42 = 75411 samples ~ 256.5 CDDA sectors
File Size : 151k
Bit Rate : 353k
Sample Encoding: 16-bit Signed Integer PCM
- Run
train_tacotron.py
for a bit. - Run
gen_tacotron.py
after the first checkpoint (i made it after 500 steps instead of the default).
And that's how I get to that error. Even if i keep the WaveRNN as the pretrained model, it still results in the Floating point exception (core dumped)
. Theres no other stack trace.
from wavernn.
@ZohaibAhmed can you try training LJ from scratch to see if you get the same error?
from wavernn.
@fatchord training Tacotron from scratch makes it work. But I don't have enough data for my own dataset to effectively train the model.
Have you had any success with fine-tuning?
EDIT: the main issue seems to be that the decoder is producing all silent values
It looks like the shape of the output from the original pretrained model is different then when I train on top of it:
Original:
torch.Size([1, 80, 338])
Tuned:
torch.Size([1, 80, 1])
Looks like I hit the condition where if silent frames are present:
if (mel_frames < -3.8).all() : break
This is what the alignment plot looks like while training tacotron:
from wavernn.
@ZohaibAhmed I met the same error. The reason is that the first frame of mel_frames
is all silence (< -3.8), which makes the tacotron output empty. You could fix that by using the following code:
if (mel_frames < -3.8).all() and i > 10 : break
from wavernn.
@candlewill Nice catch, I'll push a fix for that later today.
from wavernn.
@candlewill - I still largely get silence (with some static). Did you try to train your model on top of the checkpoint that @fatchord provided? Or did you just train it from scratch?
from wavernn.
Tacotron has been updated to fix the premature stopping of generation.
from wavernn.
Related Issues (20)
- TTS not generating output even after 900k steps of tacotron model HOT 4
- Infinite loop during create_training_data.py
- Slow tacotron training 1step/sec on AWS p3.2xlarge (Tesla V100) HOT 1
- Using wavernn pretrained model, loss stuck at 5.6
- Can I use pretrained models with different hparams settings?
- sentence long problem
- Train WaveRnn AttributeError HOT 5
- ValueError - gen_tacotron.py HOT 1
- Error During Computing Consensus Step HOT 1
- adding support for windows sapi5
- why do you minus 2 in preprocessing ?
- AttributeError: module 'librosa' has no attribute 'output' HOT 4
- data\\dataset.pkl isssue HOT 1
- [feature request] dynamic batch size during WaveRNN training depending on free/total GPU memory
- Tacotron to Onnx HOT 1
- Where is the audio file for which itis generating the text? HOT 2
- (Solved, but can be useful to someone) Problems getting the project working for the first time
- spectrogram (image_-to-wav HOT 1
- Help
- Is it possible to generate music using WaveRNN?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from wavernn.