No matter which training settings I try, I always get an "IndexError: list index out of range" error. Console output follows:
E:\ai-voice-cloning>set PYTHONUTF8=1
E:\ai-voice-cloning>runtime\python.exe .\src\main.py
2024-03-09 10:33:30 | INFO | rvc.configs.config | Found GPU NVIDIA GeForce RTX 4080
Whisper detected
Traceback (most recent call last):
File "E:\ai-voice-cloning\src\utils.py", line 98, in <module>
from vall_e.emb.qnt import encode as valle_quantize
ModuleNotFoundError: No module named 'vall_e'
Traceback (most recent call last):
File "E:\ai-voice-cloning\src\utils.py", line 118, in <module>
import bark
ModuleNotFoundError: No module named 'bark'
[textbox, textbox, radio, textbox, dropdown, audio, number, slider, number, slider, slider, slider, radio, slider, slider, slider, slider, slider, slider, slider, checkboxgroup, checkbox, checkbox]
[dropdown, slider, dropdown, slider, slider, slider, slider, slider]
Running on local URL: http://127.0.0.1:7860
To create a public link, set `share=True` in `launch()`.
Loading TorToiSe... (AR: E:\ai-voice-cloning\models\tortoise\autoregressive.pth, diffusion: ./models/tortoise/diffusion_decoder.pth, vocoder: bigvgan_24khz_100band)
Hardware acceleration found: cuda
use_deepspeed api_debug False
E:\ai-voice-cloning\runtime\lib\site-packages\torch\nn\utils\weight_norm.py:30: UserWarning: torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm.
warnings.warn("torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm.")
Loading tokenizer JSON: ./modules/tortoise-tts/tortoise/data/tokenizer.json
Loaded tokenizer
Loading autoregressive model: E:\ai-voice-cloning\models\tortoise\autoregressive.pth
Loaded autoregressive model
Loaded diffusion model
Loading vocoder model: bigvgan_24khz_100band
Loading vocoder model: bigvgan_24khz_100band.pth
Removing weight norm...
Loaded vocoder model
Loaded TTS, ready for generation.
Unloaded TTS
Spawning process: train.bat ./training/neeko/train.yaml
[Training] [2024-03-09T10:35:22.642353]
2024-03-09 10:35:22 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:22.645354] E:\ai-voice-cloning>set PYTHONUTF8=1
2024-03-09 10:35:22 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:22.648355]
2024-03-09 10:35:22 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:22.650356] E:\ai-voice-cloning>.\runtime\python.exe .\src\train.py --yaml "./training/neeko/train.yaml"
2024-03-09 10:35:22 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:22 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:22 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/reset "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:24.427039] [2024-03-09 10:35:24,427] torch.distributed.elastic.multiprocessing.redirects: [WARNING] NOTE: Redirects are currently not supported in Windows or MacOs.
2024-03-09 10:35:24 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:24 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:24 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/reset "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.472498] 24-03-09 10:35:26.471 - INFO: name: neeko
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.475499] model: extensibletrainer
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.478500] scale: 1
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.480500] gpu_ids: [0]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.484501] start_step: 0
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.486502] checkpointing_enabled: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.489502] fp16: False
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.491503] bitsandbytes: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.493503] gpus: 1
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.496504] datasets:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.498505] train:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.500504] name: training
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.502505] n_workers: 2
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.504506] batch_size: 40
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.507506] mode: paired_voice_audio
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.509520] path: ./training/neeko/train.txt
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.512507] fetcher_mode: ['lj']
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.514507] phase: train
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.516508] max_wav_length: 255995
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.518509] max_text_length: 200
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.520509] sample_rate: 22050
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.522510] load_conditioning: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.525510] num_conditioning_candidates: 2
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.527511] conditioning_length: 44000
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.529511] use_bpe_tokenizer: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.532512] tokenizer_vocab: ./modules/tortoise-tts/tortoise/data/tokenizer.json
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.534513] load_aligned_codes: False
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.537513] data_type: img
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.539526] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.541514] val:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.543514] name: validation
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.545515] n_workers: 2
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.548515] batch_size: 4
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.550516] mode: paired_voice_audio
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.553517] path: ./training/neeko/validation.txt
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.556517] fetcher_mode: ['lj']
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.558517] phase: val
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.560518] max_wav_length: 255995
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.562519] max_text_length: 200
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.565519] sample_rate: 22050
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.567519] load_conditioning: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.569533] num_conditioning_candidates: 2
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.572521] conditioning_length: 44000
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.574521] use_bpe_tokenizer: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.577522] tokenizer_vocab: ./modules/tortoise-tts/tortoise/data/tokenizer.json
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.579523] load_aligned_codes: False
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.581523] data_type: img
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.583523] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.585524] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.587524] steps:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.589546] gpt_train:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.592525] training: gpt
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.594527] loss_log_buffer: 500
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.596526] optimizer: adamw
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.598526] optimizer_params:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.601528] lr: 1e-05
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.604528] weight_decay: 0.01
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.606528] beta1: 0.9
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.608529] beta2: 0.96
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.610529] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.612530] clip_grad_eps: 4
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.615530] injectors:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.617531] paired_to_mel:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.619532] type: torch_mel_spectrogram
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.622532] mel_norm_file: ./modules/tortoise-tts/tortoise/data/mel_norms.pth
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.624532] in: wav
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.626533] out: paired_mel
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.628533] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.630534] paired_cond_to_mel:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.632534] type: for_each
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.635535] subtype: torch_mel_spectrogram
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.637535] mel_norm_file: ./modules/tortoise-tts/tortoise/data/mel_norms.pth
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.639535] in: conditioning
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.642537] out: paired_conditioning_mel
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.646538] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.648538] to_codes:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.650538] type: discrete_token
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.652538] in: paired_mel
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.655539] out: paired_mel_codes
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.657540] dvae_config: ./models/tortoise/train_diffusion_vocoder_22k_level.yml
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.659540] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.662541] paired_fwd_text:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.664541] type: generator
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.666542] generator: gpt
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.668543] in: ['paired_conditioning_mel', 'padded_text', 'text_lengths', 'paired_mel_codes', 'wav_lengths']
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.671543] out: ['loss_text_ce', 'loss_mel_ce', 'logits']
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.673556] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.675544] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.678545] losses:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.680545] text_ce:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.682545] type: direct
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.684545] weight: 0.02
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.686546] key: loss_text_ce
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.689559] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.691548] mel_ce:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.693548] type: direct
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.695548] weight: 1
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.697549] key: loss_mel_ce
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.700550] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.702550] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.704551] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.706551] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.709552] networks:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.711552] gpt:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.713553] type: generator
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.715553] which_model_G: unified_voice2
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.718553] kwargs:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.720554] layers: 30
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.722554] model_dim: 1024
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.724555] heads: 16
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.727556] max_text_tokens: 402
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.729556] max_mel_tokens: 604
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.731557] max_conditioning_inputs: 2
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.733557] mel_length_compression: 1024
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.736558] number_text_tokens: 256
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.738558] number_mel_codes: 8194
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.740559] start_mel_token: 8192
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.743559] stop_mel_token: 8193
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.745559] start_text_token: 255
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.747561] train_solo_embeddings: False
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.750561] use_mel_codes_as_input: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.752561] checkpointing: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.755562] tortoise_compat: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.757562] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.760563] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.763564] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.765564] path:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.767564] strict_load: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.769565] pretrain_model_gpt: ./models/tortoise/autoregressive.pth
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.772565] root: ./
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.774566] experiments_root: ./training\neeko\finetune
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.776566] models: ./training\neeko\finetune\models
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.778567] training_state: ./training\neeko\finetune\training_state
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.781567] log: ./training\neeko\finetune
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.783568] val_images: ./training\neeko\finetune\val_images
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.785569] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.787570] train:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.790570] niter: 1600
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.792570] warmup_iter: -1
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.794570] mega_batch_factor: 10
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.796571] val_freq: 800
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.799572] ema_enabled: False
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.801572] default_lr_scheme: CosineAnnealingLR_Restart
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.803573] T_period: [200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 
200, 200, 200, 200, 200, 200, 200, 200, 200, 200, 200]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.806573] warmup: 0
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.809586] eta_min: 1e-08
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.811574] restarts: [200, 400, 600, 800, 1000, 1200, 1400, 1600]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.813575] restart_weights: [0.875, 0.75, 0.625, 0.5, 0.375, 0.25, 0.125, 0.0625]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.815575] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.818576] eval:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.820576] pure: False
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.822577] output_state: gen
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.825578] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.827578] logger:[
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.829578] save_checkpoint_freq: 800
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.832579] visuals: ['gen', 'mel']
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.834580] visual_debug_rate: 800
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.836580] is_mel_spectrogram: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.839594] ]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.841581] is_train: True
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.843581] dist: False
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.845582]
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:26.847583] 24-03-09 10:35:26.472 - INFO: Random seed: 5538
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:26 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/reset "HTTP/1.1 200 OK"
2024-03-09 10:35:27 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:27 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/reset "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:27.673678] 24-03-09 10:35:27.673 - INFO: Number of training data elements: 144, iters: 4
2024-03-09 10:35:27 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:27.676667] 24-03-09 10:35:27.673 - INFO: Total epochs needed: 400 for iters 1,600
2024-03-09 10:35:27 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:27 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:27 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/reset "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:28.574874] E:\ai-voice-cloning\runtime\lib\site-packages\transformers\configuration_utils.py:363: UserWarning: Passing `gradient_checkpointing` to a config initialization is deprecated and will be removed in v5 Transformers. Using `model.gradient_checkpointing_enable()` instead, or if you are using the `Trainer` API, pass `gradient_checkpointing=True` in your `TrainingArguments`.
2024-03-09 10:35:28 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:28.578875] warnings.warn(
2024-03-09 10:35:28 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:28 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:28 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/reset "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:35.076722] 24-03-09 10:35:35.075 - INFO: Loading model for [./models/tortoise/autoregressive.pth]
2024-03-09 10:35:35 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:35 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/api/predict "HTTP/1.1 200 OK"
2024-03-09 10:35:35 | INFO | httpx | HTTP Request: POST http://127.0.0.1:7860/reset "HTTP/1.1 200 OK"
[Training] [2024-03-09T10:35:35.562853] 24-03-09 10:35:35.556 - INFO: Start training from epoch: 0, iter: 0
[Training] [2024-03-09T10:35:37.342177] [2024-03-09 10:35:37,342] torch.distributed.elastic.multiprocessing.redirects: [WARNING] NOTE: Redirects are currently not supported in Windows or MacOs.
[Training] [2024-03-09T10:35:39.207932] [2024-03-09 10:35:39,207] torch.distributed.elastic.multiprocessing.redirects: [WARNING] NOTE: Redirects are currently not supported in Windows or MacOs.
[Training] [2024-03-09T10:35:40.294527] Disabled distributed training.
[Training] [2024-03-09T10:35:40.295527] Path already exists. Rename it to [./training\neeko\finetune_archived_240309-103526]
[Training] [2024-03-09T10:35:40.295527] Loading from ./models/tortoise/dvae.pth
[Training] [2024-03-09T10:35:40.296527] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.296527] File "E:\ai-voice-cloning\src\train.py", line 72, in <module>
[Training] [2024-03-09T10:35:40.296527] train(config_path, args.launcher)
[Training] [2024-03-09T10:35:40.297527] File "E:\ai-voice-cloning\src\train.py", line 39, in train
[Training] [2024-03-09T10:35:40.297527] trainer.do_training()
[Training] [2024-03-09T10:35:40.297527] File "E:\ai-voice-cloning\src\dlas\train.py", line 406, in do_training
[Training] [2024-03-09T10:35:40.298528] for train_data in tq_ldr:
[Training] [2024-03-09T10:35:40.298528] File "E:\ai-voice-cloning\runtime\lib\site-packages\torch\utils\data\dataloader.py", line 630, in __next__
[Training] [2024-03-09T10:35:40.298528] data = self._next_data()
[Training] [2024-03-09T10:35:40.299528] File "E:\ai-voice-cloning\runtime\lib\site-packages\torch\utils\data\dataloader.py", line 1345, in _next_data
[Training] [2024-03-09T10:35:40.299528] return self._process_data(data)
[Training] [2024-03-09T10:35:40.299528] File "E:\ai-voice-cloning\runtime\lib\site-packages\torch\utils\data\dataloader.py", line 1371, in _process_data
[Training] [2024-03-09T10:35:40.300528] data.reraise()
[Training] [2024-03-09T10:35:40.300528] File "E:\ai-voice-cloning\runtime\lib\site-packages\torch\_utils.py", line 694, in reraise
[Training] [2024-03-09T10:35:40.300528] raise exception
[Training] [2024-03-09T10:35:40.300528] IndexError: Caught IndexError in DataLoader worker process 0.
[Training] [2024-03-09T10:35:40.301528] Original Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.301528] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.301528] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.302528] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.302528] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.302528] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.303529]
[Training] [2024-03-09T10:35:40.303529] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.303529]
[Training] [2024-03-09T10:35:40.304529] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.305530] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.306531] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.306531] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.307530] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.307530] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.307530]
[Training] [2024-03-09T10:35:40.307530] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.308531]
[Training] [2024-03-09T10:35:40.308531] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.308531] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.309541] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.309541] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.309541] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.310530] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.310530]
[Training] [2024-03-09T10:35:40.310530] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.311531]
[Training] [2024-03-09T10:35:40.311531] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.311531] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.312531] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.312531] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.312531] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.312531] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.313532]
[Training] [2024-03-09T10:35:40.584592] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.585593]
[Training] [2024-03-09T10:35:40.585593] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.586638] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.586638] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.586638] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.587593] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.587593] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.587593]
[Training] [2024-03-09T10:35:40.588593] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.588593]
[Training] [2024-03-09T10:35:40.588593] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.588593] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.589593] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.589593] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.590593] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.590593] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.590593]
[Training] [2024-03-09T10:35:40.591594] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.591594]
[Training] [2024-03-09T10:35:40.591594] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.591594] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.592594] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.592594] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.592594] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.593594] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.593594]
[Training] [2024-03-09T10:35:40.593594] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.594595]
[Training] [2024-03-09T10:35:40.595594] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.595594] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.596595] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.596595] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.596595] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.597596] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.597596]
[Training] [2024-03-09T10:35:40.597596] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.598595]
[Training] [2024-03-09T10:35:40.598595] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.598595] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.598595] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.599595] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.599595] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.599595] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.600595]
[Training] [2024-03-09T10:35:40.600595] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.600595]
[Training] [2024-03-09T10:35:40.601596] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.601596] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.601596] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.602597] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.602597] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.602597] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.603597]
[Training] [2024-03-09T10:35:40.603597] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.603597]
[Training] [2024-03-09T10:35:40.603597] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.604642] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.605598] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.605598] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.606597] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.606597] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.606597]
[Training] [2024-03-09T10:35:40.607643] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.607643]
[Training] [2024-03-09T10:35:40.607643] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.608598] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.608598] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.608598] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.609599] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.609609] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.609609]
[Training] [2024-03-09T10:35:40.610597] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.610597]
[Training] [2024-03-09T10:35:40.610597] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.611598] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.611598] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.611598] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.612599] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.612599] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.612599]
[Training] [2024-03-09T10:35:40.613599] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.613599]
[Training] [2024-03-09T10:35:40.613599] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.614599] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.615600] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.615600] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.616600] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.616600] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.616600]
[Training] [2024-03-09T10:35:40.617599] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.617599]
[Training] [2024-03-09T10:35:40.617599] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.618599] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.618599] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.618599] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.619600] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.619600] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.619600]
[Training] [2024-03-09T10:35:40.620600] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.620600]
[Training] [2024-03-09T10:35:40.620600] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.621601] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.621601] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.621601] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.621601] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.622601] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.622601]
[Training] [2024-03-09T10:35:40.622601] During handling of the above exception, another exception occurred:
[Training] [2024-03-09T10:35:40.623617]
[Training] [2024-03-09T10:35:40.623617] Traceback (most recent call last):
[Training] [2024-03-09T10:35:40.623617] File "E:\ai-voice-cloning\runtime\lib\site-packages\torch\utils\data\_utils\worker.py", line 308, in _worker_loop
[Training] [2024-03-09T10:35:40.624601] data = fetcher.fetch(index)
[Training] [2024-03-09T10:35:40.625601] File "E:\ai-voice-cloning\runtime\lib\site-packages\torch\utils\data\_utils\fetch.py", line 51, in fetch
[Training] [2024-03-09T10:35:40.625601] data = [self.dataset[idx] for idx in possibly_batched_index]
[Training] [2024-03-09T10:35:40.626601] File "E:\ai-voice-cloning\runtime\lib\site-packages\torch\utils\data\_utils\fetch.py", line 51, in <listcomp>
[Training] [2024-03-09T10:35:40.626601] data = [self.dataset[idx] for idx in possibly_batched_index]
[Training] [2024-03-09T10:35:40.626601] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 233, in __getitem__
[Training] [2024-03-09T10:35:40.627602] return self[(index+1) % len(self)]
[Training] [2024-03-09T10:35:40.627602] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 233, in __getitem__
[Training] [2024-03-09T10:35:40.627602] return self[(index+1) % len(self)]
[Training] [2024-03-09T10:35:40.628602] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 233, in __getitem__
[Training] [2024-03-09T10:35:40.628602] return self[(index+1) % len(self)]
[Training] [2024-03-09T10:35:40.628602] [Previous line repeated 97 more times]
[Training] [2024-03-09T10:35:40.629603] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 218, in __getitem__
[Training] [2024-03-09T10:35:40.629603] tseq, wav, text, path, type = self.get_wav_text_pair(
[Training] [2024-03-09T10:35:40.629603] File "E:\ai-voice-cloning\src\dlas\data\audio\paired_voice_audio_dataset.py", line 200, in get_wav_text_pair
[Training] [2024-03-09T10:35:40.630602] audiopath, text, type = audiopath_and_text[0], audiopath_and_text[1], audiopath_and_text[2]
[Training] [2024-03-09T10:35:40.630602] IndexError: list index out of range
[Training] [2024-03-09T10:35:40.630602]
[Training] [2024-03-09T10:35:51.183741]
[Training] [2024-03-09T10:35:51.183741] E:\ai-voice-cloning>pause
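
A note on reading the traceback above: the failing statement indexes each dataset entry at positions 0, 1 and 2, so any entry with fewer than three items would raise this IndexError. One possible cause, which the log does not confirm, is a malformed row in the dataset list file (for example a blank line, or a line missing its transcription). Below is a minimal sketch for checking that. It assumes the list is a plain text file with one `path|text` row per line. The `|` separator, the two-field minimum, and the helper name `find_short_rows` are all assumptions for illustration, not something taken from the DLAS code.

```python
from typing import Iterable, List, Tuple


def find_short_rows(
    lines: Iterable[str],
    min_fields: int = 2,   # assumed: audio path + transcription
    sep: str = "|",        # assumed field separator
) -> List[Tuple[int, str]]:
    """Return (line_number, row) for every row that looks malformed.

    A row is flagged when it has fewer than `min_fields` fields, or when
    any of the first `min_fields` fields is empty or whitespace only.
    """
    bad: List[Tuple[int, str]] = []
    for number, raw in enumerate(lines, start=1):
        row = raw.rstrip("\r\n")          # drop the line ending only
        fields = row.split(sep)
        too_few = len(fields) < min_fields
        has_empty = any(not f.strip() for f in fields[:min_fields])
        if too_few or has_empty:
            bad.append((number, row))
    return bad


# In-memory demo rows (made up for illustration, not from the real dataset)
sample = [
    "audio/a.wav|hello there\n",
    "\n",
    "audio/b.wav\n",
    "audio/c.wav|\n",
]
for number, row in find_short_rows(sample):
    print(f"line {number}: {row!r}")
```

To run it against a real file, pass an open file handle instead of the sample list, e.g. `find_short_rows(open(path, encoding="utf-8"))`, where `path` is wherever the training list lives in your setup (hypothetical here). If it reports nothing, the list file is probably not the cause and the problem lies elsewhere in how the entries are built.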