boredbean / vits-finetuning

This project is forked from sayass/vits-finetuning

Fine-Tuning your VITS model using a pre-trained model

License: MIT License

Python 94.37% Jupyter Notebook 4.84% Cython 0.79%

vits-finetuning's Introduction

text cleaner from https://github.com/CjangCjengh/vits

original repo: https://github.com/jaywalnut310/vits

Online training and inference

colab

See vits-finetuning

How to use

(Recommended) Python 3.7

Only Japanese datasets can be used for fine-tuning in this repo.

Clone this repository

git clone --single-branch --branch master https://github.com/BoredBean/vits-finetuning.git

Install requirements

pip install -r requirements.txt

Download pre-trained model

  • G_0.pth
  • D_0.pth
  • Edit "model_dir" (line 152) in utils.py
  • Put pre-trained models in the "model_dir"/checkpoints
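Before training, it can help to confirm that both checkpoints are where the steps above put them. The following is a minimal sketch, not part of the repository: the helper name `missing_checkpoints` is hypothetical, and it assumes the files keep the names G_0.pth and D_0.pth inside "model_dir"/checkpoints.

```python
from pathlib import Path


def missing_checkpoints(model_dir, names=("G_0.pth", "D_0.pth")):
    """Return the expected checkpoint file names that are not present yet.

    Hypothetical helper: looks in <model_dir>/checkpoints, the location
    described in the list above.
    """
    ckpt_dir = Path(model_dir) / "checkpoints"
    return [name for name in names if not (ckpt_dir / name).is_file()]
```

Calling `missing_checkpoints("path/to/model_dir")` returns an empty list once both files are in place.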

If you need to customize "n_speakers", please replace the pre-trained model with these two.

Create datasets

  • Speaker IDs must be between 0 and 803.
  • About 50 audio-text pairs are sufficient, and 100-600 epochs can give quite good results, though more data may help.
  • Resample all audio to 22050 Hz, 16-bit, mono WAV files.
  • Audio files should be between 1 s and 10 s long.
  • Each filelist line has the format:
path/to/XXX.wav|speaker id|transcript
  • Example
dataset/001.wav|10|こんにちは。
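The audio constraints in the list above can be checked with Python's standard `wave` module. This is a rough sketch rather than a tool shipped with the repo: `check_wav` is a hypothetical name, the 1 s and 10 s limits are treated as inclusive, and `wave` only reads PCM WAV files, so other encodings raise `wave.Error`.

```python
import wave


def check_wav(path, rate=22050, sample_width=2, channels=1,
              min_seconds=1.0, max_seconds=10.0):
    """Return a list of problems found in one WAV file (empty if none).

    Hypothetical helper; sample_width=2 bytes corresponds to 16-bit audio.
    """
    problems = []
    with wave.open(str(path), "rb") as wf:
        frame_rate = wf.getframerate()
        if frame_rate != rate:
            problems.append(f"sample rate is {frame_rate} Hz, expected {rate} Hz")
        if wf.getsampwidth() != sample_width:
            problems.append(f"sample width is {wf.getsampwidth() * 8}-bit, "
                            f"expected {sample_width * 8}-bit")
        if wf.getnchannels() != channels:
            problems.append(f"{wf.getnchannels()} channels, expected {channels}")
        # Duration in seconds = number of frames / frames per second.
        duration = wf.getnframes() / frame_rate if frame_rate else 0.0
    if not (min_seconds <= duration <= max_seconds):
        problems.append(f"duration is {duration:.2f} s, expected "
                        f"{min_seconds}-{max_seconds} s")
    return problems
```

Files that return a non-empty list would need to be resampled, converted, or trimmed before they go into a filelist.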

For complete examples, please see filelists/miyu_train.txt and filelists/miyu_val.txt.
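A filelist can also be sanity-checked line by line before preprocessing. The sketch below is illustrative only: `parse_filelist_line` is a hypothetical helper, it assumes exactly three `|`-separated fields as in the format shown above, and it treats the speaker ID range 0-803 as inclusive.

```python
def parse_filelist_line(line, max_speaker_id=803):
    """Split one 'path|speaker id|transcript' line and validate its fields.

    Hypothetical helper: raises ValueError with a short reason on bad input.
    """
    parts = line.rstrip("\r\n").split("|")
    if len(parts) != 3:
        raise ValueError(f"expected 3 fields separated by '|', got {len(parts)}")
    path, speaker, transcript = parts
    try:
        speaker_id = int(speaker)
    except ValueError:
        raise ValueError(f"speaker id {speaker!r} is not an integer") from None
    if not 0 <= speaker_id <= max_speaker_id:
        raise ValueError(f"speaker id {speaker_id} is outside 0-{max_speaker_id}")
    if not path.strip() or not transcript.strip():
        raise ValueError("path and transcript must not be empty")
    return path, speaker_id, transcript
```

Running it over every line of a training or validation filelist surfaces malformed entries early, before they reach preprocess.py.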

Preprocess

python preprocess.py --filelists path/to/filelist_train.txt path/to/filelist_val.txt

Edit "training_files" and "validation_files" in configs/config.json
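The config edit can be done by hand or scripted. The following is a sketch under stated assumptions, not the repository's own tooling: `set_filelists` and `update_config_file` are hypothetical names, and the code assumes the two keys sit under a "data" section as in the upstream VITS configs, falling back to the top level otherwise. The paths to pass are whichever filelists preprocess.py produced for you.

```python
import json


def set_filelists(config, train_path, val_path):
    """Return a copy of a config dict with both filelist paths replaced."""
    updated = json.loads(json.dumps(config))  # deep copy of JSON-compatible data
    # Assumption: keys live under "data" (as in upstream VITS); else top level.
    data = updated.get("data")
    section = data if isinstance(data, dict) else updated
    section["training_files"] = train_path
    section["validation_files"] = val_path
    return updated


def update_config_file(config_path, train_path, val_path):
    """Rewrite the JSON config file in place with the new filelist paths."""
    with open(config_path, encoding="utf-8") as f:
        config = json.load(f)
    updated = set_filelists(config, train_path, val_path)
    with open(config_path, "w", encoding="utf-8") as f:
        json.dump(updated, f, indent=2, ensure_ascii=False)
```

For example, `update_config_file("configs/config.json", "path/to/train_filelist", "path/to/val_filelist")`, where both filelist paths are placeholders. Note that rewriting the file this way normalizes its indentation.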

Build monotonic alignment search

cd monotonic_align
python setup.py build_ext --inplace
cd ..

Train

# Multiple speakers
python train_ms.py -c configs/config.json -m checkpoints

vits-finetuning's People

Contributors

boredbean, sayass
