
brentjohnston / xtts-rvc-ui-extraoptions


This project forked from vali-98/xtts-rvc-ui


A Gradio UI for XTTSv2 and RVC. (Extra Voice Options)

License: MIT License

Shell 0.01% Python 99.97% Batchfile 0.02%

xtts-rvc-ui-extraoptions's Introduction

XTTS-RVC-UI

This is a fork of XTTS-RVC-UI that adds realtime typing, plays the voice back right away whenever a change is made in the interface, and adds temperature and repetition penalty sliders for adjusting your voice. Only the RVC output autoplays.

Note: You can also separately adjust the top_k and top_p settings in the XTTS model's config.json for further tweaking before running start.bat. Here is what I am using for that (experimental):

"top_k": 70, "top_p": 0.95,
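If you would rather not edit the file by hand, the same change can be scripted. This is a minimal sketch, not part of the project: set_sampling is a hypothetical helper name, and it assumes top_k and top_p are top-level keys in config.json. The path to the file is left to you, since the note above only says it is the XTTS model's config.json.

```python
import json
from pathlib import Path


def set_sampling(config_path, top_k=70, top_p=0.95):
    """Rewrite top_k and top_p in a JSON config, leaving other keys intact."""
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))
    # Assumption: both keys live at the top level of config.json.
    config["top_k"] = top_k
    config["top_p"] = top_p
    path.write_text(json.dumps(config, indent=4), encoding="utf-8")
    return config
```

Call it once before launching the UI, for example set_sampling("path/to/config.json"), replacing the placeholder path with the real location of your model's config.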

Note 2: If it did not read the entire text when you finish typing, simply press ., spacebar, or backspace and it will read the entire sentence during a refresh (usually within 1 or 2 seconds). I have found the best results using Dragon NaturallySpeaking with my microphone: it types in the box for me, and a custom Dragon command word, "erase", clears the box. My Dragon step-by-step command runs these steps when "erase" is spoken: Control + A, then Backspace.

Original Repo Info:

This is a simple UI that utilizes Coqui's XTTSv2 paired with RVC functionality to improve output quality.

Prerequisites

  • Requires MSVC - VC 2022 C++ x64/x86 build tools.

Usage

Clone this repository:

git clone https://github.com/Vali-98/XTTS-RVC-UI.git

It is recommended to create a venv.

Then, install the requirements:

pip install -r requirements.txt

If you have a CUDA device available, it is also recommended to install PyTorch with CUDA for faster conversions.

pip install torch==2.1.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu118
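To confirm that the CUDA build was picked up, a quick check along these lines can help. It is only a sketch: pick_device is a hypothetical helper and not something the project provides. It relies on torch.cuda.is_available() and falls back to "cpu" when PyTorch is not installed at all.

```python
import importlib.util


def pick_device():
    """Return "cuda" if PyTorch can see a CUDA device, otherwise "cpu"."""
    if importlib.util.find_spec("torch") is None:
        # PyTorch is not installed in this environment.
        return "cpu"
    import torch

    return "cuda" if torch.cuda.is_available() else "cpu"


print(pick_device())
```

If this prints cpu on a machine with an NVIDIA GPU, the CPU-only PyTorch wheel is probably the one installed.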

Then run start.bat, start.sh, or simply python app.py.

This will create the following folders within the project:

\models\xtts
\rvcs
\voices
  • Relevant models will be downloaded into \models. This will be approximately 2.27GB.
  • You can manually add the desired XTTSv2 model files in \models\xtts.
  • Place RVC models in \rvcs. Rename them as needed. If an identically named .index file exists in \rvcs, it will also be used.
  • Place voice samples in \voices
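
The pairing rule for RVC models and .index files can be checked with a short script. This is a sketch under assumptions: list_rvc_models is a hypothetical helper, and it assumes RVC model files use the .pth extension, which the text above does not state.

```python
from pathlib import Path


def list_rvc_models(rvcs_dir="rvcs"):
    """Pair each RVC model file with its same-named .index file, if any."""
    root = Path(rvcs_dir)
    pairs = []
    # Assumption: model files end in .pth.
    for model in sorted(root.glob("*.pth")):
        index = model.with_suffix(".index")
        pairs.append((model.name, index.name if index.exists() else None))
    return pairs
```

Each entry is a (model, index) tuple; an index of None means no identically named .index file was found next to that model.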

xtts-rvc-ui-extraoptions's People

Contributors

vali-98, 311-code
