g-thor / muskits

This project forked from sjtmusicteam/muskits

An open-source music processing toolkit

License: Apache License 2.0

Shell 16.53% Python 81.65% Perl 1.51% Makefile 0.31%

muskits's Introduction

Muskit: Open-source music processing toolkits

Muskit is an open-source music processing toolkit. Currently, we mostly focus on benchmarking end-to-end singing voice synthesis (SVS) and expect to extend to more tasks in the future. Muskit uses PyTorch as its deep learning engine and follows ESPnet- and Kaldi-style data processing and recipes to provide a complete setup for various music processing experiments. The main structure and base code are adapted from ESPnet (we expect to merge Muskit into ESPnet at a later stage).

News

The project is currently being merged into ESPnet! If you have any comments or suggestions, please feel free to discuss them either in this repo or in espnet. See espnet/espnet#4437 for details.

We have just started merging the first version into ESPnet! With some improvements to the foundational data structures, we even achieve better performance than the original Muskits. Please stay tuned for further news!

Key Features

ESPnet style complete recipe

  • Supports a number of SVS recipes across several databases (e.g., Kiritan, Oniku_db, Ofuton_db, Natsume database, CSD database)
  • On-the-fly feature extraction and text processing
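
To make "on-the-fly feature extraction" concrete, here is a minimal sketch in plain Python. It is not Muskit's actual data loader: the class `OnTheFlyDataset`, the function `frame_log_energy`, and the frame parameters are all hypothetical names chosen for illustration. The idea it shows is that only raw waveforms are stored, and features are computed each time an item is requested instead of being dumped to disk in a separate stage.

```python
import math
from typing import Callable, List, Sequence


def frame_log_energy(wave: Sequence[float], frame_len: int = 4, hop: int = 2) -> List[float]:
    """Toy feature (hypothetical): log energy per frame, standing in for a real front end."""
    feats = []
    for start in range(0, len(wave) - frame_len + 1, hop):
        frame = wave[start:start + frame_len]
        energy = sum(x * x for x in frame)
        # A small constant avoids log(0) on silent frames.
        feats.append(math.log(energy + 1e-10))
    return feats


class OnTheFlyDataset:
    """Hypothetical dataset that keeps raw waveforms and extracts features on access."""

    def __init__(
        self,
        waves: Sequence[Sequence[float]],
        extractor: Callable[[Sequence[float]], List[float]],
    ) -> None:
        self.waves = list(waves)
        self.extractor = extractor

    def __len__(self) -> int:
        return len(self.waves)

    def __getitem__(self, index: int) -> List[float]:
        # Features are computed here, at load time, rather than read from a cache.
        return self.extractor(self.waves[index])


dataset = OnTheFlyDataset([[0.0, 1.0, 0.0, -1.0, 0.0, 1.0]], frame_log_energy)
features = dataset[0]
```

The trade-off in a design like this is extra computation per epoch in exchange for no intermediate feature files and the freedom to change extraction settings without re-running a preprocessing stage.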

SVS: Singing Voice Synthesis

  • Reproducible results on several public-domain SVS corpora
  • Various network architectures for end-to-end SVS
    • RNN-based non-autoregressive model
    • Xiaoice
    • Sequence-to-sequence Transformer (with GLU-based encoder)
    • MLP singer
    • Tacotron-singing (in progress)
    • DiffSinger (to be published)
  • Multi-speaker & multilingual extension
    • Speaker ID embedding
    • Language ID embedding
    • Global style token (GST) embedding
  • Various language support
    • Jp / En / Kr / Zh
  • Integration with neural vocoders
    • The integration follows the style of the ParallelWaveGAN (PWG) repo and supports a variety of vocoders
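
The "GLU-based encoder" mentioned among the architectures above refers to the gated linear unit, which splits its input into two halves and multiplies the first half by the sigmoid of the second. The sketch below shows only that gating operation in plain Python. How Muskit wires it into its Transformer encoder is not described here, and the function names `glu` and `sigmoid` are illustrative rather than taken from the Muskit code base.

```python
import math
from typing import List, Sequence


def sigmoid(x: float) -> float:
    """Logistic function, written to avoid overflow for large negative inputs."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def glu(values: Sequence[float]) -> List[float]:
    """Gated linear unit: first half of the input gated by sigmoid of the second half."""
    if len(values) % 2 != 0:
        raise ValueError("GLU needs an even number of input values")
    half = len(values) // 2
    content, gate = values[:half], values[half:]
    return [c * sigmoid(g) for c, g in zip(content, gate)]


# A gate value of 0 gives sigmoid(0) = 0.5, so each content value is halved.
gated = glu([2.0, -4.0, 0.0, 0.0])
```

In a PyTorch model the same operation is usually applied along the channel dimension of a tensor, so the output has half as many channels as the input.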

Installation

The full installation guide is available at https://github.com/SJTMusicTeam/Muskits/wiki/Installation-Instructions

Demonstration

  • Real-time SVS demo with Muskits (available as a Colab notebook)

Pretrained models

Acoustic models are available at https://github.com/SJTMusicTeam/Muskits/blob/main/doc/pretrained_models.md

Vocoders are available at https://github.com/kan-bayashi/ParallelWaveGAN/blob/master/README.md

Running instructions

A tutorial on how to use Muskits is available at https://github.com/SJTMusicTeam/Muskits/blob/main/doc/tutorial.md

Recipe Explanation

A detailed recipe explanation is available at https://github.com/SJTMusicTeam/Muskits/blob/main/egs/TEMPLATE/svs1/README.md

muskits's People

Contributors

a-quarter-mile, cardroid, drwelles, frankxu2004, ftshijt, jerryuhoo, nan-huo, nanhuo, peterguoruc, simpleoier, south-twilight, zltttt
