GithubHelp home page GithubHelp logo

danielthupku / diffsinger Goto Github PK

View Code? Open in Web Editor NEW

This project forked from openvpi/diffsinger

0.0 0.0 0.0 67.07 MB

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

License: Apache License 2.0

Python 100.00%

diffsinger's Introduction

DiffSinger (OpenVPI maintained version)

arXiv downloads Bilibili license

This is a refactored and enhanced version of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism based on the original paper and implementation, which provides:

  • Cleaner code structure: useless and redundant files are removed and the others are re-organized.
  • Better sound quality: the sampling rate of synthesized audio are adapted to 44.1 kHz instead of the original 24 kHz.
  • Higher fidelity: improved acoustic models and diffusion sampling acceleration algorithms are integrated.
  • More controllability: introduced variance models and parameters for prediction and control of pitch, energy, breathiness, etc.
  • Production compatibility: functionalities are designed to match the requirements of production deployment and the SVS communities.
Overview Variance Model Acoustic Model
arch-overview arch-variance arch-acoustic

User Guidance

中文教程 / Chinese Tutorials: Text, Video

Progress & Roadmap

Architecture & Algorithms

TBD

Development Resources

TBD

References

Disclaimer

Any organization or individual is prohibited from using any functionalities included in this repository to generate someone's speech without his/her consent, including but not limited to government leaders, political figures, and celebrities. If you do not comply with this item, you could be in violation of copyright laws.

License

This forked DiffSinger repository is licensed under the Apache 2.0 License.

diffsinger's People

Contributors

yqzhishen avatar hrukalive avatar yxlllc avatar moonintheriver avatar oxygen-dioxide avatar sinestriker avatar flutydeer avatar autumn-2-net avatar djkcyl avatar colourfulspring avatar cyclekiller avatar utautautau avatar ms903x1 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.