GithubHelp home page GithubHelp logo

Hi there ๐Ÿ‘‹

My name is Yuancheng Wang (็Ž‹่ฟœ็จ‹). I'm a first-year Ph.D. student at the Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), supervised by Professor Zhizheng Wu. before that, I received my B.S. degree at CUHK-Shenzhen. I also collaborate with Xu Tan (่ฐญๆ—ญ) from Microsoft Research Asia.

My research interest includes text-to-speech synthesis, text-to-audio generation, and unified audio representation and generation. I am one of the main contributors and leaders of the open-sourceย Amphionย toolkit.

I have developed NaturalSpeech 3, which is an advanced text-to-speech model with factorized speech representation and modeling.

๐Ÿ”ฅ News

  • 2024.09: ๐Ÿ”ฅ We released MaskGCT, A new SOTA large-scale TTS system with masked generative models.
  • 2024.08: ๐ŸŽ‰ our papers, Amphion and Emilia got accepted by IEEE SLT 2024.
  • 2024.07: ๐Ÿ”ฅ We released Emilia, an extensive, multilingual, and diverse speech dataset for large-scale speech generation with 101k hours of speech in six languages and features diverse speech with varied speaking styles.
  • 2024.05: ๐ŸŽ‰ Our paper Factorized Diffusion Models are Natural and Zero-shot Speech Synthesizers, aka NaturalSpeech 3, got accepted by ICML 2024 as an Oral presentation!
  • 2024.03: ๐ŸŽ‰ We are delighted to release NaturalSpeech 3, which is an advanced version of the NaturalSpeech series with speech factorization. And we release FACodec checkpoints and demo in HuggingFace Amphion Space.
  • 2023.11: ๐Ÿ”ฅ We releasedย Amphion v0.1 (โญ๏ธ 4.4k+), which is an open-source toolkit for audio, music, and speech generation.
  • 2023.09: ๐ŸŽ‰ My first paper about audio generation and editing AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models got accepted by NeurIPS 2023!

๐Ÿ”— Homepages

Yuancheng0625's Projects

Yuancheng0625 doesnโ€™t have any public repositories yet.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.