inconnu11

followers: 160 following: 1.2K repos: 90 gists: 0

Type: User

Company: Tsinghua University

Bio: TTS, Voice conversion (VC), Speech representation learning

Twitter: Amy31784799

Location: Beijing

Hi there 👋

🎓 3rd-year M.S. student in the Department of Computer Science and Technology, Tsinghua University.
🔭 Currently researching style transfer TTS. Earlier research on disentanglement in VC.
💼 Research Intern: Tencent AI Lab and Huya, both supervised by Shiyin Kang; and MSRA, supervised by Frank K. Soong.

📡 Graduating in 2022 and will join Xiaomi.
📫 How to reach me: [email protected] or Twitter is probably fastest.
💬 Ask me about anything here.
⏰ 6 weeks left until graduation (01/07/2022)

inconnu11's Projects

randomcnn-voice-transfer

Audio style transfer with shallow random parameters CNN. Result: https://soundcloud.com/mazzzystar/sets/speech-conversion-sample

real-time-voice-cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

reformer-pytorch

Reformer, the efficient Transformer, in Pytorch

resemblyzer

A python package to analyze and compare voices with deep learning

resume

An elegant \LaTeX\ résumé template

speaker-embedding-with-phonetic-information

The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"

speech-emotion-classification-with-pytorch

This repository contains a PyTorch implementation of 4 different models for classifying emotions in speech.

speech-recognition-papers

Towards hot directions in industrial speech recognition

speechdecompose

speechsplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

sprocket

Voice Conversion Tool Kit

subband_wavernn

sw-sift

MATLAB implementation of the SIFT (OpenSIFT) algorithm.

tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

tacotron2-vae

Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.

trax

Trax — your path to advanced deep learning

trojan-go

A Trojan proxy written in Golang. An unidentifiable mechanism that helps you bypass GFW. Supports multiple platforms, with no dependencies.

ubisoft-laforge-daft-exprt-1

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

umap

Uniform Manifold Approximation and Projection

vae-npvc

Re-implementation of the code used in "Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder"

vc-demos

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

vc_tacotron

Voice Conversion using Tacotron.

vcc20_baseline_cyclevae

Voice Conversion Challenge 2020 CycleVAE baseline system

voice_conversion

voice_converter_cyclegan

Voice Converter Using CycleGAN and Non-Parallel Data

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

wavenet

Chainer implementation of Deepmind's WaveNet
